v0.15.2
This patch fixes a bug that resulted in prompt learning methods like P-tuning not to work (#2477).
Fetched April 7, 2026
This patch fixes a bug that resulted in prompt learning methods like P-tuning not to work (#2477).
Fetched April 7, 2026
Fixed an exponential backtracking bug in Qwen3/Qwen3.5/GLM4MoE response parsing that caused GRPOTrainer to hang indefinitely on truncated t…
Hugging Face · Fine-tuningPatch release v5.6.2 Qwen 3.5 and 3.6 MoE (text-only) were broken when using with FP8. It should now work again with this :salutingface: Fi…
Hugging Face · Transformers