* prepare_model_for_kbit_training to save VRAM by @sergiopaniego in https://github.com/huggingface/trl/pull/4335
* add_generation_prompt to processor_kwargs in GRPO and RLOO trainer by @qgallouedec in https://github.com/huggingface/trl/pull/4361
* trl.experimental by @qgallouedec in https://github.com/huggingface/trl/pull/4312
* add_generation_prompt=True for conversational only by @qgallouedec in https://github.com/huggingface/trl/pull/4362
* max_length explanation for VLM in online trainers by @sergiopaniego in https://github.com/huggingface/trl/pull/4220
* _generate in GRPO/RLOO: Move forward_kwargs outside generation method by @qgallouedec in https://github.com/huggingface/trl/pull/4154
* _generate in GRPO/RLOO: Insert images in the prompt by @qgallouedec in https://github.com/huggingface/trl/pull/4155

Full Changelog: https://github.com/huggingface/trl/compare/v0.24.0...v0.25.0
Fetched April 7, 2026