by @qgallouedec in https://github.com/huggingface/trl/pull/2565
disable_dropout by @qgallouedec in https://github.com/huggingface/trl/pull/2511PreferenceCollator to DataCollatorForPreference by @qgallouedec in https://github.com/huggingface/trl/pull/2510formatting_func's documentation in ConstantLengthDataset by @SamuelLarkin in https://github.com/huggingface/trl/pull/2549max_prompt_length parameter in tests by @qgallouedec in https://github.com/huggingface/trl/pull/2588max_seq_length instead of max_length by @skandermoalla in https://github.com/huggingface/trl/pull/2590max_prompt_length and loop usage in logp computation by @qgallouedec in https://github.com/huggingface/trl/pull/2598truncation_mode in DPOTrainer by @anakin87 in https://github.com/huggingface/trl/pull/2551num_logits_to_keep to reduce memory usage in GRPO by @qgallouedec in https://github.com/huggingface/trl/pull/2683Full Changelog: https://github.com/huggingface/trl/compare/v0.13.0...v0.14.0
Fetched April 7, 2026