SFTTrainer and PPOTrainer bug fixes_prepare_dataset function by @BeibinLi in https://github.com/lvwerra/trl/pull/464CI] Fix CI RM by @younesbelkada in https://github.com/lvwerra/trl/pull/468float instead of double to avoid issues with MPS device by @younesbelkada in https://github.com/lvwerra/trl/pull/499PPOTrainer] Add prefix tuning support by @younesbelkada in https://github.com/lvwerra/trl/pull/501PPOTrainer] Add prompt tuning support on TRL by @younesbelkada in https://github.com/lvwerra/trl/pull/500SFTTrainer] Fix the sequence length check of SFTTrainer by @younesbelkada in https://github.com/lvwerra/trl/pull/512Full Changelog: https://github.com/lvwerra/trl/compare/v0.4.6...v0.4.7
Fetched April 7, 2026