v0.7.8: Unsloth tag, DPO fixes, PEFT support for DDPO
xxxTrainer
If users use the Unsloth library, the unsloth tag is automatically pushed to the Hub.
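The tagging behaviour described above can be pictured with a minimal sketch. This is not TRL's implementation: `build_hub_tags` is a hypothetical helper, and checking whether the `unsloth` package is installed is only an assumed stand-in for "the user uses Unsloth".

```python
import importlib.util


def build_hub_tags(base_tags, unsloth_installed=None):
    """Return the tags to push, adding "unsloth" when the library is present.

    Hypothetical helper for illustration only; not part of TRL.
    """
    if unsloth_installed is None:
        # Assumption: treat "package is installed" as "user uses Unsloth".
        # find_spec locates the package without importing it.
        unsloth_installed = importlib.util.find_spec("unsloth") is not None

    tags = list(base_tags)  # copy so the caller's list is not mutated
    if unsloth_installed and "unsloth" not in tags:
        tags.append("unsloth")
    return tags
```

Passing `unsloth_installed` explicitly keeps the helper easy to test. When it is left as `None`, the result depends on the local environment.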
[xxxTrainer] Add unsloth tag by @younesbelkada in https://github.com/huggingface/trl/pull/1130
Some important fixes for DPO have been introduced to address https://twitter.com/jon_durbin/status/1743575483365699809 and to make DPO faster.
Now DDPO supports PEFT
peft in ddpo. by @sayakpaul in https://github.com/huggingface/trl/pull/1165
Full Changelog: https://github.com/huggingface/trl/compare/v0.7.7...v0.7.8
Fetched April 7, 2026