releases.shpreview

v0.7.11

v0.7.11: IPO & DPO fixes, faster data processing for multi-GPU, Automatic tagging for all models

February 16, 2024TRLView original ↗
$npx -y @buildinternet/releases show rel_AaKlfEzgHX-gyp4XbB6Ur

DPO important fixes

We fixed issues with respect to IPO loss, leading to consistent results according to newest experiements:

We also fixed important bugs with respect to DPO / PEFT and Flash Attention

Data processing is now faster for multi-GPU envs

Other DPO bugfixes:

Faster data processing and other enhancements:

Automatic tagging for all models

Models now gets tagged correctly even if users do not call trainer.push_to_hub()

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/trl/compare/v0.7.10...v0.7.11

Fetched April 7, 2026