releases.shpreview

v0.4.0

$npx -y @buildinternet/releases show rel__1m2i1wY5PKezbEkcuP8h

v0.4.0: peft integration

Apply RLHF and fine-tune your favorite large model on consumer GPU using peft and trl ! Share also easily your trained RLHF adapters on the Hub with few lines of code

With this integration you can train gpt-neo-x (20B parameter model - 40GB in bfloat16) on a 24GB consumer GPU!

What's Changed

New Contributors

Full Changelog: https://github.com/lvwerra/trl/compare/v0.3.1...v0.4.0

Fetched April 7, 2026