Patch release to fix a bug on google colab with PPOTrainer & PPOConfig + wandb
Full Changelog: https://github.com/lvwerra/trl/compare/v0.4.5...v0.4.6
Fetched April 7, 2026