doc-builder in https://github.com/lvwerra/trl/pull/59push_to_hub with PPOTrainer in https://github.com/lvwerra/trl/pull/68nbdev dependency by @younesbelkada in https://github.com/lvwerra/trl/pull/52xxxForCausalLM support by @younesbelkada in https://github.com/lvwerra/trl/pull/53VHead] Fix slow convergence issue by @younesbelkada in https://github.com/lvwerra/trl/pull/60accelerate integration by @younesbelkada in https://github.com/lvwerra/trl/pull/58PPOTrainer] make the reference model optional by @younesbelkada in https://github.com/lvwerra/trl/pull/67main by @lvwerra in https://github.com/lvwerra/trl/pull/77step method by @younesbelkada in https://github.com/lvwerra/trl/pull/76PPOTrainer] Support generic optimizers by @younesbelkada in https://github.com/lvwerra/trl/pull/78dataset attribute optional by @younesbelkada in https://github.com/lvwerra/trl/pull/85v_head when using AutoModelForCausalLMWithValueHead by @younesbelkada in https://github.com/lvwerra/trl/pull/86wandb dependency by @younesbelkada in https://github.com/lvwerra/trl/pull/92dev0 unless it is a release version by @mishig25 in https://github.com/lvwerra/trl/pull/99core] Advise to use fbs=1 by @younesbelkada in https://github.com/lvwerra/trl/pull/102Full Changelog: https://github.com/lvwerra/trl/commits/v0.2.0