releases.shpreview

4.54.1

Patch release 4.54.1

$npx -y @buildinternet/releases show rel_hRJaYh704W2VXOBGlnTVZ

We had quite a lot of bugs that got through! Release was a bit rushed, sorry everyone! ๐Ÿค— Mostly cache fixes, as we now have layered cache, and fixed to distributed.

  • Fix Cache.max_cache_len max value for Hybrid models, @manueldeprada, @Cyrilvallez, #39737
  • [modenbert] fix regression, @zucchini-nlp, #39750
  • Fix version issue in modeling_utils.py, @Cyrilvallez, #39759
  • Fix GPT2 with cross attention, @zucchini-nlp, #39754
  • Fix mamba regression, @manueldeprada, #39728
  • Fix: add back base model plan, @S1ro1, #39733
  • fix cache inheritance, #39748
  • Fix cache-related tests, @zucchini-nlp, #39676
  • Fix Layer device placement in Caches, @Cyrilvallez, #39732
  • PATCH: add back n-dim device-mesh + fix tp trainer saving, @S1ro1, @SunMarc, #39693
  • fix missing model._tp_size from ep refactor, @winglian, #39688

Fetched April 7, 2026