releases.shpreview

v2.3.1

$npx -y @buildinternet/releases show rel_egkXtuVMd2ErXz2Bm0BYm

Important changes

  • Added support for Mllama (3.2, vision models). Flashed, unpadded.
  • FP8 performance improvements
  • Moe performance improvements
  • BREAKING CHANGE - When using tools, models could answer with a tool call notify_error with the content error, it will instead output regular generation.

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v2.3.0...v2.3.1

Fetched April 7, 2026