releases.shpreview

v2.4.1

$npx -y @buildinternet/releases show rel_RfpZ_og2OR7Y_jUITQS9L

Notable changes

  • Choose input/total tokens automatically based on available VRAM
  • Support Qwen2 VL
  • Decrease latency of very large batches (> 128)

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v2.3.0...v2.4.1

Fetched April 7, 2026