v3.2.2
What's Changed
- Minor fixes. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3125
- configurable termination timeout by @ErikKaum in https://github.com/huggingface/text-generation-inference/pull/3126
- CI: enable server tests for backends by @baptistecolle in https://github.com/huggingface/text-generation-inference/pull/3128
- Torch 2.6 by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3134
- Gaudi: Fix llava-next and mllama crash issue by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3127
- nix-v3.2.1 -> v3.2.1-nix by @co42 in https://github.com/huggingface/text-generation-inference/pull/3129
- Gaudi: Use exponential growth to replace BATCH_BUCKET_SIZE by @yuanwu2017 in https://github.com/huggingface/text-generation-inference/pull/3131
- Add llama4 by @mht-sharma in https://github.com/huggingface/text-generation-inference/pull/3145
- Preparing for release. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3147
New Contributors
- @co42 made their first contribution in https://github.com/huggingface/text-generation-inference/pull/3129
Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v3.2.1...v3.2.2

