v3.2.1
What's Changed
- Update to
kernels0.2.1 by @danieldk in https://github.com/huggingface/text-generation-inference/pull/3084 - Router: add
gemma3-textmodel type by @danieldk in https://github.com/huggingface/text-generation-inference/pull/3107 - We need gcc during runtime to enable triton to compile kernels. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3103
- Release of Gaudi Backend for TGI by @baptistecolle in https://github.com/huggingface/text-generation-inference/pull/3091
- Fixing the docker build. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3108
- Make the Nix-based Docker container work on non-NixOS by @danieldk in https://github.com/huggingface/text-generation-inference/pull/3109
- xpu 2.6 update by @sywangyi in https://github.com/huggingface/text-generation-inference/pull/3051
- launcher: correctly get the head dimension for VLMs by @danieldk in https://github.com/huggingface/text-generation-inference/pull/3116
- Gaudi: Sync TGI with the latest changes from the TGI-Gaudi fork by @baptistecolle in https://github.com/huggingface/text-generation-inference/pull/3117
- Bug Fix: Sliding Window Attention by @mht-sharma in https://github.com/huggingface/text-generation-inference/pull/3112
- Publish nix docker image. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3122
- Prepare for patch release. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3124
- Intel docker. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/3121
Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v3.2.0...v3.2.1
Fetched April 7, 2026



