This release updates TGI to Torch 2.7 and CUDA 12.8.
round_up_seq logic to align with prefill warmup phase on… by @kaixuanliu in https://github.com/huggingface/text-generation-inference/pull/3224

Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v3.3.0...v3.3.1