* --cuda-graphs 0 work as expected (bis) by @fxmarty in https://github.com/huggingface/text-generation-inference/pull/1768
* GenerateParameters by @Wauplin in https://github.com/huggingface/text-generation-inference/pull/1798
* HF_HUB_OFFLINE support in the router. by @Narsil in https://github.com/huggingface/text-generation-inference/pull/1789
* tool_prompt parameter to Python client by @maziyarpanahi in https://github.com/huggingface/text-generation-inference/pull/1825

Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v2.0.1...v2.0.2