decoder_input_details on OpenAI-compatible chat streaming, pass temp and top-k from API by @EndlessReform in https://github.com/huggingface/text-generation-inference/pull/1470
/tokenize route to get the tokenized input by @Narsil in https://github.com/huggingface/text-generation-inference/pull/1471

Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v1.3.4...v1.4.0
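
For readers who want to try the /tokenize route listed above, here is a minimal sketch of a client call. The base URL, the helper names `build_tokenize_request` and `tokenize`, and the `{"inputs": ...}` request body are assumptions made for illustration and are not stated in these notes; check the server's API documentation for the exact schema.

```python
import json
import urllib.request

# Assumption: a text-generation-inference server is listening at this address.
BASE_URL = "http://localhost:8080"


def build_tokenize_request(text: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request for the /tokenize route."""
    # Assumption: the route accepts a JSON body of the form {"inputs": "<text>"}.
    body = json.dumps({"inputs": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/tokenize",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def tokenize(text: str, base_url: str = BASE_URL):
    """Send the request and return the decoded JSON response unchanged."""
    with urllib.request.urlopen(build_tokenize_request(text, base_url)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `tokenize("Hello world")` against a running server should return whatever JSON the route produces; the sketch does not assume a particular response shape.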
Fetched April 7, 2026