Features
- router: support vectorized warpers in flash causal lm (co-authored by @jlamypoirier)
- proto: decrease IPC proto size
- benchmarker: add summary tables
- server: support RefinedWeb models
Fixes
- server: fix issue when loading AutoModelForSeq2SeqLM models (contributed by @CL-Shang)
New Contributors
- @CL-Shang
Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v0.7.0...v0.8.0