Text Generation Inference
$ npx -y @buildinternet/releases show text-generation-inference
Releases: 2
Avg: 0/wk
Versions: v3.3.6 → v3.3.7
All Releases
Mar 26, 2023
v0.4.1
Features
server: New faster GPTNeoX implementation based on flash attention
Fix
server: fix input-length discrepancy between Rust and Python tokenizers
Mar 9, 2023
v0.4.0
Features
router: support best_of sampling
router: support left truncation
server: support typical sampling
launcher: allow local models
clients: add text-generation Python client
launcher: allow parsing num_shard from CUDA_VISIBLE_DEVICES
Fix
server: do not warp prefill logits
server: fix formatting issues in generate_stream tokens
server: fix galactica batch
server: fix index out of range issue with watermarking
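The v0.4.0 launcher entry about parsing num_shard from CUDA_VISIBLE_DEVICES can be illustrated with a minimal sketch. It assumes the shard count is simply the number of comma-separated device entries; the function name is hypothetical and this is not the launcher's actual code.

```python
def num_shard_from_visible_devices(value):
    """Count the device entries in a CUDA_VISIBLE_DEVICES-style string.

    Assumption: one shard per listed device. Returns None when the
    variable is unset or lists no devices, so the caller can fall back
    to its own default.
    """
    if value is None:
        return None
    # Ignore empty entries such as a trailing comma or stray whitespace.
    devices = [entry.strip() for entry in value.split(",") if entry.strip()]
    return len(devices) or None


print(num_shard_from_visible_devices("0,1,2,3"))
```

In practice the string would come from os.environ.get("CUDA_VISIBLE_DEVICES"); passing it in as an argument keeps the helper easy to test.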
Mar 3, 2023
v0.3.2
Features
router: add support for huggingface api-inference
server: add logits watermark with "A Watermark for Large Language Models"
server: use a fixed transformers commit
Fix
launcher: add missing parameters to launcher
server: update to hf_transfer==0.1.2 to fix corrupted files issue
Feb 24, 2023
v0.3.1
Features
server: allocate full attention mask to decrease latency
server: enable hf-transfer for insane download speeds
router: add CORS options
Fix
server: remove position_ids from galactica forward
Feb 16, 2023
v0.3.0
Features
server: support t5 models
router: add max_total_tokens and empty_input validation
launcher: add the possibility to disable custom CUDA kernels
server: add automatic safetensors conversion
router: add prometheus scrape endpoint
server, router: add distributed tracing
Fix
launcher: copy current env vars to subprocesses
docker: add note around shared memory
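The v0.3.0 router entry on max_total_tokens and empty_input validation might look roughly like the sketch below. It assumes max_total_tokens bounds the sum of prompt tokens and max_new_tokens; the function, the exception class, and the error messages are hypothetical and not taken from the router.

```python
class ValidationError(ValueError):
    """Raised when a generation request fails validation (hypothetical)."""


def validate_request(inputs, input_length, max_new_tokens, max_total_tokens):
    """Reject empty prompts and requests that exceed the token budget.

    Assumption: the budget covers prompt tokens plus requested new tokens.
    Returns the total token count when the request is acceptable.
    """
    if not inputs:
        raise ValidationError("inputs cannot be empty")
    total = input_length + max_new_tokens
    if total > max_total_tokens:
        raise ValidationError(
            f"input tokens + max_new_tokens must be <= {max_total_tokens}, got {total}"
        )
    return total


print(validate_request("Hello", input_length=1, max_new_tokens=20, max_total_tokens=1512))
```

Checking both conditions before a request is queued lets a server reject oversized requests early instead of failing during generation.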
Feb 7, 2023
v0.2.1
Fix
server: fix bug with repetition penalty when using GPUs and inference mode
Feb 3, 2023
v0.2.0
Features
router: support token streaming using Server-Sent Events
router: support seeding
server: support gpt-neox
server: support santacoder
server: support repetition penalty
server: allow the server to use a local weight cache
Breaking changes
router: refactor Token API
router: modify /generate API to only return generated text
Misc
router: use background task to manage request queue
ci: docker build/push on update
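For the repetition penalty listed under v0.2.0, here is a minimal sketch of one common formulation: scores of already-generated tokens are divided by the penalty when positive and multiplied by it when negative. That the server uses exactly this rule is an assumption, and the function name is hypothetical.

```python
def apply_repetition_penalty(logits, previous_token_ids, penalty):
    """Return a copy of logits with previously seen tokens made less likely.

    Assumption: a penalty > 1.0 discourages repetition. Positive scores
    are divided by the penalty and negative scores are multiplied by it,
    so the adjusted score always moves downward.
    """
    penalized = list(logits)
    for token_id in set(previous_token_ids):
        score = penalized[token_id]
        penalized[token_id] = score * penalty if score < 0 else score / penalty
    return penalized


print(apply_repetition_penalty([2.0, -1.0, 0.5], [0, 1], 2.0))
```

Handling the sign separately matters: dividing a negative score by a penalty above 1.0 would raise it and make the repeated token more likely.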
Latest: v3.3.7
Source: @huggingface/text-generation-inference
Tracking since: Feb 3, 2023
Last checked: Apr 21, 2026