releases.shpreview
Hugging Face/Text Embeddings Inference

Text Embeddings Inference

$npx -y @buildinternet/releases show text-embeddings-inference
Mon
Wed
Fri
AprMayJunJulAugSepOctNovDecJanFebMarApr
Less
More
Releases3Avg0/wkVersionsv1.9.0 → v1.9.2
Apr 16, 2024

What's Changed

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.2.1...v1.2.2

Apr 15, 2024

TEI is now Apache 2.0!

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.2.0...v1.2.1

Mar 22, 2024

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.1.0...v1.2.0

Mar 1, 2024

Highlights

  • Splade pooling

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.0.0...v.1.1.0

Feb 23, 2024

Highlights

  • Support for Nomic models
  • Support for Flash Attention for Jina models
  • Metal backend for M* users
  • /tokenize route to directly access the internal TEI tokenizer
  • /embed_all route to allow client level pooling

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v0.6.0...v1.0.0

Nov 30, 2023

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v0.5.0...v0.6.0

Nov 20, 2023
Nov 15, 2023

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v0.3.0...v0.4.0

Oct 13, 2023
  • No compilation step
  • Dynamic shapes
  • Small docker images and fast boot times. Get ready for true serverless!
  • Token based dynamic batching
  • Optimized transformers code for inference using Flash Attention, Candle and cuBLASLt
  • Safetensors weight loading
  • Production ready (distributed tracing with Open Telemetry, Prometheus metrics)
Latest
v1.9.3
Tracking Since
Oct 13, 2023
Last fetched Apr 18, 2026