v1.7.1
What's Changed
- [Docs] Update quick tour by @NielsRogge in https://github.com/huggingface/text-embeddings-inference/pull/574
- Update
README.mdandsupported_models.mdby @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/572 - Back with linting. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/577
- [Docs] Add cloud run example by @NielsRogge in https://github.com/huggingface/text-embeddings-inference/pull/573
- Fixup by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/578
- Fixing the tokenization routes token (offsets are in bytes, not in by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/576
- Removing requirements file. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/585
- Removing candle-extensions to live on crates.io by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/583
- Bump
sccacheto 0.10.0 andsccache-actionto 0.0.9 by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/586 - optimize the performance of FlashBert Path for HPU by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/575
- Revert "Removing requirements file. (#585)" by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/588
- Get opentelemetry trace id from request headers by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/425
- Add argument for configuring Prometheus port by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/589
- Adding missing
head.prefix in the weight name inModernBertClassificationHeadby @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/591 - Fixing the CI (grpc path). by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/593
- fix xpu env issue that cannot find right libur_loader.so.0 by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/595
- enable flash mistral model for HPU device by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/594
- remove optimum-habana dependency by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/599
- Support NomicBert MoE by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/596
- Remove duplicate short option '-p' to fix router executable by @cebtenzzre in https://github.com/huggingface/text-embeddings-inference/pull/602
- Update
text-embeddings-router --helpoutput by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/603 - Warmup padded models too. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/592
- Add support for JinaAI Re-Rankers V1 by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/582
- Gte diffs by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/604
- Fix the weight name in GTEClassificationHead by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/606
- upgrade pytorch and ipex to 2.7 version by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/607
- upgrade HPU FW to 1.21; upgrade transformers to 4.51.3 by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/608
- Patch DistilBERT variants with different weight keys by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/614
- add offline modeling for model
jinaai/jina-embeddings-v2-base-codeto avoidauto_mapto other repository by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/612 - Add mean pooling strategy for Modernbert classifier by @kwnath in https://github.com/huggingface/text-embeddings-inference/pull/616
- Using serde for pool validation. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/620
- Preparing the update to 1.7.1 by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/623
New Contributors
- @NielsRogge made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/574
- @cebtenzzre made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/602
- @kwnath made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/616
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.7.0...v1.7.1
Fetched April 7, 2026
