v1.7.0
Notable changes
- Upgrade dependencies heavily (candle 0.5 -> 0.8 and related)
- Added ModernBert support by @kozistr!
What's Changed
- Moving cublaslt into TEI extension for easier upgrade of candle globally by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/542
- Upgrade candle2 by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/543
- Upgrade candle3 by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/545
- Fixing the static-linking. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/547
- Fix linking bis by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/549
- Make `sliding_window` for `Qwen2` optional by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/546
- Optimize the performance of FlashBert on HPU by using fast mode softmax by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/555
- Fixing cudarc to the latest unified bindings. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/558
- Fix typos / formatting in CLI args in Markdown files by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/552
- Use custom `serde` deserializer for JinaBERT models by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/559
- Implement the `ModernBert` model by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/459
- Fixing FlashAttention ModernBert. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/560
- Enable ModernBert on metal by @ivarflakstad in https://github.com/huggingface/text-embeddings-inference/pull/562
- Fix `{Bert,DistilBert}SpladeHead` when loading from Safetensors by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/564
- add related docs for intel cpu/xpu/hpu container by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/550
- Update the doc for submodule. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/567
- Update `docs/source/en/custom_container.md` by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/568
- Preparing for release 1.7.0 (candle update + modernbert). by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/570
New Contributors
- @ivarflakstad made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/562
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.6.1...v1.7.0
