v1.7.2
Notable change
- Added support for Qwen3 embeddigns
What's Changed
- Adding suggestions to fixing missing ONNX files. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/624
- Add
Qwen3Modelby @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/627 - Add
HiddenAct::Silu(removeserdealias) by @alvarobartt in https://github.com/huggingface/text-embeddings-inference/pull/631 - Add CPU support for Qwen3-Embedding models by @randomm in https://github.com/huggingface/text-embeddings-inference/pull/632
- refactor the code and add wrap_in_hpu_graph to corner case by @kaixuanliu in https://github.com/huggingface/text-embeddings-inference/pull/625
- Support Qwen3 w/ fp32 on GPU by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/634
- Preparing the release. by @Narsil in https://github.com/huggingface/text-embeddings-inference/pull/639
New Contributors
- @randomm made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/632
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.7.1...v1.7.2
Fetched April 7, 2026

