v1.3.0

Notable changes

New truncation direction parameter
CUDA support for JinaCode model architecture
CUDA support for Mistral model architecture
CUDA support for Alibaba GTE model architecture
New prompt name parameter: you can now pass a prompt name in the body of your request to prepend a pre-prompt to your input, based on the model's Sentence Transformers configuration. You can also set a default prompt or a default prompt name so that a pre-prompt is always added to your requests.
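
As a rough illustration of the two new request parameters, the sketch below builds a request body that sets both. The field names `prompt_name`, `truncate` and `truncation_direction`, the value `"left"`, and the prompt name `"query"` are assumptions made for this example and are not confirmed by these notes; check the API documentation of your deployed version for the exact spelling and accepted values. The helper `build_embed_payload` is hypothetical and only assembles JSON; it does not send anything.

```python
import json
from typing import Optional


def build_embed_payload(
    inputs: str,
    prompt_name: Optional[str] = None,
    truncation_direction: Optional[str] = None,
) -> dict:
    """Build a JSON-serializable request body (hypothetical helper).

    Optional fields are left out when unset so the server applies its defaults.
    """
    payload = {"inputs": inputs}
    if prompt_name is not None:
        # Assumed field name: selects a pre-prompt defined in the model's
        # Sentence Transformers configuration.
        payload["prompt_name"] = prompt_name
    if truncation_direction is not None:
        # Assumed field names: enable truncation and choose which side is cut.
        payload["truncate"] = True
        payload["truncation_direction"] = truncation_direction
    return payload


body = build_embed_payload(
    "What is deep learning?",
    prompt_name="query",          # assumed prompt name, depends on the model
    truncation_direction="left",  # assumed value
)
print(json.dumps(body, indent=2))
```

The resulting JSON would then be posted to the server's embedding route with any HTTP client. Omitting both optional arguments yields a body with only `inputs`, in which case any configured default prompt would apply.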

What's Changed

Ci migration to K8s by @glegendre01 in https://github.com/huggingface/text-embeddings-inference/pull/269
chore: map compute_cap from GPU name by @haixiw in https://github.com/huggingface/text-embeddings-inference/pull/276
chore: cover Nvidia T4/L4 GPU by @haixiw in https://github.com/huggingface/text-embeddings-inference/pull/284
feat(ci): add trufflehog secrets detection by @McPatate in https://github.com/huggingface/text-embeddings-inference/pull/286
Community contribution code of conduct by @LysandreJik in https://github.com/huggingface/text-embeddings-inference/pull/291
Update README.md by @michaelfeil in https://github.com/huggingface/text-embeddings-inference/pull/277
Upgrade tokenizers to 0.19.1 to deal with breaking change in tokenizers by @scriptator in https://github.com/huggingface/text-embeddings-inference/pull/266
Add env for OTLP service name by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/285
Fix CI build timeout by @fxmarty in https://github.com/huggingface/text-embeddings-inference/pull/296
fix(router): payload limit was not correctly applied by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/298
feat(candle): better cuda error by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/300
feat(router): add truncation direction parameter by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/299
Support for Jina Code model by @patricebechard in https://github.com/huggingface/text-embeddings-inference/pull/292
feat(router): add base64 encoding_format for OpenAI API by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/301
fix(candle): fix FlashJinaCodeModel by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/302
fix: use malloc_trim to cleanup pages by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/307
feat(candle): add FlashMistral by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/308
feat(candle): add flash gte by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/310
feat: add default prompts by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/312
Add optional CORS allow any option value in http server cli by @kir-gadjello in https://github.com/huggingface/text-embeddings-inference/pull/260
Update HUGGING_FACE_HUB_TOKEN to HF_API_TOKEN in README by @kevinhu in https://github.com/huggingface/text-embeddings-inference/pull/263
v1.3.0 by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/313

New Contributors

@haixiw made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/276
@McPatate made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/286
@LysandreJik made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/291
@michaelfeil made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/277
@scriptator made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/266
@fxmarty made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/296
@patricebechard made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/292
@kir-gadjello made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/260
@kevinhu made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/263

Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.2.3...v1.3.0
