Notable changes
- New truncation direction parameter
- Cuda support for JinaCode model architecture
- Cuda support for Mistral model architecture
- Cuda support for Alibaba GTE model architecture
- New prompt name parameter: you can now add a prompt name to the body of your request to add a pre-prompt to your input, based on the Sentence Transformers configuration. You can also set a default prompt / prompt name to always add a pre-prompt to your requests.
What's Changed
New Contributors
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.2.3...v1.3.0