When `ghcr.io/huggingface/text-embeddings-inference:cuda-1.9` was released with CUDA 12.9 and `cuda-compat-12-9`, running that same container on instances with CUDA 13.0+ failed: the `cuda-compat-12-9` path set in `LD_LIBRARY_PATH` led to `CUDA_ERROR_SYSTEM_DRIVER_MISMATCH = 803`. This is now solved with a custom entrypoint that dynamically adds `cuda-compat` to `LD_LIBRARY_PATH` depending on the instance's CUDA version.
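The decision the entrypoint makes can be sketched roughly as follows. This is only an illustration of the idea, not the entrypoint shipped in the image: the function names, the compat directory `/usr/local/cuda-12.9/compat`, and the assumption that the host's CUDA version arrives as a `"major.minor"` string are all hypothetical.

```python
def parse_cuda_version(text: str) -> tuple[int, int]:
    """Turn a 'major.minor' string such as '13.0' into a comparable tuple."""
    major, minor = text.strip().split(".")[:2]
    return int(major), int(minor)


def build_ld_library_path(
    driver_cuda: str,
    image_cuda: str = "12.9",
    compat_dir: str = "/usr/local/cuda-12.9/compat",  # assumed install path
    current: str = "",
) -> str:
    """Return an LD_LIBRARY_PATH that includes compat_dir only when needed.

    The compat libraries are only useful when the host driver supports an
    older CUDA version than the one the image was built with. On a newer
    host they are left out, since loading them is what caused the mismatch.
    """
    # Drop empty entries and any earlier copy of the compat directory.
    parts = [p for p in current.split(":") if p and p != compat_dir]
    if parse_cuda_version(driver_cuda) < parse_cuda_version(image_cuda):
        parts.insert(0, compat_dir)
    return ":".join(parts)


if __name__ == "__main__":
    print(build_ld_library_path("12.4", current="/usr/lib"))
    print(build_ld_library_path("13.0", current="/usr/lib"))
```

Comparing versions as integer tuples rather than as strings avoids ordering mistakes such as `"12.10" < "12.9"`.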
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.9.0...v1.9.1
Fetched April 7, 2026