Update Text Embedding Inference container image

Hugging Face (https://github.com/huggingface/text-embeddings-inference) has released a Text Embeddings Inference (TEI) image that supports the new Qwen embedding models. I really need them for our project, but the latest tei-huggingface image provided by AWS is only 1.7, which does not support these models.

Could you please add this support?

Also, I couldn't find any information on the availability of the TEI images anywhere in the AWS Deep Learning Containers repository.

2 Answers

This can be a big topic. In short, you can start by deploying the latest Qwen model manually:

model=Qwen/Qwen3-Embedding-0.6B
volume=$PWD/data

docker run --gpus all -p 8080:80 -v $volume:/data --pull always \
ghcr.io/huggingface/text-embeddings-inference:1.7 \
--model-id $model

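The --pull always flag makes Docker fetch the most recent image published under the 1.7 tag from Hugging Face’s GitHub Container Registry, and --model-id tells TEI to download and load the Qwen model at startup. If that tag does not support the model, substitute a newer release tag.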
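Once the container is up, a quick request against TEI's /embed route confirms the model actually loaded. The sketch below assumes the port mapping from the command above (8080 on the host); the helper names `embed_payload` and `embed` and the `TEI_URL` variable are made up for illustration, and the payload builder does no JSON escaping, so it only suits plain text without quotes or backslashes.

```shell
# Assumption: the container started above is reachable on localhost:8080.
TEI_URL="${TEI_URL:-http://127.0.0.1:8080}"

# Build the JSON body for the /embed route (hypothetical helper, no escaping).
embed_payload() {
  printf '{"inputs": "%s"}' "$1"
}

# POST the text to /embed; the response should be a JSON array of vectors.
embed() {
  curl -sS "$TEI_URL/embed" \
    -X POST \
    -H 'Content-Type: application/json' \
    -d "$(embed_payload "$1")"
}

# Usage (requires the running container):
#   embed "What is Deep Learning?"
```

If the request fails, the container logs (docker logs) usually show whether the image version rejected the model architecture.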

EXPERT
answered 10 months ago
  • Thanks @Kidd Ip. I used https://sagemaker.readthedocs.io/en/stable/frameworks/huggingface/sagemaker.huggingface.html#sagemaker.huggingface.HuggingFace to deploy the model, and it requires AWS images; it does not accept images from ghcr.io/huggingface.

    I have noticed that AWS is pretty slow to update its images to match new releases of Hugging Face Text Generation Inference and Text Embeddings Inference.

Hey,

Hope you're keeping well.

AWS Deep Learning Containers are versioned and published on the AWS Deep Learning Container images page, but they may lag behind upstream releases from Hugging Face. If you need Qwen embedding support immediately, you can build a custom SageMaker inference container by starting from the Hugging Face text-embeddings-inference image on GHCR and adding your required model, then push it to Amazon ECR for deployment. This approach lets you run the latest TEI version in SageMaker or ECS without waiting for the official AWS DLC update.
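As a rough sketch of the pull, tag, and push part of that workflow: the account ID, region, and repository name below are placeholders, the function names are hypothetical, and the ECR repository is assumed to exist already. The tag is simply the one mentioned in this thread; replace it with a release that supports your model. SageMaker expects a container to answer /ping and /invocations on port 8080, so check that the image you mirror meets that contract before deploying it.

```shell
# Placeholders: replace with your own account, region, and repository.
ACCOUNT_ID="123456789012"
REGION="us-east-1"
REPO="tei-custom"
TAG="1.7"   # tag from this thread; pick a release that supports your model
SOURCE_IMAGE="ghcr.io/huggingface/text-embeddings-inference:${TAG}"

# Compose the ECR image URI (hypothetical helper).
ecr_uri() {
  printf '%s.dkr.ecr.%s.amazonaws.com/%s:%s' "$1" "$2" "$3" "$4"
}

# Pull from GHCR, retag, and push to ECR. Not invoked automatically.
mirror_image() {
  target="$(ecr_uri "$ACCOUNT_ID" "$REGION" "$REPO" "$TAG")"
  # Authenticate Docker against the ECR registry.
  aws ecr get-login-password --region "$REGION" |
    docker login --username AWS --password-stdin \
      "${ACCOUNT_ID}.dkr.ecr.${REGION}.amazonaws.com"
  docker pull "$SOURCE_IMAGE"
  docker tag "$SOURCE_IMAGE" "$target"
  docker push "$target"
}

# Usage: mirror_image
```

The resulting ECR URI can then be supplied wherever the deployment tooling takes a custom image URI.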

Thanks and regards,
Taz

answered 6 months ago
