- Newest
- Most votes
- Most comments
This can be a bug topic, in short, you may start to deploy the latest Qwen model manually:
model=Qwen/Qwen3-Embedding-0.6B
volume=$PWD/data
docker run --gpus all -p 8080:80 -v $volume:/data --pull always \
ghcr.io/huggingface/text-embeddings-inference:1.7 \
--model-id $model
This pulls the latest image from Hugging Face’s GitHub Container Registry and loads the Qwen model directly.
Hey,
Hope you're keeping well.
AWS Deep Learning Containers are versioned and published on the AWS Deep Learning Container images page, but they may lag behind upstream releases from Hugging Face. If you need Qwen embedding support immediately, you can build a custom SageMaker inference container by starting from the Hugging Face text-embeddings-inference image on GHCR and adding your required model, then push it to Amazon ECR for deployment. This approach lets you run the latest TEI version in SageMaker or ECS without waiting for the official AWS DLC update.
Thanks and regards,
Taz
Relevant content
- AWS OFFICIALUpdated a year ago

Thanks @Kidd Ip. I used
https://sagemaker.readthedocs.io/en/stable/frameworks/huggingface/sagemaker.huggingface.html#sagemaker.huggingface.HuggingFaceto deploy the model and it needs AWS images, without accepting any images from ghcr.io/huggingface.I noticed that AWS is pretty slow in updating its mages that match updates in huggingface Text Generation Inference and Text Embedding Inference.