Specifying model version in sagemaker endpointconfig

0

I'm trying to link a specific model version to a sagemaker inference endpoint. I can see that the model details are specified in the endpointconfig. How do add a model version to it? I want to deploy a specific version from the model registry to the inference endpoint.

https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateEndpointConfig.html

Thanks!

vik
asked 9 months ago176 views
1 Answer
1
Accepted Answer

Hi, ProductionVariant is the key parameter to define the model associated to the endpoint, hence its version in the doc that you point to. See on this page:

you define a ProductionVariant, for each model that you want to deploy. Each 
ProductionVariant parameter also describes the resources that you want SageMaker
 to provision. This includes the number and type of ML compute instances to deploy.

You probably should read this blog post to setup a multi-endpoint / multi-model inference service on Sagemaker: https://aws.amazon.com/blogs/machine-learning/part-3-model-hosting-patterns-in-amazon-sagemaker-run-and-optimize-multi-model-inference-with-amazon-sagemaker-multi-model-endpoints/

Indeed, I'd recommend the full series: go at bottom to access them all.

Best,

Didier

profile pictureAWS
EXPERT
answered 9 months ago
  • Thanks Didier_Durand. I had confused models with model package. I think I'm good for now.

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions