Are Graviton instance types not supported for SageMaker multimodel endpoints?

0

I am trying to create a multimodel endpoint using an ml.c7g.xlarge instance type. The error I get is: “MultiModel mode is not supported for instance type ml.c7g.xlarge.” However, according to the documentation on AWS, “Multi-model endpoints are currently supported for all CPU instances types and on single-GPU instance types.” (https://docs.aws.amazon.com/sagemaker/latest/dg/multi-model-endpoints.html#multi-model-endpoint-instance). Subsequently, I tried ml.c6g.xlarge and see that that also is not supported. Are Graviton instance types not supported in general? If so, why?

질문됨 일 년 전538회 조회
1개 답변
0

Hi,

Unfortunately, Graviton based instances are not supported for multi-model endpoints in Amazon SageMaker.

Multi-model endpoints currently support all the CPU instance types and on single-GPU instance types. -> Please find the below list of Compute optimized instance types (no GPUs): ml.c5.large ml.c5.xlarge ml.c5.2xlarge ml.c5.4xlarge ml.c5.9xlarge ml.c5.12xlarge ml.c5.18xlarge ml.c5.24xlarge -> Supported instances for multi-model endpoints using GPU backed instances: [+] https://docs.aws.amazon.com/sagemaker/latest/dg/multi-model-endpoints.html#multi-model-support-gpu

AWS
지원 엔지니어
답변함 일 년 전
  • Thanks. However, the documentation says all CPU types. Are Graviton instance types not CPU instance types? Is the documentation wrong? Can you give context on why Graviton instance types are not enabled? Is this a feature that may become available in the future?

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠