1 Answer
- Newest
- Most votes
- Most comments
0
Hi,
Unfortunately, Graviton based instances are not supported for multi-model endpoints in Amazon SageMaker.
Multi-model endpoints currently support all the CPU instance types and on single-GPU instance types. -> Please find the below list of Compute optimized instance types (no GPUs): ml.c5.large ml.c5.xlarge ml.c5.2xlarge ml.c5.4xlarge ml.c5.9xlarge ml.c5.12xlarge ml.c5.18xlarge ml.c5.24xlarge -> Supported instances for multi-model endpoints using GPU backed instances: [+] https://docs.aws.amazon.com/sagemaker/latest/dg/multi-model-endpoints.html#multi-model-support-gpu
Relevant content
- asked 2 years ago
- AWS OFFICIALUpdated 9 months ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated a year ago
Thanks. However, the documentation says all CPU types. Are Graviton instance types not CPU instance types? Is the documentation wrong? Can you give context on why Graviton instance types are not enabled? Is this a feature that may become available in the future?