1 Answer
Unfortunately, GPU-based inference isn't currently supported on SageMaker Serverless Inference. From the feature exclusions section of the serverless endpoints documentation:
Some of the features currently available for SageMaker Real-time Inference are not supported for Serverless Inference, including GPUs, AWS marketplace model packages, private Docker registries, Multi-Model Endpoints, VPC configuration, network isolation, data capture, multiple production variants, Model Monitor, and inference pipelines.
Link here: https://docs.aws.amazon.com/sagemaker/latest/dg/serverless-endpoints.html
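To make the difference concrete, here is a minimal sketch of the two kinds of production variant you could pass to `create_endpoint_config`. The helper names (`serverless_variant`, `gpu_realtime_variant`), the default values, and the `ml.g4dn.xlarge` instance type are my own illustrative choices, not something from the docs page above. The point it shows is that a serverless variant has no `InstanceType` field at all, so there is nowhere to request a GPU; for GPU inference you would fall back to a real-time variant.

```python
# Memory sizes that ServerlessConfig accepts, as I understand the API
# reference (assumption: check the current docs before relying on this).
SERVERLESS_MEMORY_MB = {1024, 2048, 3072, 4096, 5120, 6144}


def serverless_variant(model_name, memory_mb=2048, max_concurrency=5):
    """Build a serverless production variant (hypothetical helper).

    Only memory and concurrency are configurable; there is no
    InstanceType key, which is why a GPU cannot be selected.
    """
    if memory_mb not in SERVERLESS_MEMORY_MB:
        raise ValueError(f"unsupported serverless memory size: {memory_mb}")
    return {
        "VariantName": "AllTraffic",
        "ModelName": model_name,
        "ServerlessConfig": {
            "MemorySizeInMB": memory_mb,
            "MaxConcurrency": max_concurrency,
        },
    }


def gpu_realtime_variant(model_name, instance_type="ml.g4dn.xlarge",
                         instance_count=1):
    """Build a real-time variant on a GPU instance (hypothetical helper)."""
    return {
        "VariantName": "AllTraffic",
        "ModelName": model_name,
        "InstanceType": instance_type,
        "InitialInstanceCount": instance_count,
    }


# Either dict would then go to boto3, for example:
#   boto3.client("sagemaker").create_endpoint_config(
#       EndpointConfigName="my-config",            # placeholder name
#       ProductionVariants=[gpu_realtime_variant("my-model")],
#   )
```

The boto3 call is left as a comment so the sketch runs without AWS credentials; `my-config` and `my-model` are placeholders.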
answered a year ago