2 Answers
- Newest
- Most votes
- Most comments
0
Hi,
Why don't you try using SageMaker Serverless Inference instead ? It's purely serverless in nature so you pay only when the endpoint is serving inference.
See https://docs.aws.amazon.com/sagemaker/latest/dg/serverless-endpoints.html
Wouldn't that be a better solution for your use case?
Best,
Didier
0
AWS SageMaker Real-Time Inference: scaling down to zero feature is now launched: https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-sagemaker-scale-down-zero-ai-inference-save-costs/
answered 7 months ago
Relevant content
- asked a year ago
- asked 2 years ago
- AWS OFFICIALUpdated 3 months ago
- AWS OFFICIALUpdated 3 months ago
Hello Didier,
Thank you for your answer. I have a few questions regarding SageMaker Serverless Inference:
Thank you for your help!