2 Antworten
- Neueste
- Die meisten Stimmen
- Die meisten Kommentare
0
Hi,
Why don't you try using SageMaker Serverless Inference instead ? It's purely serverless in nature so you pay only when the endpoint is serving inference.
See https://docs.aws.amazon.com/sagemaker/latest/dg/serverless-endpoints.html
Wouldn't that be a better solution for your use case?
Best,
Didier
0
AWS SageMaker Real-Time Inference: scaling down to zero feature is now launched: https://aws.amazon.com/about-aws/whats-new/2024/11/amazon-sagemaker-scale-down-zero-ai-inference-save-costs/
beantwortet vor einem Jahr
Relevanter Inhalt
- AWS OFFICIALAktualisiert vor einem Jahr
- AWS OFFICIALAktualisiert vor 8 Monaten
- AWS OFFICIALAktualisiert vor einem Jahr

Hello Didier,
Thank you for your answer. I have a few questions regarding SageMaker Serverless Inference:
Thank you for your help!