1 Answer
- Newest
- Most votes
- Most comments
0
ModelLatency is helpful because Sagemaker requires the container to respond within 60 seconds [1]: if you see the ModelLatency at or above 60 seconds that confirms the container isn't responding fast enough. At that point you'll need to debug why your container isn't fast enough: if it's a custom container you wrote you'll need to debug it; if it's a built-in container you should reach out to AWS support for assistance.
[1] Timeout documentation https://docs.aws.amazon.com/sagemaker/latest/dg/your-algorithms-inference-code.html#your-algorithms-inference-code-container-response
answered 5 years ago
Relevant content
- asked 6 months ago
- AWS OFFICIALUpdated 8 months ago
- AWS OFFICIALUpdated 3 years ago
- AWS OFFICIALUpdated 8 months ago
- AWS OFFICIALUpdated 7 months ago