- Newest
- Most votes
- Most comments
Hello,
I understand that you are concerned about creating Sagemaker Endpoint for 2 models (Segment Anything & YOLOv8) and Invoking it and would like to gather more information on the same.
Firstly, I would like to mention that this error is usually observed when the worker (instance) called to process the inference request did not respond within the given time, which is 60 seconds for real-time endpoints. This led to the worker being overwhelmed and ultimately dying.
Further, in order to resolve the issue, kindly follow one of the below workarounds -
- It is recommended to either increase the instance type so that it is able to handle the load and process it within 60 mins.
- Reduce the Payload Size limit.
Kindly refer the below doc for more information on the above- [+] https://docs.aws.amazon.com/sagemaker/latest/dg/async-inference-troubleshooting.html
Additionally, kindly make a note that the error means that the customer container returned an error. SageMaker does not control the behavior of customer containers. SageMaker simply returns the response from the ModelContainer and does not retry. If you want, you can configure the invocation to retry on failure. We suggest that you turn on container logging and check your container logs to find the root cause of the 500 error from your model.
I would request that you please refer to the aforementioned documentation once, and please reach out to AWS [4] with the detailed use case so that we can assist you better.
If you have any difficulty verifying any of the above-mentioned points or if you still run into issues, please reach out to AWS Support [4] (Sagemaker) along with your issue or use case in detail, and we would be happy to assist you further.
References:
[1] https://docs.aws.amazon.com/sagemaker/latest/dg/async-inference.html
[2]https://github.com/pytube/pytube/issues/815
[3] https://aws.amazon.com/premiumsupport/
[4] Creating support cases and case management - https://docs.aws.amazon.com/awssupport/latest/user/case-management.html#creating-a-support-casehttps://docs.aws.amazon.com/awssupport/latest/user/case-management.html#creating-a-support-case
Relevant content
- Accepted Answerasked 7 months ago
- asked 2 years ago
- asked 9 months ago
- AWS OFFICIALUpdated 9 months ago
- AWS OFFICIALUpdated 13 days ago
- AWS OFFICIALUpdated a year ago
- AWS OFFICIALUpdated 18 days ago