Selecting minimum instance for a Sagemaker Endpoint

0

Good day,

I currently trying to find a way of getting the minimum instance type for deploying a Sagemaker Endpoint and is suitable for a batch job. However i have not been able to get an instance with less than 50% error rate while running performance test.

I have also explored the Sagemaker Inference Recommender, but i am getting the below error: "The inference recommender job 022cc1d4-676b-11ee-9232-1609dc98c377 has failed. INVALID_INPUT : Benchmark(s) failed. To investigate the failure reason for each benchmark, please use the ListInferenceRecommendationsJobSteps API."

1개 답변
0

Hi Seun,

What type of endpoint are you deploying and what error are you getting at inference time? Not sure if you are using Batch Transform. When using Batch inference, you have a maximum payload size of GBs and maximum runtime of days. If you're using another type of deployment, such as a real-time endpoint, you might be exceeding a limit of payload size or response timeout. See here for more details.

See this link for a simple script to see the cause of error in the Inference Recommender job. Inference Recommender is the easiest way to assess the minimum instance type that will suit the workload.

Hope this helps.

profile pictureAWS
jnavrro
답변함 7달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠