How to debug invocation timeouts for Redshift ML BYOM remote inferences

0

I have an existing SageMaker inference endpoint that I'm successfully calling from Aurora PostgreSQL using the aws_ml extension's invoke_endpoint function. I'm now trying to use the same endpoint from Redshift.

Based on Getting started with Amazon Redshift ML, I've set up the necessary IAM policies, created a model for the endpoint in Redshift, and called it via the model's registered function. However, I'm getting an error after 370 seconds no matter what I try.

Query 1 ERROR: ERROR:  Received server error (0) from primary with message "Your invocation timed out while waiting for a response from container primary. Review the latency metrics for each container in Amazon CloudWatch, resolve the issue, and try again.". See https://us-east-
DETAIL:  
  -----------------------------------------------
  error:  Received server error (0) from primary with message "Your invocation timed out while waiting for a response from container primary. Review the latency metrics for each container in Amazon CloudWatch, resolve the issue, and try again.". See https://us-east-
  code:      32207
  context:   
  query:     4076
  location:  exfunc_client.cpp:136
  process:   query1_125_4076 [pid=29885]
  -----------------------------------------------

I can see work being performed in the endpoint containers, and there's no errors reported. One major difference between Aurora PostgreSQL and Redshift is that there's no controls for batch size from Redshift. In Aurora PostgreSQL, I typically pass a batch size of around 1000 to invoke_endpoint. Redshift is sending 50000 to 220000 rows per batch, which can take a couple minutes to complete.

Does anyone have any suggestions on how I can debug this? The query failure is always at 370 seconds. I'm not sure what the significance of that number is.

profile picture
已提問 1 年前檢視次數 299 次
1 個回答
0

Hello,

To answer your question, we would require details that are non-public information. Please open a support case with AWS using the following link

https://console.aws.amazon.com/support/home#/case/create

If a support case has already been created please be assured that we will get back to you and assist you in the best way possible.

AWS
已回答 1 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南