Why does ECS deploy fail ALB healthchecks randomly?

0

I'm seeing this error in random CDK deploys

Task failed ELB health checks in (target-group arn:aws:elasticloadbalancing:ap-southeast-2:649922175064:targetgroup/....

Basically the first task will fails with the above message and is stopped but ECS will retry launching another task and then everything works. The problem is this adds at least an additional 5 minutes to my deploy time which is not ideal. The only change being deployed is a new ECR image and the task definition pointing to the new image. I don't understand why ECS tasks occasionally fails ELB health checks but it means having to waiting for the failed task to completely stop before ECS will retry to launch another task.

How could we debug the issue?

1 回答
0

Hi,

How long is the bootstart of your container before he becomes up and running ? This is an article of Nathan Peck, he gives some advices about to deploy ECS containers correctly. https://nathanpeck.com/speeding-up-amazon-ecs-container-deployments/

profile picture
Donov
已回答 9 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则