Why does ECS deploy fail ALB healthchecks randomly?


I'm seeing this error intermittently in CDK deploys:

Task failed ELB health checks in (target-group arn:aws:elasticloadbalancing:ap-southeast-2:649922175064:targetgroup/....

Basically, the first task fails with the above message and is stopped, but ECS then launches another task and everything works. The problem is that this adds at least 5 minutes to my deploy time, which is not ideal. The only change being deployed is a new ECR image and a task definition pointing to the new image. I don't understand why ECS tasks occasionally fail ELB health checks, but it means waiting for the failed task to stop completely before ECS launches a replacement.

How could we debug the issue?

1 Answer

Hi,

How long does your container take to start before it is up and running? This article by Nathan Peck gives some advice on how to deploy ECS containers correctly: https://nathanpeck.com/speeding-up-amazon-ecs-container-deployments/
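
To make the startup-time question concrete, here is a rough sketch in TypeScript. It is a simplified model and not the exact ECS/ALB algorithm: it assumes a task is at risk when it needs longer to start than the service's health check grace period plus the time the target group takes to count enough failed checks. The names `HealthCheckSettings`, `secondsUntilMarkedUnhealthy` and `isAtRisk` are hypothetical, and the numbers are for illustration only, so substitute the values from your own service and target group.

```typescript
// Simplified model (an assumption, not the exact ECS/ALB behaviour):
// a task is at risk when it takes longer to start than the grace period
// plus the time the load balancer needs to mark a target unhealthy.
interface HealthCheckSettings {
  gracePeriodSeconds: number; // ECS service health check grace period
  intervalSeconds: number;    // target group health check interval
  unhealthyThreshold: number; // consecutive failed checks before "unhealthy"
}

// Approximate time a new task has before it is reported unhealthy.
function secondsUntilMarkedUnhealthy(s: HealthCheckSettings): number {
  return s.gracePeriodSeconds + s.intervalSeconds * s.unhealthyThreshold;
}

// True when the measured boot time does not fit inside that budget.
function isAtRisk(bootSeconds: number, s: HealthCheckSettings): boolean {
  return bootSeconds >= secondsUntilMarkedUnhealthy(s);
}

// Hypothetical settings, for illustration only.
const settings: HealthCheckSettings = {
  gracePeriodSeconds: 0,
  intervalSeconds: 30,
  unhealthyThreshold: 2,
};

console.log(secondsUntilMarkedUnhealthy(settings)); // 60
console.log(isAtRisk(45, settings)); // false
console.log(isAtRisk(70, settings)); // true
```

If your measured boot time sits close to that budget, small variations in image pull or startup time could explain why only some deploys fail.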

Donov
answered 9 months ago
