Hi, here's how you can troubleshoot this issue:
Quick Checks:
- Health Check Configuration: Verify the health check settings on your ELB. Make sure the path, port, protocol, timeout, and thresholds match what your web server and backend services actually serve. A slight misconfiguration can cause instances to be marked as unhealthy.
- Check Instance Logs: Review the application and system logs on your EC2 instances, especially around the time they go unhealthy. Look for signs of resource exhaustion, network issues, or application errors.
- Monitor Instance Performance: Use CloudWatch to monitor CPU, memory, and network metrics. Note that memory metrics are not published by default and require the CloudWatch agent on the instance. Spikes or unusual patterns around the time of the failures could point to the root cause.
- Network and Security Groups: Ensure that the security groups and network ACLs are not blocking traffic between the ELB and the instances. Check for any recent changes in your VPC settings.
- Auto-Recovery: Consider enabling EC2 auto-recovery for critical instances. This can automatically recover an instance that fails a system status check, which might reduce the downtime you're experiencing.
- ELB Logs: Check the ELB access logs for anomalies or repeated patterns that might show why the instances are being marked unhealthy.
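As a rough illustration of the CloudWatch step, here is a minimal Python sketch that picks out metric datapoints above a threshold and keeps the ones close to the time an instance went unhealthy. It assumes the datapoints look like the `Datapoints` entries returned by boto3's `get_metric_statistics`; the function names, the 90% threshold, and the sample values are hypothetical, not taken from your environment.

```python
from datetime import datetime, timedelta


def find_spikes(datapoints, threshold, stat="Average"):
    """Return (timestamp, value) pairs above the threshold, oldest first."""
    ordered = sorted(datapoints, key=lambda d: d["Timestamp"])
    return [(d["Timestamp"], d[stat]) for d in ordered if d[stat] > threshold]


def spikes_near(spikes, event_time, window=timedelta(minutes=5)):
    """Keep only the spikes within `window` of the given event time."""
    return [s for s in spikes if abs(s[0] - event_time) <= window]


# Hypothetical CPUUtilization datapoints (made-up values for illustration).
sample = [
    {"Timestamp": datetime(2024, 1, 1, 12, 0), "Average": 35.0},
    {"Timestamp": datetime(2024, 1, 1, 12, 10), "Average": 97.5},
    {"Timestamp": datetime(2024, 1, 1, 12, 5), "Average": 92.0},
]

spikes = find_spikes(sample, threshold=90.0)
# Assumed time at which the target was marked unhealthy.
went_unhealthy_at = datetime(2024, 1, 1, 12, 12)
suspects = spikes_near(spikes, went_unhealthy_at)
print(suspects)
```

If spikes line up with the moments the instances are marked unhealthy, resource exhaustion is a more likely cause than a health check misconfiguration.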
I hope this helps! 😁
If anyone has experienced something similar, any input is welcome!
Hello, it seems you first need to check the health check reason codes [1] during the outage. Since, as you explain, one is a web server and the other two are backend services, I guess all nodes become unhealthy once one of them fails to respond to health check requests.
I also suggest you take a look at instance performance using CloudWatch, monitoring CPU, memory, and network metrics.
Kindly share the health check configuration and reason codes for the two load balancers here. That would help a lot in resolving this issue.
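To collect the reason codes, something like the following Python sketch may help. It assumes the load balancers use target groups (ALB/NLB) and that the input has the shape of the response from boto3's `elbv2` `describe_target_health` call. The function name `summarize_not_healthy` and the sample instance IDs are hypothetical.

```python
from collections import Counter


def summarize_not_healthy(response):
    """Count reason codes for every target whose state is not 'healthy'."""
    reasons = Counter()
    for desc in response.get("TargetHealthDescriptions", []):
        health = desc.get("TargetHealth", {})
        if health.get("State") != "healthy":
            reasons[health.get("Reason", "unknown")] += 1
    return dict(reasons)


# In a real account the response would come from something like:
#   boto3.client("elbv2").describe_target_health(TargetGroupArn=...)
# The data below is made up for illustration only.
sample_response = {
    "TargetHealthDescriptions": [
        {"Target": {"Id": "i-aaaa", "Port": 80},
         "TargetHealth": {"State": "unhealthy", "Reason": "Target.Timeout"}},
        {"Target": {"Id": "i-bbbb", "Port": 80},
         "TargetHealth": {"State": "unhealthy", "Reason": "Target.Timeout"}},
        {"Target": {"Id": "i-cccc", "Port": 80},
         "TargetHealth": {"State": "unhealthy",
                          "Reason": "Target.ResponseCodeMismatch"}},
        {"Target": {"Id": "i-dddd", "Port": 80},
         "TargetHealth": {"State": "healthy"}},
    ]
}

summary = summarize_not_healthy(sample_response)
print(summary)
```

Running this during the outage and posting the summary shows at a glance whether the targets are timing out or returning an unexpected response code.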
References [1] https://docs.aws.amazon.com/elasticloadbalancing/latest/application/target-group-health-checks.html