Intermittent cross zone health check issue

0

I have an application load balancer with an https listener that forwards all requests to a target group with instances in two availability zones. The target group's health check expects a 200 response. All of the intervals/thresholds/timeouts for the health check are at defaults.

The monitoring graphs show the target group fluctuating between 1 and 2 healthy instances. The logs on the instances show consistent 200 responses at 30 second intervals. However, sometimes they log requests from load balancer IPs in both AZs, and sometimes only from the load balancer IP in their own AZ. Both instances fluctuate between these two states, and each state can last for hours at a time. There seems to be a regular pattern to the fluctuating target health status. See e.g., https://imgur.com/a/oB73TrK

The fluctuations match up with what I see in the instance request logs. I.e., the times when the graph shows a change from 2 healthy targets to fewer than 2, that's the same time that an instance stops logging cross AZ health checks. The instances are not at all resource constrained. They're pretty much idle.

So cross-zone health checks are not logged intermittently, while same-zone health checks are logged consistently. I am having trouble understanding what might be causing this. I'm contemplating setting up notifications for unhealthy instances, but this seems like a lot of work with potentially little reward if it doesn't tell me why the check failed. E.g., if it says something like "connection timed out" or "response timeout" I still wouldn't know why that was happening intermittently.

Any idea what I should look at?

질문됨 3년 전410회 조회
1개 답변
0

I figured it out. The ephemeral port range allowed for NACL ingress traffic to the ELB was not broad enough. So there were periods of time where health checks across AZs were using source ports that caused the return traffic to be blocked.

답변함 3년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠