Kubernetes NodePool didn't get properly scaled out - errorCode=timeout?

0

Hi there.

There was a case last Saturday (18th, UTC) that one of my EKS Cluster's Node Pool failed to scale out.

As a result, a few pods were pending for a long while, till I manually scaled out the Node Pool on the AWS console by increasing the desired instance count.

I checked the Cluster Autoscaler logs and found something like this.

W1118 22:35:09.646968       1 clusterstate.go:264] Scale-up timed out for node group eks-on-demand-c5a-2xlarge-<ID> after 15m5.306683546s
W1118 22:35:09.647071       1 clusterstate.go:287] Disabling scale-up for node group eks-on-demand-c5a-2xlarge-<ID> until 2023-11-18 22:40:09.641115003 +0000 UTC m=+425439.629820752; errorClass=Other; errorCode=timeout

It seemed like the node creation got timed out during its process. But I didn't get any warnings or indications about this from anywhere else. Even the node group page on the EKS console didn't report any node health issues.

Is there a way to further investigate this? I just want to understand what exactly happened at that moment.

I suspect that it's an AWS-side infrastructure issue, but not 100% sure.

This ain't happening anymore since after the case.

Thanks a lot.

1 Answer
1
Accepted Answer

It appears that the Cluster Autoscaler (CA) was unable to scale up the node group within the specified timeout period. The timeout is set to 15 minutes in your case (15m5.306683546s), and the error indicates that the node creation process took longer than expected. I recommend reviewing the AWS EKS events with the following command: kubectl get events. Additionally, check the** AWS CloudTrail logs** and inspect the control plane logs. With this information, you can contact AWS Support, providing them with the timestamp and date of the incident for further investigation, especially if the issue persists

profile picture
EXPERT
answered 5 months ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions