- Newest
- Most votes
- Most comments
I understand you're experiencing etcd throttling issues on your EKS cluster. However, I need to clarify an important point: the etcd rate limit in Amazon EKS is not configurable or adjustable. The etcd database has a fixed size limit, and when exceeded, it can cause your cluster to enter a degraded state.
Based on the symptoms you're describing - excessive watch connections from Grafana sidecars and Helm release storms - you're likely hitting etcd's capacity limits rather than needing a rate limit increase. The solution is to reduce the load on etcd by addressing the root causes:
Immediate actions to reduce etcd load:
-
Reduce unnecessary Kubernetes objects - Clean up any unnecessary ConfigMaps, Secrets, or other objects that are contributing to the etcd database size
-
Address the Grafana sidecar watch connections - 120 watch connections per hour per instance is excessive. You'll need to:
- Optimize or reconfigure the sidecars to reduce watch frequency
- Consider alternative approaches to ConfigMap monitoring that don't require constant watch connections
- Potentially batch or consolidate ConfigMap updates
- Control Helm deployments - Avoid simultaneous Helm release storms by:
- Staggering deployments across your customer instances
- Implementing rate limiting or queuing for Helm operations
- Using deployment strategies that reduce concurrent API calls
The key issue is that your current architecture is generating too much load on etcd. Rather than increasing limits (which isn't possible), you need to reduce the number of API requests and watch connections your workloads are making. Once you've reduced this load, you should be able to make changes to your cluster more reliably.
If you continue experiencing issues after cleanup and optimization, you should contact AWS Support for additional guidance specific to your cluster's situation.
Sources
Troubleshoot problems with Amazon EKS clusters and nodes - Amazon EKS
Control Plane Monitoring - Amazon EKS
Relevant content
- asked 4 years ago
- asked 3 years ago
- asked 3 years ago
