운영 환경:
EKS(Amazon Elastic Kubernetes Service)를 사용하여 서비스 운영 중이며 Clodwatch로 모니터링 함
문제 상황:
9월 18일 이후 CloudWatch의 저장 사용량이 하루 만에 10000% 증가
하루에 수백 기가의 로그가 쌓임. CloudWatch의 IncomingBytes가 비정상적으로 증가
CloudWatch 요금이 하루에 500달러 이상으로 급증(이전: 1-2달러/일). 3일동안 유지하였음
현재는 CloudWatch Observability 플러그인을 비활성화하여 현재는 발생하지 않음.
문제 발생 시점:
EKS Plugin의 CloudWatch Observability 업데이트 이후 갑자기 발생
추정 원인:
CloudWatch Plugin의 오류로 판단됨
아래 로그가 1초에 수십~수백개씩 쌓임
@message
{"time":"2024-09-18T15:00:00.183420141Z","stream":"stdout","_p":"P","log":"2024-09-18 15:00:00 +0000 [warn]: #0 [in_tail_fluentd_logs] pattern not matched: "2024-09-18T14:59:57.020846089Z stdout P 2024-09-18 14:59:57 +0000 [warn]: #0 [in_tail_fluentd_logs] pattern not matched: \"2024-09-18T14:59:46.08281019Z stdout P 2024-09-18 14:59:46 +0000 [warn]: #0 [in_tail_fluentd_logs] pattern not matched: \\\"2024-09-18T14:59:40.624972596Z stdout P \\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\...............
요청 사항:
이 상황에 대한 증가분에 대한 요금 해결 문의 드립니다