1 Answer
- Newest
- Most votes
- Most comments
4
Hello,
There might some issue in metrics-collector daemon running on the primary node. Please check if the process is running in the primary node and status of the process.
ps -ef | grep metrics-collector
-or-
sudo systemctl status metricscollector.service
If it is down, please start the service back. If the master node run out of memory, cpu or disk capacity, then it might fail to collect the metric data. Please validate them using below commands,
free -m
ps auxwww --sort -%cpu | head -20
df -h
Besides, you can also check the instance-controller log & instance-state log on the primary node to see if any issue that blocked this daemon. Please refer the EMR log locations
Relevant content
- asked 5 years ago
- asked 3 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 3 months ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 2 years ago
and
I did restart with the commands you mentioned above. And does not seems to fix the problem, still not seeing any metrics.
There should be some clue in the instance-controller log & instance-state log on the primary node. Please check the metricscollector related info in the aforementioned logs. Also check if any IAM permission in the service role changed as there might a possibility that metric data not fetched by Cloudwatch due to permission issue.