Cloudwatch Alarm Observability Metric Disappeared

0

Hi everyone, I was recently using the 'glue.error.all' observability metric to send me a notification when my glue jobs fail through SNS topics. It was working fine, but recently I checked and saw that the 'glue.error.all' metric does not show any more on my CloudWatch Glue Observability Metrics anymore. Is there a reason for this or is there something I can do to make it available again?

Below are the configurations I used which worked for a couple months until recently I noticed the metric didn't exist anymore. Statistic : SUM

Period : 60

MetricName: glue.error.ALL

TreatMissingData :notBreaching

ComparisonOperator : GreaterThanOrEqualToThreshold

Threshold: 1

  • I believe if there are no errors the metric is not published

sg03
已提问 3 个月前265 查看次数
1 回答
0

Hello,

As mentioned by Gonzalo, this could be one of the reason that if there are no errors then the metric is not published. Please verify the latest job runs to confirm the same. Further to troubleshoot this could you please verify the IAM permissions for the Glue job's role to ensure there are all the required permissions. Also I would request you to once check your cloudtrail logs in the respective time period to see if there are any errors there which could explain the same.

Further, as a workaround you can instead follow the instructions from the below documentation to get SNS notifications based on when your job status changes to "FAILED" instead of using the 'glue.error.all' metric.

AWS
Ankur_J
已回答 3 个月前
profile picture
专家
已审核 1 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则