How to alert if an event doesn't happen within a period of time?

0

Scenario: For compliance reasons a monitor needs to be in place to alert if a backup either fails or otherwise doesn't succeed.

How can we alert if the the rds event category "backup" and id "0002" [Backup Finished] isn't emitted in the last 48 hours?

3 個答案
1

Checking for a negative is always a little tricky.

In this case, I'd have something that is triggered by the positive event (Backup Finished) which stores a timestamp somewhere. Then have another process which checks that timestamp at specified intervals - this process would emit an alert if the timestamp is too old.

profile pictureAWS
專家
已回答 2 年前
1

Please note, as the other contributors have mentioned here, checking for negative is tricky on the managed events and there is no built-in mechanism to achieve this test case. Hence, I would suggest you to consider the below work-around.

  1. Create a lambda function.

  2. Utilize the below script. Kindly make the necessary changes to the instance details.

  3. Kindly make sure the IAM role associated with this lambda function has appropriate permission to describe the snapshots and to subscribe to SNS.

  4. Configure via lambda script to execute every 24 hours

Python script to capture the latest snapshot details

  import json
    import dateutil.tz
    def lambda_handler(event, context):
    mydbInstances = ['sandboxinstance']
    for mydbInstance in mydbInstances:
        snaps1 = [[]*2]
        snapshot = []
       for snapshot in rds.describe_db_snapshots(DBInstanceIdentifier=mydbInstance,SnapshotType='manual')['DBSnapshots']:
           if snapshot['Status']=='available':
            snaps1.append([snapshot['DBSnapshotArn'],snapshot['SnapshotCreateTime']])
        snaps1.remove(snaps1[0])
        snaps1.sort(key=lambda x:x[1], reverse=True)
        print ("RDS Snapshot name " ,snaps1[0])
        SourceDBSnapshotIdentifierARN=snaps1[0][0]
        install_time = snaps1[0][1]
        right_now = datetime.datetime.now(dateutil.tz.tzlocal())
        diff = right_now - install_time
        diff_minutes = (diff.days * 24 * 60) + (diff.seconds/60)
        print diff_minutes

Trigger E-Mail based on the difference in minutes observed.

   notification = "Here is the SNS notification for Lambda function tutorial."
        client = boto3.client('sns')
        response = client.publish (
              TargetArn = "arn:aws:sns:us-east-1:xxxxxx:RDSNote",
              Message = json.dumps({'default': notification}),
              MessageStructure = 'json'
        )
AWS
支援工程師
已回答 2 年前
0

You could also create a file after the job finishes and then check if the file or object in a bucket exists after the certain time period expires. You'd have to delete it at some point, too, of course, like before the job starts.

已回答 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南