After a recent software update to the Storage Gateway service, one of our file shares stays in the ALARM state. The FilesFailedtoUpload count is constant. It's a false positive alarm. How can we rectify it?


asked 3 months ago, 123 views
2 Answers
2
Accepted Answer

Hey there!

Let's dive into resolving this situation with a focused approach:

  1. Review CloudWatch HealthNotifications: Visit AWS CloudWatch and examine the HealthNotifications for your File share. These insights are key to understanding the alarm trigger and verifying if it's indeed a false positive.
  2. Inspect the FilesFailingUpload Metric: Utilize CloudWatch to monitor the FilesFailingUpload metric, focusing on the "Sum" statistic. This will help quantify any ongoing upload issues and determine if the problem persists after addressing potential causes.
  3. Check for Specific File Issues: Investigate if certain files are repeatedly failing to upload due to size, format, or corruption. Identifying and rectifying these specific issues can prevent recurring alarms.
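Steps 1 and 2 above can also be scripted. Here is a minimal sketch, assuming boto3 is installed and credentials are configured. The `AWS/StorageGateway` namespace and the `FilesFailingUpload` metric name come from CloudWatch; the `GatewayId`/`GatewayName` dimensions are an assumption about how your metric is published, and the function names are just for illustration.

```python
from datetime import datetime, timedelta, timezone


def build_metric_query(gateway_id, gateway_name, hours=24, period_seconds=300, now=None):
    """Build keyword arguments for CloudWatch get_metric_statistics."""
    end = now or datetime.now(timezone.utc)
    return {
        "Namespace": "AWS/StorageGateway",
        "MetricName": "FilesFailingUpload",
        # Assumption: the metric is published with these two dimensions.
        "Dimensions": [
            {"Name": "GatewayId", "Value": gateway_id},
            {"Name": "GatewayName", "Value": gateway_name},
        ],
        "StartTime": end - timedelta(hours=hours),
        "EndTime": end,
        "Period": period_seconds,
        "Statistics": ["Sum"],
    }


def fetch_sums(query):
    """Call CloudWatch and return (timestamp, sum) pairs in time order."""
    import boto3  # imported here so the builder above works without boto3

    response = boto3.client("cloudwatch").get_metric_statistics(**query)
    points = sorted(response["Datapoints"], key=lambda p: p["Timestamp"])
    return [(p["Timestamp"], p["Sum"]) for p in points]
```

If every returned Sum is the same non-zero value, that suggests the same files are failing on each retry rather than new failures arriving.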

If possible, could you please provide more detailed information on this matter?

Please note: for the FilesFailingUpload metric, it is advisable to alarm on 1 datapoint within a 5-minute interval.
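As a rough sketch of that alarm setting, this is how it might look with boto3's `put_metric_alarm`. I am reading the advice as "Sum >= 1 for 1 datapoint over a 300-second period"; the comparison operator, the dimensions, and the alarm name are my assumptions, so adjust them to your setup.

```python
def build_alarm_definition(gateway_id, gateway_name, alarm_name="FilesFailingUpload-alarm"):
    """Build keyword arguments for CloudWatch put_metric_alarm."""
    return {
        "AlarmName": alarm_name,  # hypothetical name
        "Namespace": "AWS/StorageGateway",
        "MetricName": "FilesFailingUpload",
        # Assumption: the metric is published with these two dimensions.
        "Dimensions": [
            {"Name": "GatewayId", "Value": gateway_id},
            {"Name": "GatewayName", "Value": gateway_name},
        ],
        "Statistic": "Sum",
        "Period": 300,            # 5-minute interval
        "EvaluationPeriods": 1,
        "DatapointsToAlarm": 1,   # 1 datapoint is enough to alarm
        "Threshold": 1.0,
        # Assumption: alarm as soon as at least one file is failing.
        "ComparisonOperator": "GreaterThanOrEqualToThreshold",
    }


def create_alarm(definition):
    """Create or update the alarm in CloudWatch."""
    import boto3  # imported here so the builder above works without boto3

    boto3.client("cloudwatch").put_metric_alarm(**definition)
```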

EXPERT
answered 3 months ago
EXPERT
reviewed 2 months ago
  • Hi, I can share the CloudWatch log entry for this issue:

    { "severity": "ERROR", "bucket": "bucketname", "roleArn": "arn:aws:iam::xxxxxxxx", "source": "share-xxxxxxxxxxx", "type": "ObjectMissing", "operation": "S3Upload", "key": "cdc/smbdata/Environmant/InstanceName/xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.txt", "gateway": "sgw-XXXXXXXXX", "timestamp": "XXXXXXXXXXXXXX" }

    Please let me know if there is anything we can do at the server level.
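If there are many entries like the one above, grouping them by `type` and `operation` shows quickly whether the same keys keep failing. A small sketch using only the standard library; the field names are taken from the log entry in this thread, and `summarize_notifications` is just an illustrative helper name.

```python
import json
from collections import Counter


def summarize_notifications(lines):
    """Count records by (type, operation) and collect the affected object keys."""
    counts = Counter()
    keys = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not JSON
        if not isinstance(record, dict):
            continue
        group = (record.get("type"), record.get("operation"))
        counts[group] += 1
        keys.setdefault(group, set()).add(record.get("key"))
    return counts, keys


# The entry posted above, with its placeholders left as they were.
sample = (
    '{ "severity": "ERROR", "bucket": "bucketname", '
    '"roleArn": "arn:aws:iam::xxxxxxxx", "source": "share-xxxxxxxxxxx", '
    '"type": "ObjectMissing", "operation": "S3Upload", '
    '"key": "cdc/smbdata/Environmant/InstanceName/xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx.txt", '
    '"gateway": "sgw-XXXXXXXXX", "timestamp": "XXXXXXXXXXXXXX" }'
)

counts, keys = summarize_notifications([sample])
for (kind, operation), n in counts.items():
    print(kind, operation, n, sorted(keys[(kind, operation)]))
```

A single key that repeats under the same type would line up with a constant FilesFailedtoUpload count.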

0

Could you please check the threshold settings for the alarm on the FilesFailedtoUpload count? Make sure the threshold values are appropriate for your workload, and adjust them if necessary. If the false positive alarm started after a recent software update, consider rolling back to the previous software version to see whether the issue resolves. Alternatively, check for any available patches or updates from AWS that might address the problem. You might also find some useful information here: https://docs.aws.amazon.com/storagegateway/latest/vgw/monitoring-volume-gateway.html

I hope this clarifies things. If it does, I would appreciate it if you accepted the answer so that the community can benefit, thanks ;)

EXPERT
answered 3 months ago
AWS EXPERT
reviewed 3 months ago
  • A Storage Gateway file share is an AWS managed service. We can't go back to the previous version, because the update is applied by AWS during the maintenance window. We set a threshold of 10, with the datapoint period set to 15 minutes.

  • Can you check the logs generated by the Storage Gateway service to identify any specific errors or issues related to file uploads? Look for error messages, warnings, or patterns in the logs that might indicate the cause of the false positive alarm.
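One possible way to pull those entries is CloudWatch Logs `filter_log_events` with a JSON filter pattern. This is only a sketch: the log group name is whatever you configured for the gateway (hypothetical here), and the `severity` and `operation` fields are taken from the log entry posted earlier in this thread.

```python
from datetime import datetime, timedelta, timezone


def build_log_filter(log_group_name, hours=24, now=None):
    """Build keyword arguments for CloudWatch Logs filter_log_events."""
    end = now or datetime.now(timezone.utc)
    start = end - timedelta(hours=hours)
    return {
        "logGroupName": log_group_name,
        # Match JSON events that report a failed S3 upload.
        "filterPattern": '{ $.severity = "ERROR" && $.operation = "S3Upload" }',
        # CloudWatch Logs expects milliseconds since the Unix epoch.
        "startTime": int(start.timestamp() * 1000),
        "endTime": int(end.timestamp() * 1000),
    }


def fetch_messages(log_filter):
    """Return the raw message of every matching log event."""
    import boto3  # imported here so the builder above works without boto3

    paginator = boto3.client("logs").get_paginator("filter_log_events")
    return [
        event["message"]
        for page in paginator.paginate(**log_filter)
        for event in page["events"]
    ]
```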
