- Newest
- Most votes
- Most comments
=>In response to your question:
=>The statement from the AWS documentation is correct. During the retention period of the Kinesis stream, any failures encountered by Firehose, or the associated Lambda function will trigger automatic retries until the data is successfully delivered or the retention period ends. No additional configuration is required to enable this feature.
=>Regarding the issue you encountered with data loss: =>It seems that the failure records were not successfully reprocessed by Firehose.
=>Check if the retry duration configured for Firehose is sufficient to allow for multiple retries within the retention period.
=>Ensure that the S3 destination bucket has the necessary permissions for Firehose to write data to it.
=>Double-check the data transformation function and ensure that it is correctly configured to handle the data format conversion errors.
Hello Andrew,
The retry mechanism in Kinesis Firehose is a default feature and you need not change any configuration to enable it.
Having said that, each Amazon Data Firehose destination has its own data delivery failure handling.
Data delivery to your S3 bucket might fail for various reasons. For example, the bucket might not exist anymore, the IAM role that Amazon Data Firehose assumes might not have access to the bucket, the network failed, or similar events. Under these conditions, Amazon Data Firehose keeps retrying for up to 24 hours until the delivery succeeds. The maximum data storage time of Amazon Data Firehose is 24 hours. If data delivery fails for more than 24 hours, your data is lost.
[+] Amazon Data Firehose Data Delivery - Data Delivery Failure Handling - https://docs.aws.amazon.com/firehose/latest/dev/basic-deliver.html#retry
Regarding the queries related to the Firehose instance which you created for testing purposes, I would not be able to answer since we require details that are non-public information. I kindly request you to open a support case with AWS using the following link.
Thank you Aparajith,
The maximum data storage time of Amazon Data Firehose is 24 hours. If data delivery fails for more than 24 hours, your data is lost. If the data source is Kinesis Stream, Should the maximum retry duration would be the retention period?
Also if we enabled S3backup. When Firehose put the record into that S3backup location? Will Firehose re-process the records in backup location automatically as well?
Relevant content
- asked 2 months ago
- asked 2 years ago
- AWS OFFICIALUpdated 6 months ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated a year ago
Thank you Mahankali,
If we enabled S3backup. When Firehose put the record into that S3backup location? Will Firehose re-process the records in backup location automatically as well?