Kinesis Transformation Buffering from Data Stream

0

Hi, the current pipeline I am implementing involves a Kinesis Data Stream -> Kinesis Delivery Stream -> S3 route, however when setting the buffer intervals (Both s3 and transformation) to near 5 minutes, data still rapidly appears within the S3 bucket.

For my purposes I need the data for every 5 minutes to be combined into a single large file to be pushed into S3, which I was under the assumption Kinesis delivery streams should handle under the hood when taking in from a Kinesis data stream.

I'd appreciate any help in pointing out where my implementation could potentially be going astray :).

2개 답변
0
수락된 답변

Issue resolved for anyone interested, by switching to direct PUTs into the delivery stream the files are properly aggregated.

However I was not able to find a reason for why the data stream -> delivery stream transition does not result in proper aggregation of data, although I suspect it may have to do with the data stream shards.

EY
답변함 2년 전
profile picture
전문가
검토됨 한 달 전
0

Kinesis firehose delivery stream has two options for buffering the data, Buffer Size and Buffer Interval. If buffering data exceed the Buffer Size, the data could be delivered to S3 in previous to Buffer Interval. What value is Buffer Size set?

https://docs.aws.amazon.com/firehose/latest/dev/basic-deliver.html#frequency

The frequency of data delivery to Amazon S3 is determined by the Amazon S3 Buffer size and Buffer interval value that you configured for your delivery stream. Kinesis Data Firehose buffers incoming data before it delivers it to Amazon S3. You can configure the values for Amazon S3 Buffer size (1–128 MB) or Buffer interval (60–900 seconds). The condition satisfied first triggers data delivery to Amazon S3. When data delivery to the destination falls behind data writing to the delivery stream, Kinesis Data Firehose raises the buffer size dynamically. It can then catch up and ensure that all data is delivered to the destination.

AWS
suzuki
답변함 2년 전
profile picture
전문가
검토됨 한 달 전
  • The size of the file never exceeds the max buffer size, which is anyways set to max (128)

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠