s3 - date folder structure for etl data processes (airflow, etc)

0

Trying to verify the best practice. Say I have a "folder" structure like so:

s3://my-bucket/events/2022/09/01
s3://my-bucket/events/2022/09/02
...

As an example, On 9/3, the client should export into the "folder" s3://my-bucket/events/2022/09/02 (after the 9/2 day).

We have a client who is running their etl on 9/3 and then placing the files on 2022/09/03 folder and we're trying to convince them to put it in the 9/2 folder.

Am I correct is saying that this is the best practice?

1 Resposta
0

Hello,

I would like to inform you that there is no general guidance or best practices prescribed by AWS for the query that you are asking. That being said, it is a subjective question and depends from use-case to use-case. Personally, I would agree with you in convincing your client to put the ETL processed data in the same named folder as the date on which the data was produced.

AWS
ENGENHEIRO DE SUPORTE
anil_d
respondido há 2 anos

Você não está conectado. Fazer login para postar uma resposta.

Uma boa resposta responde claramente à pergunta, dá feedback construtivo e incentiva o crescimento profissional de quem perguntou.

Diretrizes para responder a perguntas