Query data stored in S3 using Athena


Hi AWS, I have a folder inside an S3 bucket where Cost and Usage Report data is stored in both .csv and .csv.gz formats. When I create a table using that LOCATION, the records are not displayed in the proper format. I also need to separate the two file types so that I don't have to spend time on data munging and cleaning.

Can you please help me understand how to query .csv.gz files using Athena and get properly formatted records?

Thanks

1 Answer

Cost and Usage Reports (CUR) can be delivered in Apache Parquet format for use with Athena.

When creating the report, select whether you want your Cost and Usage Reports to integrate with Amazon Athena, Amazon Redshift, or Amazon QuickSight. The report is then compressed according to that choice: Parquet format for Athena, and .gz compression for Amazon Redshift or Amazon QuickSight.
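
If the report has already been delivered as .csv.gz, Athena can still read it, since gzip-compressed text files are decompressed automatically based on the .gz extension. The following is a minimal sketch, not the official CUR setup, of a Python helper that builds a CREATE EXTERNAL TABLE statement using the OpenCSVSerde. The function name, table name, column names, and S3 location are all hypothetical placeholders and must be replaced with the report's actual values.

```python
def build_csv_gz_table_ddl(table_name, columns, s3_location):
    """Return an Athena CREATE EXTERNAL TABLE statement for headered CSV files.

    The same statement works for .csv and .csv.gz objects, because Athena
    picks the decompression codec from the file extension.
    """
    # Declaring every column as string is the simplest safe choice with
    # OpenCSVSerde; values can be cast to other types in the query itself.
    column_lines = ",\n".join(f"  `{name}` string" for name in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table_name} (\n"
        f"{column_lines}\n"
        ")\n"
        "ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.OpenCSVSerde'\n"
        f"LOCATION '{s3_location}'\n"
        # Skip the header row that each CSV file starts with.
        "TBLPROPERTIES ('skip.header.line.count' = '1')"
    )


# Hypothetical names, for illustration only.
ddl = build_csv_gz_table_ddl(
    "cur_csv_gz",
    ["line_item_id", "usage_start_date", "unblended_cost"],
    "s3://example-bucket/cur/gz/",
)
print(ddl)
```

The printed statement can be pasted into the Athena query editor. The LOCATION should point at a prefix that contains only the data files, because Athena tries to parse every object under it.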

Refer to this for more details: https://docs.aws.amazon.com/cur/latest/userguide/cur-query-athena
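
For separating the two file types mentioned in the question, one possible approach is to group the object keys by suffix and then copy each group to its own prefix, so that each Athena table points at a single format. This is a rough sketch under that assumption; the function name and the example keys are hypothetical, and the keys would in practice come from a listing of the bucket.

```python
def split_keys_by_format(keys):
    """Group S3 object keys into gzip CSV, plain CSV, and everything else."""
    groups = {"csv_gz": [], "csv": [], "other": []}
    for key in keys:
        lowered = key.lower()  # match suffixes case-insensitively
        if lowered.endswith(".csv.gz"):
            groups["csv_gz"].append(key)
        elif lowered.endswith(".csv"):
            groups["csv"].append(key)
        else:
            # Manifests and any other non-data objects end up here.
            groups["other"].append(key)
    return groups


# Hypothetical keys, for illustration only.
example_keys = [
    "cur/report-1.csv.gz",
    "cur/report-2.csv",
    "cur/report-Manifest.json",
]
for file_format, matched in split_keys_by_format(example_keys).items():
    print(file_format, matched)
```

Keeping the "other" group out of the table location matters as much as splitting the CSV variants, since stray non-CSV objects are a common cause of malformed rows.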

AWS
Kiran
answered a year ago
