Glue processing a csv

0

I am having a csv file on a s3 bucket and using a glue crawler to have a table created on Athena. The table gets created but the data gets messed up. The values gets shifted to the adjacent columns for some of the records. Is there any way to mitigate this? Does glue work fine for csv files or do the files need to be converted to a paruqet or json.

AWS
posta 4 anni fa407 visualizzazioni
1 Risposta
0
Risposta accettata

Glue should work fine with CSV or any other delimited format. When you mentioned "some records", can you validate that the csv row corresponding to those records are well-formatted ? You can compare how Glue perform by using EMR Hive to create a s3-backed table.

AWS
wijayah
con risposta 4 anni fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande