Glue processing a csv


I am having a csv file on a s3 bucket and using a glue crawler to have a table created on Athena. The table gets created but the data gets messed up. The values gets shifted to the adjacent columns for some of the records. Is there any way to mitigate this? Does glue work fine for csv files or do the files need to be converted to a paruqet or json.

asked 3 years ago151 views
1 Answer
Accepted Answer

Glue should work fine with CSV or any other delimited format. When you mentioned "some records", can you validate that the csv row corresponding to those records are well-formatted ? You can compare how Glue perform by using EMR Hive to create a s3-backed table.

answered 3 years ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions