Glue Crawler CSV file with a field containing commas

0

I have a CSV file, which contains a text field enclosed in double-quotes with commas inside of it. By default Glue Crawler splits the field into columns at the commas. Is there a way to make it realize that it is one field because it is enclosed in double-quotes?

Below is an example of the data. The 3rd field called 'description' is the one containing commas in it.

id,country,description
0,Italy,"Aromas include tropical fruit, broom, brimstone and dried herb. The palate isn't overly expressive, offering unripened apple, citrus and dried sage alongside brisk acidity."
1,Portugal,"This is ripe and fruity, a wine that is smooth while still structured. Firm tannins are filled out with juicy red berry fruits and freshened with acidity. It's  already drinkable, although it will certainly be better from 2016."
2,US,"Tart and snappy, the flavors of lime flesh and rind dominate. Some green pineapple pokes through, with crisp acidity underscoring the flavors. The wine was all stainless-steel fermented."
AWS
Denis_A
已提问 3 年前3135 查看次数
1 回答
1
已接受的回答

I think for that you need a custom crawler csv classifier to specify the quote character, see here

profile pictureAWS
专家
已回答 3 年前
profile picture
专家
已审核 4 个月前
AWS
专家
已审核 1 年前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则