Glue Crawler CSV file with a field containing commas

0

I have a CSV file, which contains a text field enclosed in double-quotes with commas inside of it. By default Glue Crawler splits the field into columns at the commas. Is there a way to make it realize that it is one field because it is enclosed in double-quotes?

Below is an example of the data. The 3rd field called 'description' is the one containing commas in it.

id,country,description
0,Italy,"Aromas include tropical fruit, broom, brimstone and dried herb. The palate isn't overly expressive, offering unripened apple, citrus and dried sage alongside brisk acidity."
1,Portugal,"This is ripe and fruity, a wine that is smooth while still structured. Firm tannins are filled out with juicy red berry fruits and freshened with acidity. It's  already drinkable, although it will certainly be better from 2016."
2,US,"Tart and snappy, the flavors of lime flesh and rind dominate. Some green pineapple pokes through, with crisp acidity underscoring the flavors. The wine was all stainless-steel fermented."
AWS
Denis_A
質問済み 3年前3135ビュー
1回答
1
承認された回答

I think for that you need a custom crawler csv classifier to specify the quote character, see here

profile pictureAWS
エキスパート
回答済み 3年前
profile picture
エキスパート
レビュー済み 4ヶ月前
AWS
エキスパート
レビュー済み 1年前

ログインしていません。 ログイン 回答を投稿する。

優れた回答とは、質問に明確に答え、建設的なフィードバックを提供し、質問者の専門分野におけるスキルの向上を促すものです。

質問に答えるためのガイドライン

関連するコンテンツ