Glue Crawler CSV file with a field containing commas

0

I have a CSV file, which contains a text field enclosed in double-quotes with commas inside of it. By default Glue Crawler splits the field into columns at the commas. Is there a way to make it realize that it is one field because it is enclosed in double-quotes?

Below is an example of the data. The 3rd field called 'description' is the one containing commas in it.

id,country,description
0,Italy,"Aromas include tropical fruit, broom, brimstone and dried herb. The palate isn't overly expressive, offering unripened apple, citrus and dried sage alongside brisk acidity."
1,Portugal,"This is ripe and fruity, a wine that is smooth while still structured. Firm tannins are filled out with juicy red berry fruits and freshened with acidity. It's  already drinkable, although it will certainly be better from 2016."
2,US,"Tart and snappy, the flavors of lime flesh and rind dominate. Some green pineapple pokes through, with crisp acidity underscoring the flavors. The wine was all stainless-steel fermented."
AWS
Denis_A
질문됨 3년 전3135회 조회
1개 답변
1
수락된 답변

I think for that you need a custom crawler csv classifier to specify the quote character, see here

profile pictureAWS
전문가
답변함 3년 전
profile picture
전문가
검토됨 4달 전
AWS
전문가
검토됨 일 년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠