Glue Crawler CSV file with a field containing commas

0

I have a CSV file, which contains a text field enclosed in double-quotes with commas inside of it. By default Glue Crawler splits the field into columns at the commas. Is there a way to make it realize that it is one field because it is enclosed in double-quotes?

Below is an example of the data. The 3rd field called 'description' is the one containing commas in it.

id,country,description
0,Italy,"Aromas include tropical fruit, broom, brimstone and dried herb. The palate isn't overly expressive, offering unripened apple, citrus and dried sage alongside brisk acidity."
1,Portugal,"This is ripe and fruity, a wine that is smooth while still structured. Firm tannins are filled out with juicy red berry fruits and freshened with acidity. It's  already drinkable, although it will certainly be better from 2016."
2,US,"Tart and snappy, the flavors of lime flesh and rind dominate. Some green pineapple pokes through, with crisp acidity underscoring the flavors. The wine was all stainless-steel fermented."
AWS
demandé il y a 4 ans3,6 k vues
1 réponse
2
Réponse acceptée

I think for that you need a custom crawler csv classifier to specify the quote character, see here

profile pictureAWS
EXPERT
répondu il y a 4 ans
profile picture
EXPERT
vérifié il y a 5 mois
profile picture
EXPERT
vérifié il y a 10 mois
AWS
EXPERT
vérifié il y a 2 ans

Vous n'êtes pas connecté. Se connecter pour publier une réponse.

Une bonne réponse répond clairement à la question, contient des commentaires constructifs et encourage le développement professionnel de la personne qui pose la question.

Instructions pour répondre aux questions