Glue Crawler and Classifiers - Supported file encodings: is UTF16 supported?
Hi, AWS Glue crawlers with CSV and XML classifiers work well with files encoded in UTF-8, but not with files encoded in UTF-16.
The public documentation does not clarify this point:
- Do Glue crawlers and classifiers support UTF-16?
- Is there any documentation available on the encodings supported by Glue crawlers and classifiers?
At the moment, Glue supports UTF-8 encoded files only. If UTF-16 files are passed in, you may encounter the "Internal Service Exception" error message. The most feasible approach is to programmatically convert the UTF-16 files to UTF-8 before passing them to the Glue crawler.
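For illustration, here is a minimal sketch of that conversion in Python. It assumes the files are available on a local disk (downloading from and re-uploading to S3 is not shown) and that they start with a byte-order mark; the function name `convert_utf16_to_utf8` and the file paths are hypothetical, not part of any Glue API.

```python
def convert_utf16_to_utf8(src_path, dst_path, chunk_size=1 << 20):
    """Re-encode a UTF-16 text file as UTF-8, reading it in chunks."""
    # The "utf-16" codec uses the byte-order mark (BOM) to detect endianness
    # and strips it on read. For files without a BOM, an explicit
    # "utf-16-le" or "utf-16-be" may be needed instead (assumption about
    # your data, adjust as required).
    # newline="" disables newline translation so line endings are preserved.
    with open(src_path, "r", encoding="utf-16", newline="") as src, \
         open(dst_path, "w", encoding="utf-8", newline="") as dst:
        while True:
            chunk = src.read(chunk_size)  # chunk_size is in characters
            if not chunk:
                break
            dst.write(chunk)


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    convert_utf16_to_utf8("input_utf16.csv", "output_utf8.csv")
```

The converted file can then be uploaded to the S3 location that the crawler scans. Reading in chunks keeps memory use bounded for large files.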