1 Answer
Hi, you need to convert the file's character encoding from ISO-8859-1 to UTF-8 before letting AWS Glue process it.
https://docs.aws.amazon.com/glue/latest/dg/components-key-concepts.html
Text-based data, such as CSVs, must be encoded in UTF-8 for AWS Glue to process it successfully.
There are a few examples listed here: https://github.com/aws-samples/aws-glue-samples/blob/master/examples/converting_char_encoding.md. They use Spark to convert the character encoding.
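If the file is small enough to handle outside Spark, the conversion itself can be sketched in plain Python. This is only an illustration, not the method used in the linked samples: the function name `convert_to_utf8` and the file names are made up for the example, and it assumes the source file really is ISO-8859-1 and is available on local disk (for example, downloaded from S3 first).

```python
import os
import tempfile


def convert_to_utf8(src_path, dst_path, src_encoding="iso-8859-1",
                    chunk_size=1024 * 1024):
    """Re-encode a text file to UTF-8, reading it in chunks.

    Hypothetical helper for illustration; src_encoding is an assumption
    about the input and must match the file's real encoding.
    """
    # newline="" disables newline translation, so line endings are preserved.
    with open(src_path, "r", encoding=src_encoding, newline="") as src, \
         open(dst_path, "w", encoding="utf-8", newline="") as dst:
        while True:
            chunk = src.read(chunk_size)
            if not chunk:
                break
            dst.write(chunk)


if __name__ == "__main__":
    # Small demo with a temporary ISO-8859-1 CSV (file names are examples).
    workdir = tempfile.mkdtemp()
    latin1_csv = os.path.join(workdir, "input_latin1.csv")
    utf8_csv = os.path.join(workdir, "output_utf8.csv")

    with open(latin1_csv, "wb") as f:
        f.write("id,name\n1,Café\n".encode("iso-8859-1"))

    convert_to_utf8(latin1_csv, utf8_csv)

    with open(utf8_csv, encoding="utf-8") as f:
        print(f.read())
```

After the converted file is uploaded back to the location Glue reads from, the crawler or job should see UTF-8 text. For large datasets, the Spark-based approach from the samples above is likely the better fit.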