AWS Glue read a csv file encoded in Windows 1252 with extended characters

0

Is there an option to enable the Dyanmicframe Reader to open a file encoded in cp1252 charset (Windows latin with extended characterset) ?

AWS
專家
已提問 2 年前檢視次數 721 次
1 個回答
0
已接受的答案

Currently, you would need to read the file using the spark.read and than convert to DynamicFrame, see an example below:


df_1252_all = spark.read.option("header", "true").option("charset", "cp1252").csv("s3a://My-BUCKET/cp1252_all.csv")
dyn_all= DynamicFrame.fromDF(df_1252_all, glueContext, "dyn_all")
AWS
專家
已回答 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南