CSV to Parquet using AWS Glue


I converted a CSV file (in S3) to Parquet (also in S3) using AWS Glue, and the resulting Parquet file was given a random name. How do I choose the name of the Parquet file that the job writes?

When I add data.parquet to the end of the S3 target path without a trailing '/', AWS Glue creates a subfolder named data.parquet in the bucket instead of using it as the file name, and the new Parquet file is still created with a name like "run-1678983665978-part-block-0-r-00000-snappy.parquet".

Where should I specify a name for the Parquet file?

1 Answer

You can't. The files are named so that they are unique within the job and across multiple runs.
You would need to rename them yourself using a library such as boto3. Bear in mind that an S3 rename is really a copy followed by a delete of the original, so be careful if the files are numerous or large.
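
A rough sketch of that rename step, assuming the job wrote exactly one Parquet part file under the target prefix. The function name `rename_single_parquet` and the bucket, prefix, and key values are hypothetical; only `list_objects_v2`, `copy_object`, and `delete_object` are actual boto3 S3 client calls. The client is passed in as a parameter so the function can be tried out without AWS credentials.

```python
def rename_single_parquet(s3_client, bucket, prefix, target_key):
    """Copy the one Parquet part file under `prefix` to `target_key`,
    then delete the original. Returns the original key.

    Assumption: the Glue job produced a single part file. If it wrote
    several, this raises instead of guessing which one to rename.
    """
    # list_objects_v2 returns at most 1000 keys per call; enough for a sketch.
    response = s3_client.list_objects_v2(Bucket=bucket, Prefix=prefix)
    part_keys = [
        obj["Key"]
        for obj in response.get("Contents", [])
        if obj["Key"].endswith(".parquet") and obj["Key"] != target_key
    ]
    if len(part_keys) != 1:
        raise ValueError(
            f"expected exactly one Parquet file under {prefix!r}, "
            f"found {len(part_keys)}"
        )
    source_key = part_keys[0]

    # S3 has no rename operation: copy to the new key, then delete the old one.
    # copy_object handles objects up to 5 GB; larger objects need a multipart copy.
    s3_client.copy_object(
        Bucket=bucket,
        CopySource={"Bucket": bucket, "Key": source_key},
        Key=target_key,
    )
    s3_client.delete_object(Bucket=bucket, Key=source_key)
    return source_key


# Example call (hypothetical bucket and prefix), run after the Glue job finishes:
#   import boto3
#   rename_single_parquet(boto3.client("s3"), "my-bucket", "output/", "output/data.parquet")
```

Because the copy duplicates the data before the delete, the cost and run time grow with file size, which is the caveat mentioned above.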

AWS (Expert), answered 1 year ago
