1개 답변
- 최신
- 최다 투표
- 가장 많은 댓글
0
It really depends on how your data is structured. If it's 1 GB file, then it's going to not benefit from Glue being able to fan out. If it's 1024 1MB files, then you're going to see the benefits. Also, it will depend on the block size of the Parquet to allow for optimal I/O (See tip #5 here https://aws.amazon.com/blogs/big-data/top-10-performance-tuning-tips-for-amazon-athena/).
I could only find some information on how to tune your DPUs optimally. The example given was 428 Gzipped JSON files converting to parquet files.
https://docs.aws.amazon.com/glue/latest/dg/monitor-debug-capacity.html
답변함 5년 전
관련 콘텐츠
- AWS 공식업데이트됨 2년 전
- AWS 공식업데이트됨 3년 전
- AWS 공식업데이트됨 일 년 전