Got error when running VACUUM to remove expired files for Iceberg in Athena

3

Error message: ICEBERG_VACUUM_MORE_RUNS_NEEDED: Removed 1000 files in this round of vacuum, but there are more files remaining. Please run another VACUUM command to process the remaining files

How can I run VACUUM to delete more files, or even all expired files?

Asked 1 year ago · 452 views
3 Answers
0

You can either run a Step Functions state machine that loops and runs VACUUM again and again (https://devopstar.com/2023/07/28/vacuuming-iceberg-with-aws-step-functions/), or you can run a Spark job that expires snapshots and removes orphan files: https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-format-iceberg.html
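As a rough sketch of the "VACUUM again and again" idea without Step Functions, the loop below re-submits VACUUM through the Athena API until a round succeeds. It assumes `athena` is a boto3 Athena client and that the ICEBERG_VACUUM_MORE_RUNS_NEEDED text shows up in the query's `StateChangeReason`. The function name, the `max_rounds` cap and the polling interval are illustrative choices, not anything prescribed by Athena.

```python
import time

# Marker taken from the error message quoted in the question.
MORE_RUNS_MARKER = "ICEBERG_VACUUM_MORE_RUNS_NEEDED"


def vacuum_until_done(athena, table, database, output_location,
                      max_rounds=50, poll_seconds=5.0):
    """Run VACUUM repeatedly; return the number of rounds that were needed.

    `athena` is expected to behave like boto3.client("athena").
    Assumption: a round that hit the per-run file limit ends in state FAILED
    with the marker above in Status.StateChangeReason.
    """
    for round_no in range(1, max_rounds + 1):
        query_id = athena.start_query_execution(
            QueryString=f"VACUUM {table}",
            QueryExecutionContext={"Database": database},
            ResultConfiguration={"OutputLocation": output_location},
        )["QueryExecutionId"]

        # Poll until the query reaches a terminal state.
        while True:
            status = athena.get_query_execution(
                QueryExecutionId=query_id
            )["QueryExecution"]["Status"]
            if status["State"] in ("SUCCEEDED", "FAILED", "CANCELLED"):
                break
            time.sleep(poll_seconds)

        if status["State"] == "SUCCEEDED":
            return round_no  # nothing left to remove in this pass

        reason = status.get("StateChangeReason", "")
        if MORE_RUNS_MARKER not in reason:
            # Any other failure is a real error; do not retry blindly.
            raise RuntimeError(f"VACUUM failed in round {round_no}: {reason}")

    raise RuntimeError(f"VACUUM still incomplete after {max_rounds} rounds")
```

With real credentials this could be called as `vacuum_until_done(boto3.client("athena"), "my_table", "my_db", "s3://my-bucket/athena-results/")`, where the table, database and bucket names are placeholders.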

Answered 9 months ago
  • Are you sure the looping Step Functions approach works? I tried the same thing, but I am still getting the same error.
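For the Spark alternative mentioned in the answer, a minimal sketch could call Iceberg's `expire_snapshots` and `remove_orphan_files` procedures from PySpark. It assumes `spark` is a SparkSession already configured with an Iceberg catalog (the catalog name, table name, retention values and helper names here are hypothetical) and that the Iceberg Spark SQL extensions are enabled so that `CALL` statements work.

```python
from datetime import datetime, timedelta, timezone


def build_maintenance_sql(catalog, table, max_age_days=5, retain_last=1, now=None):
    """Build the two CALL statements for snapshot expiry and orphan cleanup.

    The timestamp literal is interpreted in the Spark session time zone;
    this sketch formats it from a UTC clock.
    """
    now = now or datetime.now(timezone.utc)
    cutoff = (now - timedelta(days=max_age_days)).strftime("%Y-%m-%d %H:%M:%S")
    return [
        # Drop snapshots older than the cutoff, keeping at least `retain_last`.
        f"CALL {catalog}.system.expire_snapshots("
        f"table => '{table}', older_than => TIMESTAMP '{cutoff}', "
        f"retain_last => {retain_last})",
        # Delete files under the table location that no metadata references.
        f"CALL {catalog}.system.remove_orphan_files("
        f"table => '{table}', older_than => TIMESTAMP '{cutoff}')",
    ]


def run_maintenance(spark, catalog, table, **kwargs):
    """Execute both procedures and return their result rows."""
    return [spark.sql(stmt).collect()
            for stmt in build_maintenance_sql(catalog, table, **kwargs)]
```

In a Glue or EMR job this might be invoked as `run_maintenance(spark, "glue_catalog", "my_db.my_table")`. Unlike Athena's VACUUM, such a job is not cut off after a fixed number of files per run, though a large backlog can still take a long time.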

0

It would be amazing to get an answer from AWS about this topic.

Answered 1 year ago
0

Every 30 minutes it says "ICEBERG_VACUUM_MORE_RUNS_NEEDED: Removed 20000 files in this round of vacuum", but when I calculate my table's metadata size, it is unchanged before and after the VACUUM run. Can anyone tell me what to do? We have around 4.5 GB of metadata in the Iceberg table.
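One way to make the before/after comparison concrete is to total the objects under the table's metadata prefix in S3. The sketch below assumes `s3` is a boto3 S3 client and that the metadata lives under a prefix such as `warehouse/my_db/my_table/metadata/` (both the helper name and that path are hypothetical).

```python
def prefix_size(s3, bucket, prefix):
    """Return (object_count, total_bytes) for everything under an S3 prefix.

    `s3` is expected to behave like boto3.client("s3").
    """
    paginator = s3.get_paginator("list_objects_v2")
    count = 0
    total = 0
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # Pages for an empty prefix carry no "Contents" key.
        for obj in page.get("Contents", []):
            count += 1
            total += obj["Size"]
    return count, total
```

Running it on the `metadata/` and `data/` prefixes separately, before and after a VACUUM round, shows whether the removed files were data files or metadata files, since a round can delete many files without shrinking the metadata folder.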

Naresh
Answered 3 months ago
