Error when running VACUUM to remove expired files for an Iceberg table in Athena

3

Error message: ICEBERG_VACUUM_MORE_RUNS_NEEDED: Removed 1000 files in this round of vacuum, but there are more files remaining. Please run another VACUUM command to process the remaining files

How can I run VACUUM so that it deletes more files, or even all expired files?

asked a year ago · 452 views
3 Answers
0

You can either run VACUUM repeatedly in a loop driven by a Step Functions state machine (https://devopstar.com/2023/07/28/vacuuming-iceberg-with-aws-step-functions/), or run a Spark job that expires snapshots and removes orphan files: https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-format-iceberg.html
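For illustration, the same retry loop can be sketched in Python with boto3 instead of Step Functions. This is a minimal sketch, not a tested production job. It assumes that a VACUUM round that leaves files behind ends in the FAILED state with ICEBERG_VACUUM_MORE_RUNS_NEEDED in its state change reason, as the error in the question suggests. The helper names `make_athena_runner` and `vacuum_until_done`, and the `max_rounds` limit, are hypothetical.

```python
import time

MORE_RUNS_NEEDED = "ICEBERG_VACUUM_MORE_RUNS_NEEDED"


def make_athena_runner(database, output_location, poll_seconds=5):
    """Return a function that runs one Athena query and waits for it to finish."""
    import boto3  # imported lazily so the loop below can be exercised without AWS

    athena = boto3.client("athena")

    def run_query(sql):
        query_id = athena.start_query_execution(
            QueryString=sql,
            QueryExecutionContext={"Database": database},
            ResultConfiguration={"OutputLocation": output_location},
        )["QueryExecutionId"]
        # Poll until the query reaches a terminal state.
        while True:
            status = athena.get_query_execution(QueryExecutionId=query_id)[
                "QueryExecution"
            ]["Status"]
            if status["State"] in ("SUCCEEDED", "FAILED", "CANCELLED"):
                return status["State"], status.get("StateChangeReason", "")
            time.sleep(poll_seconds)

    return run_query


def vacuum_until_done(run_query, table, max_rounds=50):
    """Repeat VACUUM until a round succeeds; return the number of rounds run."""
    for round_number in range(1, max_rounds + 1):
        state, reason = run_query(f"VACUUM {table}")
        if state == "SUCCEEDED":
            return round_number
        # Assumption: "more runs needed" is reported as a failure reason.
        # Any other failure is a real error and should stop the loop.
        if MORE_RUNS_NEEDED not in reason:
            raise RuntimeError(f"VACUUM failed: {state}: {reason}")
    raise RuntimeError(f"Table still not fully vacuumed after {max_rounds} rounds")
```

With placeholder names, a call would look like `vacuum_until_done(make_athena_runner("my_db", "s3://my-bucket/athena-results/"), "my_table")`. The `max_rounds` cap is there so that a table that never finishes does not loop forever.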

answered 9 months ago
  • Are you sure the looping Step Functions approach works? I tried the same thing, but I am still getting the same error.

0

It would be amazing to get an answer from AWS about this topic.

answered 10 months ago
0

Every 30 minutes it reports "ICEBERG_VACUUM_MORE_RUNS_NEEDED: Removed 20000 files in this round of vacuum", but when I calculate my table's metadata size, it is the same before and after the VACUUM run. Can anyone tell me what to do? The Iceberg table has around 4.5 GB of metadata.
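For anyone who wants to repeat the before/after comparison, here is a minimal Python sketch that totals the objects under an S3 prefix. It assumes the table's metadata files live under a `metadata/` prefix inside the table location, which is the usual Iceberg layout but should be checked for your table. The function name `prefix_size` is hypothetical; the client is passed in so that any boto3 S3 client can be used.

```python
def prefix_size(s3_client, bucket, prefix):
    """Return (object_count, total_bytes) for everything under s3://bucket/prefix."""
    count = 0
    total_bytes = 0
    # list_objects_v2 returns at most 1000 keys per call, so use the paginator.
    paginator = s3_client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        for obj in page.get("Contents", []):
            count += 1
            total_bytes += obj["Size"]
    return count, total_bytes
```

With placeholder names, `prefix_size(boto3.client("s3"), "my-bucket", "warehouse/my_table/metadata/")` run before and after a VACUUM round shows whether the number of files or their total size changed. Running it against the `data/` prefix as well gives the same comparison for data files.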

Naresh
answered 3 months ago
