Did AWS Redshift support Apache Hudi Time Travel Query?

0

Recently, I started using AWS Redshift to query my Apache Hudi format data. And Hudi's time travel query fit my business well. So I wonder if AWS Redshift support Hudi time travel query?

Nep
已提问 3 个月前313 查看次数
1 回答
0
已接受的回答

Hello,

Please note that currently, Redshift does not Support the 'Time Travel' feature that exists in the Delta Tables.

Further, as you are aware, every time changes are made in the Delta Tables, they create new parquet files which contain the latest data. Thus, we are required to generate a new manifest file in order to get a consistent snapshot of the data stored in the table. That is how the Time Travel functionality works, i.e. it is able to look back through these older parquet files and see the previous values.

Considering the above, we can use a workaround where you can create a new table each time the data changes, that is, once the new parquet files are created and the manifest is updated and you can create an external table at the time using the specific manifest file. This would require that you maintain the copies of all the manifest files as well as all of the parquet files, doing so would allow you to do something similar to Delta Tables time travel but in Redshift

I do hope, that the above workaround can point you in the right direction or provide more information as to how you could possibly implement something similar as well as determine if it would be suitable for your use case.

profile pictureAWS
支持工程师
已回答 3 个月前
profile pictureAWS
专家
已审核 2 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则