Hello,
As you might be aware, the two table formats have different metadata and data structures, so choosing the right one depends on the individual use case. If you want to make the Hudi dataset Iceberg-compatible, you can use Spark to read the Hudi table into a DataFrame and write it out as an Iceberg table on S3. Alternatively, you can use a CTAS statement to do the conversion in SQL (note that CTAS also rewrites the data files, so it is not a copy-free conversion). Please also note that the version history will be lost after the conversion. A couple of external references for your review are below:
There is this doc on Delta-to-Iceberg conversion, which might apply to Hudi as well.
There is also an external solution available for this conversion (I have no personal experience with it).
In addition, you can use Trino on EMR to read the Hudi tables, which is more performance-oriented.
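For illustration, the Spark and CTAS routes mentioned above could look roughly like the sketch below. This is only a minimal sketch under assumptions: it presumes a Spark session that already has the Hudi and Iceberg runtime packages and an Iceberg catalog configured, and the function names, the S3 path, and the table names (`glue_catalog.db.orders_iceberg`, `hudi_db.orders`) are hypothetical placeholders, not anything from your setup.

```python
from typing import List


def non_hudi_columns(columns: List[str]) -> List[str]:
    """Return the columns that are not Hudi bookkeeping fields.

    Hudi adds metadata columns such as _hoodie_commit_time and
    _hoodie_record_key; they have no meaning in an Iceberg table.
    """
    return [c for c in columns if not c.startswith("_hoodie_")]


def build_ctas_sql(source_table: str, target_table: str) -> str:
    """Build a Spark SQL CTAS statement that creates target_table as Iceberg."""
    return (
        f"CREATE TABLE {target_table} USING iceberg "
        f"AS SELECT * FROM {source_table}"
    )


def copy_hudi_to_iceberg(spark, hudi_path: str, target_table: str) -> None:
    """Read the latest Hudi snapshot and write it out as a new Iceberg table.

    Assumes `spark` is a SparkSession with Hudi and Iceberg configured.
    Only the current snapshot is copied; the Hudi commit history is not.
    """
    df = spark.read.format("hudi").load(hudi_path)
    df = df.select(non_hudi_columns(df.columns))
    # DataFrameWriterV2 API (Spark 3.1+): create the table in the Iceberg catalog.
    df.writeTo(target_table).using("iceberg").create()


# Hypothetical usage, with placeholder path and table names:
#   copy_hudi_to_iceberg(spark, "s3://my-bucket/hudi/orders/",
#                        "glue_catalog.db.orders_iceberg")
# or, for the CTAS route:
#   spark.sql(build_ctas_sql("hudi_db.orders", "glue_catalog.db.orders_iceberg"))
```

Either way the data files are rewritten in full, so it is probably worth trying this on a small table first and comparing row counts before switching over.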