Hudi and S3 object versions


We currently use Hudi for upserts via AWS Glue, and S3 versioning is enabled on the bucket Hudi writes to. My question is about previous (noncurrent) object versions: if we delete a previous version of an object, will that affect Hudi operations, or does Hudi only work with the latest version of each object?

asked 2 years ago, 498 views
1 Answer

As I understand it, Hudi works only with the latest version of each S3 object, so deleting an older version of an object shouldn't cause issues with Hudi upserts. In that case, you may want to look into S3 delete markers and into deleting objects by version ID.
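To make the version ID approach concrete, here is a minimal sketch using boto3. It assumes that only noncurrent versions (those with `IsLatest` set to false) should be removed and that the current version of every object stays untouched. The function names, the `dry_run` flag, and the bucket and prefix values are illustrative and not part of Hudi or Glue.

```python
def noncurrent_versions(page):
    """Return Key/VersionId pairs for the noncurrent object versions found in
    one list_object_versions response page. Delete markers are ignored."""
    return [
        {"Key": v["Key"], "VersionId": v["VersionId"]}
        for v in page.get("Versions", [])
        if not v["IsLatest"]
    ]


def delete_noncurrent_versions(bucket, prefix, dry_run=True):
    """Count (and, when dry_run is False, permanently delete) the noncurrent
    versions under a prefix. The latest version of each object is never selected."""
    import boto3  # imported lazily so the helper above works without AWS access

    s3 = boto3.client("s3")
    paginator = s3.get_paginator("list_object_versions")
    total = 0
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        targets = noncurrent_versions(page)
        total += len(targets)
        if targets and not dry_run:
            # Passing a VersionId removes that specific version for good
            # instead of adding a new delete marker.
            s3.delete_objects(
                Bucket=bucket,
                Delete={"Objects": targets, "Quiet": True},
            )
    return total
```

For example, `delete_noncurrent_versions("my-bucket", "hudi/my_table/")` with the default `dry_run=True` only reports how many noncurrent versions would be removed (both names are placeholders). If the goal is ongoing cleanup rather than a one-off, an S3 lifecycle rule that expires noncurrent versions may be a simpler option than running a script.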

answered 2 years ago

