Unable to run iceberg insert in hive deployed on EMR

0

Hello,

I configured iceberg formatted table with transaction in hive on EMR 6.4.1. When I insert data into the table, the operation get stuck, without any error.

Any insights are highly appreciated. Thanks

hive> insert into iceberg_test1 values(1);
Automatically selecting local only mode for query
Query ID = hadoop_20240318143559_03034f6a-a277-4e77-80e5-c19f68b41cbb
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks is set to 0 since there's no reduce operator
Job running in-process (local Hadoop)
2024-03-18 14:36:01,110 Stage-2 map = 100%, reduce = 0%
Mark
已提問 2 個月前檢視次數 417 次
1 個回答
4
已接受的答案

Hello,

I presume your issue might relies on any of the below considerations,

1. Only the MR (MapReduce) execution engine is supported for DML (data manipulation language) operations, and MR is deprecated in Hive 3.1.3.

2. AWS Glue Data Catalog is not currently supported for Iceberg with Hive.

Please check your hive engine as Tez is the default engine in EMR 6.4.1 and make sure it is mr. Also check if you are not using glue as metastore. It requires external metastore to interact with iceberg and transaction tables.

References:

[1] - https://docs.aws.amazon.com/emr/latest/ReleaseGuide/hive-considerations.html

AWS
支援工程師
已回答 2 個月前
profile picture
專家
已審閱 2 個月前
profile picture
專家
已審閱 2 個月前
  • Excellent, when I checked the execution engine to mr, it started working. Thanks a lot💪

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南