HIVE_INVALID_METADATA: Hive metadata for table region_us_east_2 is invalid: Table descriptor contains duplicate column

0

I have SSM/S3 "Resource Data Sync" enabled . We are using this data in Athena Query however soe some query we get error "HIVE_INVALID_METADATA: Hive metadata for table region_us_east_2 is invalid: Table descriptor contains duplicate column" . Some query works.

Example - If I query "billiginfo" from table it works however if I query "Instanceinfo' or "region" it does not work.

已提问 2 年前240 查看次数
1 回答
0

When you create an Inventory Resource Data Sync, it create a Glue database and the Amazon Athena queries are run against this database tables.

(-) https://docs.aws.amazon.com/systems-manager/latest/userguide/systems-manager-inventory-query.html

The "Table descriptor contains duplicate column" error indicates that there is a duplicate column name in Glue table schema.

(-) https://repost.aws/knowledge-center/athena-hive-invalid-metadata-duplicate

In order to fix the issue,

  1. Navigate to AWS Glue console --> Databases where you will find database with a name similar to the Resource Data Sync name
  2. Inside the database, find the table which is giving the error while querying through Athena (For Eg : AWS:InstanceInformation - Look at the "Location" column in the database)
  3. Click "Edit Schema" for the particular table and review the column names. You will find any one column being repeated twice.
  4. Delete one duplicate column
  5. Re-run the Athena query
AWS
支持工程师
Aamir_H
已回答 1 个月前

您未登录。 登录 发布回答。

一个好的回答可以清楚地解答问题和提供建设性反馈,并能促进提问者的职业发展。

回答问题的准则