Glue Crawler - skip adding partitions

0

Partitioned data in S3 as below -

some-dataset/2019-12-28/

some-dataset/2019-12-29/

some-dataset/2019-12-30/

some-dataset/2019-12-31/

some-dataset/2020-01-01/

some-dataset/2020-01-02/

some-dataset/2020-01-03/

When running a Glue crawler against the above dataset to create the schema in the catalog, how can we configure the crawler to avoid adding partitions for year 2019 into the table?

If that is not possible, is there a way to totally avoid registering ANY partitions and only create the table schema?

AWS
已提問 4 年前檢視次數 1160 次
1 個回答
0
已接受的答案

You can add excludes to the crawler, e.g. for anything 2019 "**/2019-**" The second question doesn't make any sense to me, if you have a partitioned table you need at least one partition. You can disable the crawler once the table is created with at least one.

profile pictureAWS
專家
已回答 4 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南