Using Glue Crawler to crawl parquet files

0

Created a crawler to crawl parquet files residing in S3. What I've learnt is that parquet format is in-built in crawler so there's no explicit configuration needed. However, I do not see any tables created when I run the crawler, which runs successfully. Has anyone done this and is there any special configuration needed for this?

Pankaj
已提問 8 個月前檢視次數 1034 次
1 個回答
1

Based on what's described here, it seems there is no error in cloudwatch logs for crawler.

Can you please make sure that there are no access denied errors. Also see if the role attached to crawler has access to that s3 path and s3 bucket KMS key if SSE-KMS CMK is the bucket encryption. Can you create a new role with required permissions and attach it to crawler and see if behavior changes. Verify that there are no explicit deny policies at bucket or KMS key level.

Also, I'd see if there are any already existing tables, which related to this data(files) in same database, as in that case also, you may not see new tables created.

Comment here how it goes, happy to assist further.

Abhishek

profile pictureAWS
專家
已回答 8 個月前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南