Analyzing insights from reviews - Comprehend

Hello,

I'm trying to follow the tutorial "Analyzing insights from reviews" (https://docs.aws.amazon.com/comprehend/latest/dg/tutorial-reviews-tables.html). When I reach step 5, I get this error in Athena: "COLUMN_NOT_FOUND: line 5:3: Column 'file' cannot be resolved or requester is not authorized to access requested resources. You may need to manually clean the data at location 's3://..."

Why do I get this error? Is it related to the crawler?

Thanks

Mar10T
asked 5 months ago, 165 views
1 Answer
The problem was related to the extraction of the tar archive and the fact that AWS Glue is unable to read a tar file. As a solution, I extracted the file from the archive in txt format, and this solved my problem when creating the Athena tables.

"It's important to decompress and unpack the original archives because AWS Glue can’t automatically read data from a tar archive. However, AWS Glue can read from files in Gzip format."
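
For anyone hitting the same error, here is a minimal sketch of the unpacking step using Python's standard tarfile module. The function name extract_archive and the file names in the usage comment are only illustrative assumptions, not something taken from the tutorial; adjust them to your own archive and folder.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path: str, dest_dir: str) -> list:
    """Unpack a .tar or .tar.gz archive and return the extracted file paths.

    Hypothetical helper for illustration; not part of the tutorial.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    extracted = []
    # "r:*" lets tarfile detect the compression (none, gzip, ...) itself.
    with tarfile.open(archive_path, "r:*") as archive:
        for member in archive.getmembers():
            if not member.isfile():
                continue  # skip directories, links, and other special entries
            # Keep only the base name so nothing is written outside dest.
            target = dest / Path(member.name).name
            source = archive.extractfile(member)
            with source, open(target, "wb") as out:
                out.write(source.read())
            extracted.append(str(target))
    return extracted


# Example call (assumed file names):
# extract_archive("output.tar.gz", "extracted")
```

After unpacking, the extracted files (not the archive) would need to be in the S3 location that the crawler scans, and the crawler would need to be run again so the table schema picks up the columns.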

Mar10T
answered 5 months ago