Analyzing insights from reviews -comprehend

0

Hello,

I'm trying to do this tutorial about "Analyzing insights from reviews"(https://docs.aws.amazon.com/comprehend/latest/dg/tutorial-reviews-tables.html). When I arrive at step 5 I get this error in Athena : COLUMN_NOT_FOUND: line 5:3: Column 'file' cannot be resolved or requester is not authorized to access requested resources. You may need to manually clean the data at location 's3://..."

Why do I get this error ? is it related to the crawler?

Thanks

Mar10T
preguntada hace 5 meses175 visualizaciones
1 Respuesta
1

The problem was related to the extraction of the tar archive and the fact that Glue is unable to read it a tar file. As a solution I extracted the file from the archive as a txt format and this solved my problem when creating the Athena tables.

" It's important to decompress and unpack the original archives because AWS Glue can’t automatically read data from a tar archive. However, AWS Glue can read from files in Gzip format."

Mar10T
respondido hace 5 meses

No has iniciado sesión. Iniciar sesión para publicar una respuesta.

Una buena respuesta responde claramente a la pregunta, proporciona comentarios constructivos y fomenta el crecimiento profesional en la persona que hace la pregunta.

Pautas para responder preguntas