Glue Table and Athena: What if some files are compressed and others are not in S3, will it break anything?

0

I have a glue table specified that refers to a location in S3. I noticed that when Firehose puts data there in parquet format it might be not always compressed.

Will it cause issues for Glue table, crawler or Athena to access the data (both compressed and not compressed)?

hRed
preguntada hace 6 meses297 visualizaciones
1 Respuesta
0
Respuesta aceptada

parquet files are self descriptive, it's even possible to use different compression inside a file for different columns/pages. Therefore it should work, but since it's not the normal case, it's possible that some tool has a bug and assume a single code is used.

profile pictureAWS
EXPERTO
respondido hace 6 meses

No has iniciado sesión. Iniciar sesión para publicar una respuesta.

Una buena respuesta responde claramente a la pregunta, proporciona comentarios constructivos y fomenta el crecimiento profesional en la persona que hace la pregunta.

Pautas para responder preguntas