Glue table and Athena: will a mix of compressed and uncompressed files in S3 break anything?


I have a Glue table that points to a location in S3. I noticed that when Firehose writes data there in Parquet format, the files are not always compressed.

Will this cause problems for the Glue table, the crawler, or Athena when they access the data (both compressed and uncompressed files)?

hRed
asked 6 months ago, 297 views
1 Answer
Accepted Answer

Parquet files are self-describing; it is even possible to use different compression within a single file for different columns/pages. Therefore a mix of compressed and uncompressed files should work, but since it's not the usual case, it's possible that some tool has a bug and assumes a single codec is used.

AWS
EXPERT
answered 6 months ago
