S3 Select ignores rows starting with #

0

Hi, I have uploaded a csv file to bucket, contents being

A,B
0,1
#1,2
1,2

I used the query SELECT * FROM s3object The result returned dropped the row starting with #.

A,B
0,1
1,2

If I download the file, the file still contains the row #1,2 Is this a known limitation? Any help is appreciated.

Ben
posta 2 mesi fa125 visualizzazioni
1 Risposta
2
Risposta accettata

Yes, what you're observing is actually an expected behavior when querying CSV files stored in Amazon S3 using SQL with AWS S3 Select or Amazon Athena. These services allow you to run SQL queries directly against files in S3 without needing to load them into a database. However, when working with CSV files, both S3 Select and Athena treat lines that start with # as comments by default. This is why the row starting with # is being skipped in your query results.

The reason for this behavior is based on a common convention in many CSV and text file formats where lines beginning with # are considered as comments and are therefore ignored during processing. This can be particularly useful for including metadata or comments within a file that should not be treated as data.

You can see the example below:

example

profile picture
ESPERTO
con risposta 2 mesi fa
profile pictureAWS
ESPERTO
verificato 2 mesi fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande