S3 Select ignores rows starting with #

0

Hi, I have uploaded a csv file to bucket, contents being

A,B
0,1
#1,2
1,2

I used the query SELECT * FROM s3object The result returned dropped the row starting with #.

A,B
0,1
1,2

If I download the file, the file still contains the row #1,2 Is this a known limitation? Any help is appreciated.

Ben
已提問 2 個月前檢視次數 125 次
1 個回答
2
已接受的答案

Yes, what you're observing is actually an expected behavior when querying CSV files stored in Amazon S3 using SQL with AWS S3 Select or Amazon Athena. These services allow you to run SQL queries directly against files in S3 without needing to load them into a database. However, when working with CSV files, both S3 Select and Athena treat lines that start with # as comments by default. This is why the row starting with # is being skipped in your query results.

The reason for this behavior is based on a common convention in many CSV and text file formats where lines beginning with # are considered as comments and are therefore ignored during processing. This can be particularly useful for including metadata or comments within a file that should not be treated as data.

You can see the example below:

example

profile picture
專家
已回答 2 個月前
profile pictureAWS
專家
已審閱 2 個月前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南