S3 Select ignores rows starting with #


Hi, I have uploaded a CSV file to a bucket with the following contents:

A,B
0,1
#1,2
1,2

I used the query SELECT * FROM s3object. The result dropped the row starting with #:

A,B
0,1
1,2

If I download the file, it still contains the row #1,2. Is this a known limitation? Any help is appreciated.

Ben
Asked 2 months ago, 125 views

1 Answer

Accepted Answer

Yes, this is expected behavior for S3 Select, which lets you run SQL queries directly against objects in S3 without loading them into a database. When it reads CSV input, S3 Select treats any row that begins with the comment character as a comment and skips it, and the default comment character is #. This is controlled by the Comments field of the CSV input serialization settings (CSVInput), which accepts any single character. The object itself is not modified, which is why the downloaded file still contains the row #1,2.

This default follows a common convention in CSV and other text formats, where lines beginning with # are treated as comments and ignored during processing. That is useful for embedding metadata or notes in a file that should not be read as data.
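If the # rows are real data, one possible workaround is to set Comments to a different character. The following is a minimal sketch using boto3's select_object_content, not a tested recipe: the helper names build_select_request and run_select, the bucket and key values, and the choice of ~ as a replacement comment character are all hypothetical, and it assumes no data row in your file begins with ~. FileHeaderInfo is set to NONE here on the assumption that the header row should come back as data, as in the output shown in the question.

```python
def build_select_request(bucket, key, comment_char="~"):
    """Build keyword arguments for select_object_content.

    comment_char replaces the default comment character '#'.
    '~' is only an illustrative choice; pick a character that
    never appears at the start of a row in your data.
    """
    if len(comment_char) != 1:
        raise ValueError("comment_char must be a single character")
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": "SELECT * FROM s3object",
        "InputSerialization": {
            "CSV": {
                # NONE: treat the first line as data, not as a header
                "FileHeaderInfo": "NONE",
                # Rows starting with this character are skipped
                "Comments": comment_char,
            }
        },
        "OutputSerialization": {"CSV": {}},
    }


def run_select(bucket, key, comment_char="~"):
    """Run the query and return the result as text (needs AWS credentials)."""
    import boto3  # imported here so the request builder works without boto3

    s3 = boto3.client("s3")
    response = s3.select_object_content(
        **build_select_request(bucket, key, comment_char)
    )
    chunks = []
    # The response payload is an event stream; record data arrives
    # in 'Records' events as raw bytes.
    for event in response["Payload"]:
        if "Records" in event:
            chunks.append(event["Records"]["Payload"])
    return b"".join(chunks).decode("utf-8")
```

Calling run_select("my-bucket", "data.csv") with your own bucket and key should then return the #1,2 row along with the others, provided the assumption about ~ holds for your data.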


EXPERT
Answered 2 months ago
AWS EXPERT
Reviewed 2 months ago
