This issue where only column headers appear in Athena after using a Glue crawler to process CSV files from S3 is a common problem that can occur for several reasons:
- Crawler configuration: A crawler does not load data anywhere. It infers a schema and registers a table in the Glue Data Catalog that points at an S3 location, and Athena reads the files from that location at query time. Double-check the crawler settings, in particular that the resulting table's location is the folder (prefix) holding the CSV files rather than an individual file.
- File format issues: Ensure that your CSV files are in a format supported by AWS Glue. Inconsistent delimiters, quote characters, or file encoding can cause problems.
- Permissions: Verify that the IAM role assigned to the Glue crawler has the necessary permissions to access and read the data in your S3 bucket.
- Data location: Make sure the S3 path specified in the crawler configuration matches the actual location of your data files.
- Multiple runs: In some cases the crawler needs to be re-run, for example after the files or its settings change, before the Glue Data Catalog reflects the current data.
- File structure: If your CSV files have a complex structure or nested data, the crawler might have difficulty interpreting it correctly.
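As a rough illustration of the configuration and data-location checks above, the following sketch inspects a table definition for two common warning signs. It assumes the dictionary has the shape of the "Table" entry returned by boto3's `glue.get_table`; the function name `check_table_definition`, the bucket name, and the file-extension heuristic are hypothetical and only for illustration.

```python
from urllib.parse import urlparse


def check_table_definition(table: dict) -> list[str]:
    """Return human-readable warnings for a Glue table definition.

    `table` is assumed to look like the "Table" entry of the response from
    boto3's glue.get_table(DatabaseName=..., Name=...).
    """
    warnings = []
    storage = table.get("StorageDescriptor", {})
    location = storage.get("Location", "")

    parsed = urlparse(location)
    if parsed.scheme != "s3" or not parsed.netloc:
        warnings.append(f"Location is not an S3 URI: {location!r}")
    elif parsed.path.lower().endswith((".csv", ".txt", ".gz")):
        # Heuristic (an assumption): a location that ends in a file extension
        # probably names a single object. Athena expects a folder (prefix),
        # and a file path typically yields zero rows.
        warnings.append(f"Location points at a file, not a folder: {location!r}")

    if not storage.get("Columns"):
        warnings.append("No columns were registered for this table.")

    return warnings


# Hypothetical example input; the bucket and key are placeholders.
example = {
    "StorageDescriptor": {
        "Location": "s3://example-bucket/data/test.csv",
        "Columns": [{"Name": "age", "Type": "bigint"}],
    }
}
for message in check_table_definition(example):
    print(message)
```

With real data, the dictionary could come from `boto3.client("glue").get_table(DatabaseName=..., Name=...)["Table"]`; an empty result from the check does not prove the table is correct, it only rules out these two symptoms.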
To resolve this issue:
- Review and adjust your crawler configuration if necessary.
- Check the format and structure of your CSV files.
- Verify IAM permissions for the Glue crawler.
- Ensure the correct S3 path is specified in the crawler settings.
- Try running the crawler multiple times.
- Review the Glue crawler logs for any error messages or warnings that might provide additional insights.
If the problem persists, you may need to consider using alternative methods, such as creating a custom classifier or manually defining the table schema in the Glue Data Catalog.
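If you go the manual route, one option is to run a `CREATE EXTERNAL TABLE` statement in Athena. The sketch below only builds such a statement as a string. The helper name `build_create_table_ddl`, the database, table, and bucket names, and the column types are assumptions for illustration; adjust them to your data.

```python
def build_create_table_ddl(table, columns, location, delimiter=","):
    """Build an Athena CREATE EXTERNAL TABLE statement for headered CSV files.

    `columns` is a list of (name, type) pairs. `location` should be the S3
    folder (prefix) that contains the files, not a single file.
    """
    if not location.startswith("s3://") or not location.endswith("/"):
        raise ValueError("location should be an S3 folder URI ending in '/'")

    # Backticks keep column names such as `date` from clashing with keywords.
    column_sql = ",\n  ".join(f"`{name}` {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n  {column_sql}\n)\n"
        "ROW FORMAT DELIMITED\n"
        f"  FIELDS TERMINATED BY '{delimiter}'\n"
        "STORED AS TEXTFILE\n"
        f"LOCATION '{location}'\n"
        # Tell Athena to skip the header line of each file.
        "TBLPROPERTIES ('skip.header.line.count'='1')"
    )


# Hypothetical names; dates are kept as strings here as an assumption,
# since values like 11/01/2002 are not in ISO (YYYY-MM-DD) form.
ddl = build_create_table_ddl(
    table="example_db.people",
    columns=[("age", "int"), ("date", "string"), ("name", "string")],
    location="s3://example-bucket/csv-data/",
)
print(ddl)
```

The resulting text can be pasted into the Athena query editor. Defining the table by hand bypasses the crawler's schema inference, so the column list has to be kept in sync with the files yourself.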
Hello.
Can you view crawled CSV data on the Glue console?
Also, would it be possible for you to share the CSV sample you are using?

Hi Riku, thank you for the quick response. I can view all the tables on the Glue console. However, when I click on a table's name and look at its data, it says "no statistics" under column statistics. I feel this suggests the issue is with the Glue crawler rather than with the transfer to Athena. The data I am working with is an Excel file that was saved as a CSV (cp932), imported into Python, and then exported as a CSV in UTF-8. I have a highly simplified example data set, without Japanese characters, that I will share with you. It also shows only the column headers, like the rest of my data, which suggests the issue is not with the encoding. Any ideas would be greatly appreciated!
Here is the very simple CSV file I am using as a test:
Age,Date,Name
1,11/01/2002,Dan
5,22/03/2001,Sam
3,23/05/2003,Ben
4,11/01/2002,Johm
3,22/03/2001,Harry
2,23/05/2003,Luke
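For anyone who wants to rule out the file itself, here is a minimal sketch that checks a CSV like the one above for a valid encoding, a byte-order mark, and rows whose field count differs from the header. The function name `summarize_csv` is hypothetical, and the sample is embedded with one record per line, which is an assumption about how the file is laid out on disk.

```python
import csv
import io

# The sample from the post, assumed to be one record per line.
SAMPLE = """Age,Date,Name
1,11/01/2002,Dan
5,22/03/2001,Sam
3,23/05/2003,Ben
4,11/01/2002,Johm
3,22/03/2001,Harry
2,23/05/2003,Luke
"""


def summarize_csv(raw: bytes, encoding: str = "utf-8") -> dict:
    """Summarize basic structural facts about CSV bytes.

    Raises UnicodeDecodeError if the bytes are not valid in `encoding`.
    """
    has_bom = raw.startswith(b"\xef\xbb\xbf")
    text = raw.decode(encoding)
    # Skip completely empty lines so a trailing newline does not count as a row.
    rows = [row for row in csv.reader(io.StringIO(text, newline="")) if row]
    if not rows:
        return {"header": [], "data_rows": 0, "ragged_rows": [], "has_bom": has_bom}

    header, data = rows[0], rows[1:]
    # Positions (header = 1) of rows whose field count differs from the header.
    ragged = [i for i, row in enumerate(data, start=2) if len(row) != len(header)]
    return {
        "header": header,
        "data_rows": len(data),
        "ragged_rows": ragged,
        "has_bom": has_bom,
    }


print(summarize_csv(SAMPLE.encode("utf-8")))
```

To check a real file, pass its bytes instead, for example `summarize_csv(open("test.csv", "rb").read())` with your own path. If the summary shows the expected header, a non-zero row count, and no ragged rows, the file is probably fine and the table definition (location, classifier, SerDe) is the more likely place to look.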