Athena query error

0

Getting this error:

Error opening Hive split s3://commoncrawl/cc-index/table/cc-main/warc/crawl=CC-MAIN-2017-17/subset=crawldiagnostics/part-00189-ac1cf8ef-3644-4b49-ac73-0fe6bef46adf.c000.gz.parquet (offset=0, length=36266581): com.amazonaws.services.s3.model.AmazonS3Exception: Please reduce your request rate. (Service: Amazon S3; Status Code: 503; Error Code: SlowDown; Request ID: J88VWPD7BXAFP01E; S3 Extended Request ID: 89ZPRgZ/qx3n0Gs4zFonvfA50JUPjf9ep5vHxhCKHIFXwVr70vgbnLSL9Ctx22GNikrR+p/3gQU=; Proxy: null), S3 Extended Request ID: 89ZPRgZ/qx3n0Gs4zFonvfA50JUPjf9ep5vHxhCKHIFXwVr70vgbnLSL9Ctx22GNikrR+p/3gQU= This query ran against the "ccindex" database, unless qualified by the query. Please post the error message on our forum or contact customer support with Query Id: 066d4ff2-89ce-4683-a4c0-71e2f45348cb

已提問 2 年前檢視次數 4769 次
1 個回答
0

This is a rate limiting error from S3, your query is probably making too many requests to S3 at the same time. This is usually a sign of your data set being too many small files, tens of thousands or more. Reduce the number of files by combining small files into bigger files.

AWS
Theo
已回答 2 年前
AWS
專家
已審閱 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南