AWS Glue Crawler Scalability for Large Number of Delta Tables

Question: We currently have approximately 100 tables in Delta format, partitioned by year, month, day, hour, and minute (yyyy, mm, dd, hh, mm). Our current process reads these Delta tables with a Glue crawler, catalogs them, and exposes them as Redshift Spectrum tables on which we build our business logic.

However, we are hitting scalability limits because of the maximum of 10 tables per crawler. As we continue to add tables, creating additional crawlers becomes cumbersome. Additionally, the data volume on some of these tables is substantial, up to 500k records per hour.
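For context, each crawler is currently configured roughly as follows. This is a minimal sketch using boto3; the crawler name, role ARN, database name, S3 paths, and schedule are placeholders, and the WriteManifest setting is an assumption about how the tables are exposed to Redshift Spectrum.

    import boto3

    glue = boto3.client("glue")

    # One crawler per batch of Delta table paths; the DeltaTables list
    # is what runs into the per-crawler limit described above.
    glue.create_crawler(
        Name="delta-crawler-batch-01",  # placeholder name
        Role="arn:aws:iam::123456789012:role/GlueCrawlerRole",  # placeholder role
        DatabaseName="analytics_delta_db",  # placeholder database
        Targets={
            "DeltaTargets": [
                {
                    "DeltaTables": [
                        "s3://my-datalake/delta/table_001/",
                        "s3://my-datalake/delta/table_002/",
                        # ... up to the per-crawler limit
                    ],
                    # Assumption: symlink manifests are generated so the
                    # tables can be queried from Redshift Spectrum.
                    "WriteManifest": True,
                    "CreateNativeDeltaTable": False,
                }
            ]
        },
        Schedule="cron(15 * * * ? *)",  # hourly, matching the hourly partitions
    )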

Considering these constraints, what would be the optimal approach to crawl the Delta tables in parallel? Can we configure the crawler to use an RDS database for improved scalability? Any insights or best practices would be appreciated.

  • Could you share how you're creating these Delta tables? Where is the source data coming from for these tables?

  • We are creating the Delta tables via Glue ETL; the source data comes from an API. A simplified sketch of the write step is shown after this list.
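To illustrate, the write step of the Glue job looks roughly like the following. This is a minimal sketch assuming a Glue 4.0 job with the --datalake-formats=delta parameter (and the Delta Spark session extension configured); the sample DataFrame, partition column names, and S3 path are placeholders, and the real DataFrame is built from the API response upstream.

    import sys

    from awsglue.context import GlueContext
    from awsglue.job import Job
    from awsglue.utils import getResolvedOptions
    from pyspark.context import SparkContext

    args = getResolvedOptions(sys.argv, ["JOB_NAME"])
    glue_context = GlueContext(SparkContext.getOrCreate())
    spark = glue_context.spark_session
    job = Job(glue_context)
    job.init(args["JOB_NAME"], args)

    # Stand-in for the DataFrame built from the API response upstream.
    # The minute partition column is named "mi" here only to avoid
    # clashing with the month column "mm".
    df = spark.createDataFrame(
        [("evt-001", "2024", "05", "01", "10", "00")],
        ["event_id", "yyyy", "mm", "dd", "hh", "mi"],
    )

    (
        df.write.format("delta")
        .mode("append")
        .partitionBy("yyyy", "mm", "dd", "hh", "mi")
        .save("s3://my-datalake/delta/table_001/")  # placeholder table path
    )

    job.commit()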

Asked 1 month ago · 356 views
No answers
