Glue Jobs & Multiple tables

A customer needs to ETL multiple tables from RDS into S3 and Redshift.

Let's say they need to combine data from 6 tables to load into S3.

I tried helping them set up the Glue jobs for this process, but it's not clear what the best and most efficient way is to load these tables into S3 or Redshift: when you create a Glue job, you can select only one table as the data source.

Do they need to create a Glue Job for each table or customize the generated Glue jobs to include all tables?

Asked 6 years ago, 5355 views
1 Answer
Accepted Answer

Yes, they need to customize the generated Glue job to include multiple tables and join them. The Glue job creation UI only generates a simple template job with one source and one target, but in practice most jobs need multiple sources, and some need multiple targets as well.

We have Join examples here: https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-python-samples-legislators.html
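
As a rough illustration of what a customized job could look like, here is a minimal sketch that reads several Data Catalog tables, joins them one after another, and writes the result to S3. The database name, table names, join keys, and output path are all hypothetical placeholders and not from the customer's setup; it also assumes the RDS tables have already been crawled into the Glue Data Catalog and that the join keys have different names on each side (otherwise the fields would need renaming first, as in the linked sample).

```python
import sys

# All names below are hypothetical placeholders, not from the original question.
DATABASE = "rds_catalog_db"          # Glue Data Catalog database (assumed)
BASE_TABLE = "orders"                # first table to load (assumed)
OUTPUT_PATH = "s3://example-bucket/joined/"  # target location (assumed)

# Each step: (key in the running result, table to join in, key in that table).
JOIN_STEPS = [
    ("customer_id", "customers", "cust_id"),
    ("product_id", "products", "prod_id"),
]


def join_tables(load, join, base_table, steps):
    """Load base_table, then join each listed table onto the running result."""
    result = load(base_table)
    for left_key, table, right_key in steps:
        result = join(result, load(table), [left_key], [right_key])
    return result


def main():
    # Glue libraries are imported here so the helper above can be
    # exercised outside a Glue environment.
    from awsglue.context import GlueContext
    from awsglue.job import Job
    from awsglue.transforms import Join
    from awsglue.utils import getResolvedOptions
    from pyspark.context import SparkContext

    args = getResolvedOptions(sys.argv, ["JOB_NAME"])
    glue_context = GlueContext(SparkContext.getOrCreate())
    job = Job(glue_context)
    job.init(args["JOB_NAME"], args)

    def load(table):
        # One DynamicFrame per source table, read through the Data Catalog.
        return glue_context.create_dynamic_frame.from_catalog(
            database=DATABASE, table_name=table
        )

    joined = join_tables(load, Join.apply, BASE_TABLE, JOIN_STEPS)

    # Single S3 target; further targets would be additional write calls.
    glue_context.write_dynamic_frame.from_options(
        frame=joined,
        connection_type="s3",
        connection_options={"path": OUTPUT_PATH},
        format="parquet",
    )
    job.commit()


# Glue passes --JOB_NAME when it runs the script; skip main() elsewhere.
if any(arg.startswith("--JOB_NAME") for arg in sys.argv):
    main()
```

Adding more source tables means adding entries to `JOIN_STEPS`. For a Redshift target, a second write with `glue_context.write_dynamic_frame.from_jdbc_conf` could follow the S3 write, assuming a Glue connection to the cluster and a temporary S3 directory are configured.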

AWS
Answered 6 years ago
