SSIS Best Practices with Amazon Redshift


A customer uses SSIS in their SQL Server environment to perform ETL. They would like to move their data warehouse to Amazon Redshift.

What are the best practices for integrating Redshift into an SSIS workflow?

AWS
Asked 5 years ago, viewed 755 times
1 answer
Accepted answer

Customers coming from SQL Server might first start with an Amazon Redshift pilot:

  1. Create their first Redshift cluster, along with supporting AWS resources such as Amazon VPC subnet groups and Amazon S3 buckets.
  2. Load some sample data and get familiar with Redshift fundamentals (the COPY command, zone maps, etc.).
  3. Hook up their favorite BI tool to check for compatibility.

As they progress, they can bulk load a copy of the data from their existing SQL Server database using AWS DMS. This lets them experiment with their existing data model, see how it performs in Redshift, and find out which data types and queries need to change (especially things like PIVOTs).
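As an illustration of the kind of query change mentioned above, one portable way to rewrite a T-SQL PIVOT is conditional aggregation (SUM over CASE expressions). The sketch below only generates the SQL text; the table and column names (`sales`, `region`, `quarter`, `amount`) are made up for the example, and it assumes the pivot values are trusted literals that also work as column aliases once lowercased.

```python
def pivot_as_conditional_aggregation(table, group_col, pivot_col, value_col, pivot_values):
    """Build a SELECT that emulates a SUM-based PIVOT with CASE expressions.

    Assumption: all arguments are trusted identifiers/literals (no escaping is done).
    """
    select_parts = [group_col]
    for value in pivot_values:
        # No ELSE branch: a group with no matching rows yields NULL, like PIVOT does.
        select_parts.append(
            f"SUM(CASE WHEN {pivot_col} = '{value}' THEN {value_col} END) AS {value.lower()}"
        )
    columns = ",\n    ".join(select_parts)
    return f"SELECT\n    {columns}\nFROM {table}\nGROUP BY {group_col};"


# Hypothetical table and columns, for illustration only.
sql = pivot_as_conditional_aggregation(
    table="sales",
    group_col="region",
    pivot_col="quarter",
    value_col="amount",
    pivot_values=["Q1", "Q2", "Q3", "Q4"],
)
print(sql)
```

The generated statement uses only standard SQL, so it can be run against both SQL Server and Redshift to compare results during the pilot.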

Finally, when they are ready to start integrating their data sources directly into Redshift, they might find they don't need SSIS at all and can explore cloud-native services such as AWS Glue, or AWS Marketplace offerings like Matillion. If there is a hard requirement for SSIS, I would recommend a third-party plugin from ZappySys. It handles the Redshift loading best practices (e.g. staging data in S3, using COPY commands on multiple files in S3, etc.).
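For anyone curious what that loading pattern looks like outside of a plugin, here is a rough sketch: split the extract into several compressed files, stage them under one S3 key prefix, then issue a single COPY against that prefix so Redshift can load the files in parallel. The bucket, prefix, table, and IAM role ARN are placeholders, and the upload step is left out (it would typically be done with an S3 client); this is not how ZappySys implements it, just the general idea.

```python
import csv
import gzip
import io


def split_into_gzip_parts(rows, num_parts):
    """Distribute rows round-robin into num_parts gzip-compressed CSV payloads."""
    buckets = [[] for _ in range(num_parts)]
    for index, row in enumerate(rows):
        buckets[index % num_parts].append(row)

    parts = []
    for bucket_rows in buckets:
        text = io.StringIO()
        csv.writer(text).writerows(bucket_rows)
        parts.append(gzip.compress(text.getvalue().encode("utf-8")))
    return parts


def build_copy_statement(table, bucket, prefix, iam_role_arn):
    """Build one COPY that loads every object sharing the S3 key prefix."""
    return (
        f"COPY {table} "
        f"FROM 's3://{bucket}/{prefix}' "
        f"IAM_ROLE '{iam_role_arn}' "
        "FORMAT AS CSV GZIP;"
    )


# Sample rows standing in for an extract from SQL Server.
rows = [(i, f"customer-{i}") for i in range(10)]
parts = split_into_gzip_parts(rows, num_parts=4)

# Each part would be uploaded as e.g. staging/customers/part-0000.csv.gz (not shown).
copy_sql = build_copy_statement(
    table="public.customers",                                      # placeholder
    bucket="my-staging-bucket",                                    # placeholder
    prefix="staging/customers/part-",                              # placeholder
    iam_role_arn="arn:aws:iam::123456789012:role/MyRedshiftRole",  # placeholder
)
print(copy_sql)
```

A common rule of thumb is to make the number of files a multiple of the number of slices in the cluster so the load is spread evenly.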

Hope this helps!

AWS
Answered 5 years ago
