Questions on Data lake using DMS

A customer is implementing a data lake, and I am assessing DMS for pulling data into S3 from an on-premises MS SQL Server database. As part of this, I would like to understand the following points about DMS:

  1. Other users of DMS: is it widely used?
  2. Examples of using DMS with SQL Server Change Tracking
  3. Volumetrics: how much data can it handle?
  4. Running over VPN: any issues that you're aware of?
  5. Known issues/limitations of DMS (big binary objects, etc.)
  6. What ports/access (accounts) does it require on the on-premises SQL Server?
  7. What is the overhead on the on-premises SQL Server?
1 Answer
Accepted Answer
  1. Other users of DMS - widely used? Yes. More than 100,000 databases have been migrated to AWS using DMS so far.

  2. Volumetrics - how much can it handle? So far, I have not seen a workload that DMS could not keep up with.

  3. Running over VPN - any issues that you're aware of? No, except that when using CDC, the CDC change tables can grow larger if there is not enough bandwidth to send the changes to the DMS instance in the cloud.

  4. Known issues/limitations of DMS (big binary objects etc)? They are all documented here: https://docs.aws.amazon.com/dms/latest/userguide/CHAP_Source.SQLServer.html

  5. What ports/access (accounts) does it require for the on prem SQL server? TCP port 1433 (the SQL Server default), plus an account with sysadmin access to read the logs and CDC tables.

  6. What is the overhead on the on prem SQL server? It depends. Transactional replication adds more information to the transaction log, so the logs will be bigger. CDC tables take more space in the customer database and add some SQL Server Agent jobs to do cleanup.
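
As a quick sanity check for the port requirement in point 5, here is a minimal Python sketch that tests whether TCP port 1433 is reachable across the VPN. It is not part of DMS; the function name `can_reach_sql_server` and the hostname in the usage example are hypothetical. It only confirms TCP reachability, not that the login or its permissions are correct, and it is most useful when run from a host in the same network as the DMS replication instance.

```python
import socket


def can_reach_sql_server(host: str, port: int = 1433, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    # Hypothetical on-premises host name; replace with the real server address.
    host = "sqlserver.example.internal"
    print(f"{host}:1433 reachable: {can_reach_sql_server(host)}")
```

If this returns False, the usual suspects are firewall rules, VPN routing, or SQL Server listening on a non-default port.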

AWS
answered 5 years ago
