Questions on Data lake using DMS

0

A customer is implementing a data lake and i am assessing using DMS for pulling data into s3 from on an premise MS SQL DB. As part of this i would like to understand the following points on DMS:

  1. Other users of DMS - widely used?
  2. Examples of using DMS with SQL Server Change Tracking
  3. Volumetrics - how much can it handle
  4. Running over VPN - any issues that you're aware of?
  5. Known issues/limitations of DMS (big binary objects etc)
  6. What ports/access (accounts) does it require for the on prem SQL server
  7. What is the overhead on the on prem SQL server?
1 Risposta
1
Risposta accettata
  1. Other users of DMS - widely used? yes, > 100,000 moved to aws using DMS so far

  2. Volumetrics - how much can it handle? Never hit a limit of not keeping up yet.

  3. Running over VPN - any issues that you're aware of? No, other than if using CDC then the CDC change tables could get larger if there’s not enough bandwidth to send the data to DMS instance in the cloud.

  4. Known issues/limitations of DMS (big binary objects etc)? All documented: https://docs.aws.amazon.com/dms/latest/userguide/CHAP_Source.SQLServer.html

  5. What ports/access (accounts) does it require for the on prem SQL server? 1433, sysadmin access to read the logs and CDC tables

  6. What is the overhead on the on prem SQL server? Depends. Transactional repl will add more info to the transaction log, will produce bigger logs. CDC tables will take more space in the customer database, and add some sqlagent jobs to do cleanup.

AWS
con risposta 5 anni fa

Accesso non effettuato. Accedi per postare una risposta.

Una buona risposta soddisfa chiaramente la domanda, fornisce un feedback costruttivo e incoraggia la crescita professionale del richiedente.

Linee guida per rispondere alle domande