Difference between Amazon Glue and Amazon EMR

0

Hello,

Please share the difference between AWS Glue and AWS EMR and which one we should use and when?

Thanks,

Monica
질문됨 6달 전1212회 조회
1개 답변
2
수락된 답변

Hi, AWS Glue is a serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources. And Amazon EMR (previously called Amazon Elastic MapReduce) is a managed cluster platform that simplifies running big data frameworks, such as Apache Hadoop and Apache Spark, on AWS to process and analyze vast amounts of data.

AWS Glue Supporting Apache Spark and Amazon EMR serverless availability is what makes the overlapping between each other. Always remember that what you may recommend should depend on the user persona and use case.

From a recommendation point of view:

  • AWS Glue is our recommended service for Data Integration workloads and ETL from legacy platforms such as Informatica, Talend etc.
  • Amazon EMR is our recommended service for Big Data workloads that are traditionally run on Hadoop.

Use Amazon EMR:

  • Hadoop Migration from on-prem or other cloud providers, including Databricks migration
  • Customer has expertise beyond just Spark, for ex. Hive, Presto, Trino
  • Customer is skilled in loading their own data source connector libraries for their jobs.

Use AWS Glue:

  • Customer prefers built-in capabilities: connectors, transformations, incremental load, job monitoring, orchestration.
  • Customer wants visual and code ETL development tools
  • Migration from ETL providers such as Informatica, Talend, Matillion
profile pictureAWS
Arifc
답변함 6달 전
AWS
지원 엔지니어
검토됨 25일 전
profile picture
전문가
검토됨 2달 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠