2 個答案
- 最新
- 最多得票
- 最多評論
0
The fastest way is to run workloads inside Glue ETL jobs, which is Spark.
You can connect SageMaker Notebooks with Glue Dev Endpoints. Then take that code out and put them into Glue ETL jobs.
Inside Glue, the Glue ETL library (DynamicFrames, etc.) will all be available.
Otherwise, the customer can also consider running EMR on EKS and connect with EMR Studio. However, that requires knowledge of managing EKS clusters and managing Fargate in EKS.
已回答 3 年前
0
EMR serverless has just been announced at re:Invent, adding this as another option: https://aws.amazon.com/blogs/big-data/announcing-amazon-emr-serverless-preview-run-big-data-applications-without-managing-servers/ (Note: its currently still in preview.)
已回答 2 年前
相關內容
- AWS 官方已更新 1 年前