1回答
- 新しい順
- 投票が多い順
- コメントが多い順
1
You have a couple of options:
- Amazon SageMaker DataWrangler. You can use Databricks as a data source in SageMaker Data Wrangler. This allows you to interactively query the data stored in Databricks using SQL, and preview data before importing it. Once the data is imported you can cleanse, engineer features, and prepare it for training. Please refer to the blog below for more information: https://aws.amazon.com/blogs/machine-learning/prepare-data-from-databricks-for-machine-learning-using-amazon-sagemaker-data-wrangler/
- AWS Glue: Assuming the Delta Lake table are stored in S3 you can build a Glue job to read, transform , and prepare the data for Sagemaker training. More info can be found here: https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-format-delta-lake.html
回答済み 2年前
関連するコンテンツ
- AWS公式更新しました 2年前