Incorporate SageMaker Data Wrangler into SageMaker Pipelines

Hi all, is it possible to incorporate SageMaker Data Wrangler as a step in SageMaker Pipelines, so that every time the pipeline is triggered, it runs a Data Wrangler job first and then starts the SageMaker training process?

AWS
Asked 2 months ago, 136 views
2 Answers

Yes, it is possible to incorporate SageMaker Data Wrangler as a step in SageMaker Pipelines.

  • SageMaker Pipelines lets you define and run a sequence of ML workflow steps, such as data preprocessing, model training, and evaluation.
  • SageMaker Data Wrangler prepares and transforms data. You define the transformations as a flow (a .flow file), which can be run as a SageMaker Processing job or exported to a notebook.
  • To add a Data Wrangler step to a pipeline, define a SageMaker Processing step that runs the Data Wrangler flow.
  • Configure this processing step as the first step in the pipeline, so that it preprocesses the data before any other step runs.
  • The output of the Data Wrangler processing job can then be used as the input to the subsequent training step.
  • This automates the end-to-end workflow: data preparation with Data Wrangler, followed by model training and evaluation in the same pipeline.
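
To make the ordering concrete, here is a rough sketch of what such a two-step pipeline definition could look like, built as plain JSON with only the Python standard library. Treat it as an illustration under assumptions rather than a working configuration: the role ARN, bucket paths, image URIs, instance types, step names, and the output name "wrangled" are all hypothetical placeholders, and the `Get` reference format reflects my understanding of the pipeline definition schema. In practice the SageMaker Python SDK (`ProcessingStep`, `TrainingStep`, `Pipeline`) generates this kind of definition for you.

```python
import json

# Every value below is a hypothetical placeholder; replace with your own.
ROLE_ARN = "arn:aws:iam::111122223333:role/ExampleSageMakerRole"
FLOW_S3_URI = "s3://example-bucket/flows/example.flow"
DATA_WRANGLER_IMAGE_URI = "<region-specific-data-wrangler-image-uri>"
TRAINING_IMAGE_URI = "<training-image-uri>"
OUTPUT_NAME = "wrangled"  # assumed name for the processing output
PROCESSING_STEP_NAME = "DataWranglerProcessing"


def build_pipeline_definition() -> dict:
    """Return a pipeline definition: Data Wrangler processing, then training."""
    processing_step = {
        "Name": PROCESSING_STEP_NAME,
        "Type": "Processing",
        "Arguments": {
            "RoleArn": ROLE_ARN,
            "AppSpecification": {"ImageUri": DATA_WRANGLER_IMAGE_URI},
            "ProcessingResources": {
                "ClusterConfig": {
                    "InstanceCount": 1,
                    "InstanceType": "ml.m5.4xlarge",
                    "VolumeSizeInGB": 30,
                }
            },
            # The exported .flow file is passed to the job as an input.
            "ProcessingInputs": [
                {
                    "InputName": "flow",
                    "S3Input": {
                        "S3Uri": FLOW_S3_URI,
                        "LocalPath": "/opt/ml/processing/flow",
                        "S3DataType": "S3Prefix",
                        "S3InputMode": "File",
                    },
                }
            ],
            "ProcessingOutputConfig": {
                "Outputs": [
                    {
                        "OutputName": OUTPUT_NAME,
                        "S3Output": {
                            "S3Uri": "s3://example-bucket/wrangled/",
                            "LocalPath": "/opt/ml/processing/output",
                            "S3UploadMode": "EndOfJob",
                        },
                    }
                ]
            },
        },
    }

    # Reference to the processing step's output location, resolved at run time.
    wrangled_uri = {
        "Get": f"Steps.{PROCESSING_STEP_NAME}.ProcessingOutputConfig"
               f".Outputs['{OUTPUT_NAME}'].S3Output.S3Uri"
    }

    training_step = {
        "Name": "ModelTraining",
        "Type": "Training",
        # Explicit ordering: training waits for the Data Wrangler step.
        "DependsOn": [PROCESSING_STEP_NAME],
        "Arguments": {
            "RoleArn": ROLE_ARN,
            "AlgorithmSpecification": {
                "TrainingImage": TRAINING_IMAGE_URI,
                "TrainingInputMode": "File",
            },
            "InputDataConfig": [
                {
                    "ChannelName": "train",
                    "ContentType": "text/csv",
                    "DataSource": {
                        "S3DataSource": {
                            "S3DataType": "S3Prefix",
                            "S3Uri": wrangled_uri,
                            "S3DataDistributionType": "FullyReplicated",
                        }
                    },
                }
            ],
            "OutputDataConfig": {"S3OutputPath": "s3://example-bucket/models/"},
            "ResourceConfig": {
                "InstanceCount": 1,
                "InstanceType": "ml.m5.xlarge",
                "VolumeSizeInGB": 30,
            },
            "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
        },
    }

    return {"Version": "2020-12-01", "Steps": [processing_step, training_step]}


if __name__ == "__main__":
    print(json.dumps(build_pipeline_definition(), indent=2))
```

The resulting JSON string could then be registered with the boto3 SageMaker client's `create_pipeline` call (via its `PipelineDefinition` parameter) and run with `start_pipeline_execution`. Each execution would then run the Data Wrangler processing step before training.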
EXPERT
Answered 2 months ago
