Incorporate SageMaker Data Wrangler into SageMaker Pipelines

Hi all, is it possible to incorporate SageMaker Data Wrangler as a step in SageMaker Pipelines, so that every time the pipeline is triggered, it runs the Data Wrangler job first and then starts the SageMaker training process?

2 Answers
Yes, it is possible to incorporate SageMaker Data Wrangler as a step in SageMaker Pipelines.

  • SageMaker Pipelines lets you define and execute a sequence of ML workflow steps, such as data preprocessing, model training, and evaluation.
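  • SageMaker Data Wrangler is used to prepare and transform data. A set of transformations is saved as a flow (a .flow file), which can be run as a processing job or exported to a notebook.
  • To add a Data Wrangler step to a pipeline, define a SageMaker Processing step that runs the flow in the Data Wrangler container. Data Wrangler can also export a flow to a notebook that creates such a pipeline.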
  • This processing job can be configured as the first step in the Pipeline. It will run the data wrangling tasks to preprocess the data.
  • The output of the Data Wrangler processing job can then be used as input to the subsequent training step in the Pipeline.
  • The result is an automated end-to-end ML workflow: data preparation with Data Wrangler, followed by model training and evaluation, all orchestrated by SageMaker Pipelines.
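
The step ordering described above can be sketched as a pipeline definition document: a Processing step that runs the flow, followed by a Training step that reads the processing output. This is only a minimal sketch using the standard library. The role ARN, bucket paths, image URIs, step names, instance types, and the output name are hypothetical placeholders, and a real Data Wrangler job typically needs additional inputs and container arguments that depend on the flow.

```python
import json

# Hypothetical placeholders: replace with values from your own account.
ROLE_ARN = "arn:aws:iam::111122223333:role/ExamplePipelineRole"
FLOW_S3_URI = "s3://example-bucket/flows/example.flow"
WRANGLED_S3_URI = "s3://example-bucket/wrangled/"
MODEL_S3_URI = "s3://example-bucket/models/"
DATA_WRANGLER_IMAGE = "<data-wrangler-container-image-uri>"
TRAINING_IMAGE = "<training-container-image-uri>"

PROCESS_STEP = "DataWranglerProcess"
# Placeholder: for a real flow, the output name has to match the flow's output node.
OUTPUT_NAME = "wrangled"


def build_pipeline_definition() -> dict:
    """Return a two-step definition: Data Wrangler processing, then training."""
    processing_step = {
        "Name": PROCESS_STEP,
        "Type": "Processing",
        "Arguments": {
            "RoleArn": ROLE_ARN,
            "AppSpecification": {"ImageUri": DATA_WRANGLER_IMAGE},
            "ProcessingResources": {
                "ClusterConfig": {
                    "InstanceCount": 1,
                    "InstanceType": "ml.m5.4xlarge",
                    "VolumeSizeInGB": 30,
                }
            },
            # The .flow file is passed to the container as a processing input.
            "ProcessingInputs": [{
                "InputName": "flow",
                "S3Input": {
                    "S3Uri": FLOW_S3_URI,
                    "LocalPath": "/opt/ml/processing/flow",
                    "S3DataType": "S3Prefix",
                    "S3InputMode": "File",
                },
            }],
            "ProcessingOutputConfig": {
                "Outputs": [{
                    "OutputName": OUTPUT_NAME,
                    "S3Output": {
                        "S3Uri": WRANGLED_S3_URI,
                        "LocalPath": "/opt/ml/processing/output",
                        "S3UploadMode": "EndOfJob",
                    },
                }]
            },
        },
    }

    # Reference to the processing step's output location, resolved at run time.
    wrangled_uri = {
        "Get": f"Steps.{PROCESS_STEP}.ProcessingOutputConfig"
               f".Outputs['{OUTPUT_NAME}'].S3Output.S3Uri"
    }

    training_step = {
        "Name": "TrainModel",
        "Type": "Training",
        "DependsOn": [PROCESS_STEP],  # explicit ordering: wrangle first, then train
        "Arguments": {
            "RoleArn": ROLE_ARN,
            "AlgorithmSpecification": {
                "TrainingImage": TRAINING_IMAGE,
                "TrainingInputMode": "File",
            },
            "InputDataConfig": [{
                "ChannelName": "train",
                "DataSource": {"S3DataSource": {
                    "S3DataType": "S3Prefix",
                    "S3Uri": wrangled_uri,
                    "S3DataDistributionType": "FullyReplicated",
                }},
            }],
            "OutputDataConfig": {"S3OutputPath": MODEL_S3_URI},
            "ResourceConfig": {
                "InstanceCount": 1,
                "InstanceType": "ml.m5.xlarge",
                "VolumeSizeInGB": 30,
            },
            "StoppingCondition": {"MaxRuntimeInSeconds": 3600},
        },
    }

    return {"Version": "2020-12-01", "Steps": [processing_step, training_step]}


if __name__ == "__main__":
    print(json.dumps(build_pipeline_definition(), indent=2))
```

The resulting JSON string is the kind of document that can be supplied as the pipeline definition when creating a pipeline. In practice, the notebook exported from Data Wrangler or the SageMaker Python SDK's `ProcessingStep` and `TrainingStep` classes would usually generate this for you, including the flow-specific details omitted here.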
Expert
answered 2 months ago
