Incorporate SageMaker Data Wrangler into SageMaker Pipelines

0

Hi all, is it possible to incorporate SageMaker Data Wrangler as a step in SageMaker Pipelines? So that every time the SageMaker Pipelines gets triggered, it starts with sagemaker data wrangler job first before triggering sagemaker training process next?

AWS
preguntada hace 2 meses136 visualizaciones
2 Respuestas
0

Yes, it is possible to incorporate SageMaker Data Wrangler as a step in SageMaker Pipelines.

  • SageMaker Pipelines allows you to define and execute a sequence of ML workflow steps such as data preprocessing, model training, evaluation etc.
  • SageMaker Data Wrangler can be used to prepare and transform data. It provides notebooks, flows and processing jobs capabilities.
  • To add a Data Wrangler step in a Pipeline, you can define a SageMaker Processing job that runs a Data Wrangler notebook or flow.
  • This processing job can be configured as the first step in the Pipeline. It will run the data wrangling tasks to preprocess the data.
  • The output of the Data Wrangler processing job can then be used as input to the subsequent training step in the Pipeline.
  • This allows automated execution of end-to-end ML workflows with data preparation via Data Wrangler followed by model training/evaluation via Pipelines.
profile picture
EXPERTO
respondido hace 2 meses
0
profile pictureAWS
respondido hace 2 meses

No has iniciado sesión. Iniciar sesión para publicar una respuesta.

Una buena respuesta responde claramente a la pregunta, proporciona comentarios constructivos y fomenta el crecimiento profesional en la persona que hace la pregunta.

Pautas para responder preguntas