1 Answer
Once the FailStep is reached, the execution fails and the error message is set as the failure reason. More specifically, the step first fails the pipeline execution, which results in a waiter timeout, and then records the message you provided as the failure reason in the metadata of that execution.
This failure reason field is available when you call the DescribePipelineExecution API. See the response syntax described at https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_DescribePipelineExecution.html#API_DescribePipelineExecution_ResponseSyntax
answered a year ago
The describe method does not return any specific reason for why the pipeline execution failed. The FailureReason field only has this value: "Step failure: One or multiple steps failed." There is no information about which step failed or why. Where, then, is the metadata you mentioned that contains the error messages raised by the pipeline's execution at runtime?
It seems the step's failure reason is not set as the pipeline's failure reason. Could you please try the list-pipeline-execution-steps API (https://docs.aws.amazon.com/cli/latest/reference/sagemaker/list-pipeline-execution-steps.html) to see whether the failed step shows the provided error message?
Hi, yes, I can confirm that. I was actually about to post an answer to my own question after finding out that the list_steps function does return the execution metadata for every step in the pipeline, including the status and, in case of failure, the error message. Thank you.
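For anyone landing here later, the approach confirmed above can be sketched roughly as follows with boto3. This is only a sketch under some assumptions: the helper names `fetch_steps` and `summarize_failures` are made up for illustration, the execution ARN is something you supply, and the field names (`PipelineExecutionSteps`, `StepStatus`, `FailureReason`, `Metadata` with a `Fail` entry holding `ErrorMessage`) are taken from my reading of the ListPipelineExecutionSteps response shape, so please check them against the API reference.

```python
def fetch_steps(execution_arn):
    """Collect all step records for one pipeline execution (hypothetical helper)."""
    import boto3  # imported here so the pure helper below works without AWS access

    client = boto3.client("sagemaker")
    steps, token = [], None
    while True:
        kwargs = {"PipelineExecutionArn": execution_arn}
        if token:
            kwargs["NextToken"] = token
        resp = client.list_pipeline_execution_steps(**kwargs)
        steps.extend(resp.get("PipelineExecutionSteps", []))
        token = resp.get("NextToken")
        if not token:
            return steps


def summarize_failures(steps):
    """Keep only failed steps and pull out their failure details (hypothetical helper)."""
    failures = []
    for step in steps:
        if step.get("StepStatus") != "Failed":
            continue
        # Assumption: a FailStep exposes its message under Metadata -> Fail -> ErrorMessage.
        fail_meta = step.get("Metadata", {}).get("Fail", {})
        failures.append(
            {
                "step": step.get("StepName"),
                "reason": step.get("FailureReason"),
                "error_message": fail_meta.get("ErrorMessage"),
            }
        )
    return failures
```

Usage would be something like `summarize_failures(fetch_steps(execution_arn))`, with `execution_arn` being the ARN of the failed execution. Keeping the filtering separate from the API call makes it easy to try out on a saved response without touching AWS.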