ClientError: AlgorithmError: framework error: Traceback (most recent call last): File "/miniconda3/lib/python3.7/site-packages/sagemaker_containers/_trainer.py", line 84, in train entrypoint() File "/miniconda3/lib/python3.7/site-packages/sagemaker_sklearn_container/training.py", line 39, in main train(environment.Environment()) File "/miniconda3/lib/python3.7/site-packages/sagemaker_sklearn_container/training.py", line 35, in train runner_type=runner.ProcessRunnerType) File "/miniconda3/lib/python3.7/site-packages/sagemaker_training/entry_point.py", line 100, in run wait, capture_error File "/miniconda3/lib/python3.7/site-packages/sagemaker_training/process.py", line 291, in run cwd=environment.code_dir, File "/miniconda3/lib/python3.7/site-packages/sagemaker_training/process.py", line 208, in check_error info=extra_info, sagemaker_training.errors.ExecuteUserScriptError: ExecuteUserScriptError: ExitCode 1 ErrorMessage "" Command "/bin/sh -c ./_repack_script_launcher.sh --dependencies
where can i find the script? the register-RepackModel is automatically created in the pipeline by sagemaker after check-model-accuracy?
From the error logs you've provided, the system is unable to find the
_repack_script_launcher.sh
script, which should be a part of the SageMaker training job. There's also an error indicating that the filemodel.tar.gz
cannot be found in the expected directory, and there's a failure to parse a hyperparameter.The
model.tar.gz
file is not found in the directory/opt/ml/input/data/training/
. Try to check that the file is being generated correctly and is being placed in the correct directory.Check this issue in Github
Which OS are you using for your Sagemaker? Make sure you are using a supported OS