- Newest
- Most votes
- Most comments
Hello,
Thankyou for using AWS Sagemaker !!
From the description I see that you are using the PySparkProcessor for data preprocessing using Sagemaker and the requirements is to install custom packages in all the nodes while using multi-instance setup. Unfortunately, at the moment its is not possible to install the custom packages on container provided by Sagemaker as it’s a managed service.
However, if you want to extend/add custom packages you can use bring your own container. Please refer the below links for Build Your Own Processing Container :
[+] Build Your Own Processing Container (Advanced Scenario) - https://docs.aws.amazon.com/sagemaker/latest/dg/build-your-own-processing-container.html
[+] Build Your Own Processing Container (Advanced Scenario) - Run Your Processing Container Using the SageMaker Python SDK - https://docs.aws.amazon.com/sagemaker/latest/dg/build-your-own-processing-container.html#byoc-run
If you experience any difficulty in implementing the above solution, I would recommend you please reach out to AWS Support[1] (Sagemaker), along with your issue/use case in detail and share relevant AWS resource names. We will be more than happy to assist you.
Hope this helps!
Relevant content
- asked a year ago
- asked 5 months ago
- AWS OFFICIALUpdated a month ago
- AWS OFFICIALUpdated 2 months ago
- AWS OFFICIALUpdated 2 months ago
- AWS OFFICIALUpdated 2 years ago