Sagemaker Training Input mode as FastFile still downloading

0

I'm trying to train a custom model using the FastFile input mode, as my input data is quite large (about 27 GB). As specified in the Sagemaker SDK doc, I've set the input_mode parameter of my Estimator object to FastFile, and created a small pipeline to expose my hyperparameters (I prefer to use the pipeline in Sagemaker Studio). When I run this pipeline, a training task is created using my custom image and code to train this model. I usually use the ml.g4dn.xlarge instance for training to speed up the process. Finally, after the initialization stage, the training task attempts to download the data from the "folder" on the S3 bucket (where my 27 GB of data is stored). But I clearly specified that I wanted to use FastFile mode, I didn't expect to download the data from my bucket.

So the question is: why is the training job still downloading data from the bucket even though I've enabled FastFile mode?

  • Do you have a large number of files ?

  • Ye, in those 27 GB, I have 266 folders with 4 numpy files each. Each folder is about 100 MB of data.

asked 10 months ago486 views
1 Answer
0

SageMaker Fast file mode streams the data directly from S3 when you access the file. From an usability perspective you will still access the files as if they are on disc and SageMaker makes sure to stream the file from S3 when accessed. For your use case using File Mode which does the full copy rather than streaming will be better approach as the initial copy is much faster for datasets less than 100 GB. Please refer to the below blog to determine the right option for your training

https://aws.amazon.com/blogs/machine-learning/choose-the-best-data-source-for-your-amazon-sagemaker-training-job/

AWS
answered 10 months ago
  • In the short term, I can deal with the default File mode. However, in the long term, I may need the Fast File mode (I didn't reach 100 GB of data yet). I was expecting to be working with a small example of nearly 30 GB, that's why I do not understand why it's not working, especially when I can switch from File to FastFile without changing the code.

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions