"RuntimeError: Caught NoCredentialsError in worker process 6" during AWS Batch job using mfive for a specific S3 location

I get the following error when running an AWS Batch job with mfive against a specific S3 location. The same job runs without error when it points at a different S3 folder. Any suggestions on how to fix this?

1691588920754,"[0]:  0%|          | 1/109375 [00:20<634:18:01, 20.88s/it][0]:"
1691588922757,"[0]:  0%|          | 2/109375 [00:22<297:56:44,  9.81s/it][0]:"
1691588924759,"[0]:  0%|          | 3/109375 [00:24<189:49:13,  6.25s/it][0]:"
1691588926861,"[0]:  0%|          | 4/109375 [00:26<139:00:13,  4.58s/it][0]:"
1691588928864,"[0]:  0%|          | 5/109375 [00:28<110:52:11,  3.65s/it][0]:"
1691588929865,"[0]:  0%|          | 6/109375 [00:30<93:59:31,  3.09s/it] [7]:╭───────────────────── Traceback (most recent call last) ──────────────────────╮"
1691588929865,[7]:│ /workspace/mfive/mfive/train.py:193 in <module>                              │
1691588929865,[7]:╰──────────────────────────────────────────────────────────────────────────────╯
1691588929865,[7]:RuntimeError: Caught NoCredentialsError in worker process 6.

Full error log: https://us-east-1.console.aws.amazon.com/cloudwatch/home?region=us-east-1#logsV2:log-groups/log-group/$252Faws$252Fbatch$252Fjob/log-events/pretrain_stage1-AFM-lk7pxtc4krm56$252Fdefault$252F8e935fb1636d4b36b33a84cb06fd58de$3Fstart$3D-3600000

Link to S3: s3://search-m5-users/sirswe/datasets/blip2_datasets/LAION115M_synthetic_json_96M/
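
To narrow this down, here is a minimal diagnostic sketch that could be run inside the same Batch container, ideally from the same kind of worker process that fails. It assumes the data loading goes through boto3/botocore (NoCredentialsError is a botocore exception name), which the log does not confirm. The helper names `split_s3_uri` and `check_s3_access` are hypothetical and not part of mfive.

```python
from urllib.parse import urlparse


def split_s3_uri(uri):
    """Split 's3://bucket/some/prefix/' into ('bucket', 'some/prefix/')."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3" or not parsed.netloc:
        raise ValueError(f"not an S3 URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


def check_s3_access(uri):
    """Return a short status string for the credentials visible to this process."""
    # Assumption: boto3 is installed in the job image. It is imported lazily
    # so that split_s3_uri can be used without it.
    import boto3
    from botocore.exceptions import ClientError, NoCredentialsError

    bucket, prefix = split_s3_uri(uri)
    session = boto3.Session()
    # get_credentials() returns None when no provider in the chain yields credentials.
    if session.get_credentials() is None:
        return "no credentials resolved in this process"
    try:
        # Listing a single key is enough to show whether the prefix is readable.
        resp = session.client("s3").list_objects_v2(
            Bucket=bucket, Prefix=prefix, MaxKeys=1
        )
    except NoCredentialsError:
        return "credentials lookup failed at request time"
    except ClientError as err:
        return f"request rejected: {err.response['Error']['Code']}"
    return f"ok, {resp.get('KeyCount', 0)} key(s) visible under prefix"


if __name__ == "__main__":
    print(check_s3_access(
        "s3://search-m5-users/sirswe/datasets/blip2_datasets/LAION115M_synthetic_json_96M/"
    ))
```

Running it against both the failing prefix and the working folder would show whether the difference is in credential resolution or in access to this particular prefix.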

asked 8 months ago, 61 views
No Answers
