"RuntimeError: Caught NoCredentialsError in worker process 6" during AWS Batch job using mfive for a specific S3 location

0

Getting the following error when using a specific s3 location for aws batch job using mfive, any suggestions on how to fix this issue ? I do not get error when using a different s3 folder

1691588920754,"[0]:  0%|          | 1/109375 [00:20<634:18:01, 20.88s/it][0]:"
1691588922757,"[0]:  0%|          | 2/109375 [00:22<297:56:44,  9.81s/it][0]:"
1691588924759,"[0]:  0%|          | 3/109375 [00:24<189:49:13,  6.25s/it][0]:"
1691588926861,"[0]:  0%|          | 4/109375 [00:26<139:00:13,  4.58s/it][0]:"
1691588928864,"[0]:  0%|          | 5/109375 [00:28<110:52:11,  3.65s/it][0]:"
1691588929865,"[0]:  0%|          | 6/109375 [00:30<93:59:31,  3.09s/it] [7]:╭───────────────────── Traceback (most recent call last) ──────────────────────╮"
1691588929865,[7]:│ /workspace/mfive/mfive/train.py:193 in <module>                              │
1691588929865,[7]:╰──────────────────────────────────────────────────────────────────────────────╯
1691588929865,[7]:RuntimeError: Caught NoCredentialsError in worker process 6.

Full Error log: https://us-east-1.console.aws.amazon.com/cloudwatch/home?region=us-east-1#logsV2:log-groups/log-group/$252Faws$252Fbatch$252Fjob/log-events/pretrain_stage1-AFM-lk7pxtc4krm56$252Fdefault$252F8e935fb1636d4b36b33a84cb06fd58de$3Fstart$3D-3600000

Link to S3: s3://search-m5-users/sirswe/datasets/blip2_datasets/LAION115M_synthetic_json_96M/

gefragt vor 9 Monaten65 Aufrufe
Keine Antworten

Du bist nicht angemeldet. Anmelden um eine Antwort zu veröffentlichen.

Eine gute Antwort beantwortet die Frage klar, gibt konstruktives Feedback und fördert die berufliche Weiterentwicklung des Fragenstellers.

Richtlinien für die Beantwortung von Fragen

Relevanter Inhalt