Transfer Files from Google Drive to EC2


Hi all, I am trying to transfer a machine learning model file (500 MB) from Google Drive to EC2 for inference.

I tried different commands, and wget seems to work best, but when I run it, the result is:

    Connecting to drive.google.com (drive.google.com)|172.293.62.138|:443... connected.
    HTTP request sent, awaiting response... 200 OK
    Length: unspecified [text/html]
    Saving to: 'uc?id=1-Dqk6fZzDiFKTqnnQ2yqW48uJk-CPqrB'

    2022-09-20 18:26:40 (37.2 MB/s) - 'uc?id=1-Dqk6fZzDiFKTqnnQ2yqW48uJk-CPqrB' saved [2241]

    [2]- Done wget https://drive.google.com/uc?id=1-Dqk6fZzDiFKTqnnQ2yqW48uJk-CPqrB

But I can't see the file that I need to transfer; instead I see a file named "uc?id=1-Dqk6fZzDiFKTqnnQ2yqW48uJk-CPqrB". Can you please help with that step?

Many thanks in advance

Best

Basem

2 Answers

Accepted Answer

Hi there,

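Take a look at this article, which shows the steps to download a file from Google Drive using wget. You need to make the file publicly shareable first. Pay attention to the later part of the article: it describes special handling for files larger than 100 MB. For large files, Google Drive returns an HTML confirmation page instead of the file itself, which is likely why your wget saved a 2241-byte text/html response rather than the 500 MB model.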

https://medium.com/@acpanjan/download-google-drive-files-using-wget-3c2c025a8b99

Matt-B (AWS, Expert), answered 2 years ago
Chris_G (AWS, Expert), reviewed 2 years ago
  • Thanks for your kind response, Matt! Could you please tell me what needs to be changed? I have a Google Drive link, not a Google Docs link as mentioned in the article. I understand where to change the file ID and the file name, but I am asking specifically about the links included in the command.

    wget --load-cookies /tmp/cookies.txt "https://docs.google.com/uc?export=download&confirm=$(wget --quiet --save-cookies /tmp/cookies.txt --keep-session-cookies --no-check-certificate 'https://docs.google.com/uc?export=download&id=FILEID' -O- | sed -rn 's/.*confirm=([0-9A-Za-z_]+).*/\1\n/p')&id=FILEID" -O FILENAME && rm -rf /tmp/cookies.txt

  • If you look in the comments on the article, there are some other tips. One is to use this command, replacing FILEID and FILENAME:

    wget --no-check-certificate --load-cookies /tmp/cookies.txt "https://drive.google.com/uc?export=download&confirm=true$(wget --quiet --save-cookies /tmp/cookies.txt --keep-session-cookies --no-check-certificate 'https://drive.google.com/uc?export=download&id=FILEID' -O- | sed -rn 's/.*confirm=([0-9A-Za-z_]+).*/\1\n/p')&id=FILEID" -O FILENAME && rm -rf /tmp/cookies.txt
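
On what to substitute in the commands above: FILEID is the identifier embedded in the share link, and FILENAME is whatever local name you want the file saved under. As a rough sketch, assuming Python 3 is available on the instance, the snippet below pulls the ID out of the two link shapes I know of (`/file/d/<ID>/...` and `?id=<ID>`). The helper names `extract_file_id` and `direct_download_url` are illustrative only and do not come from the article.

```python
import re
from urllib.parse import parse_qs, urlparse


def extract_file_id(link: str) -> str:
    """Return the Google Drive file ID found in a share link.

    Handles two link shapes (an assumption, other shapes may exist):
      https://drive.google.com/file/d/<ID>/view?usp=sharing
      https://drive.google.com/uc?id=<ID>
    """
    parsed = urlparse(link)

    # Shape 1: the ID is a path segment after /file/d/
    match = re.search(r"/file/d/([0-9A-Za-z_-]+)", parsed.path)
    if match:
        return match.group(1)

    # Shape 2: the ID is passed as the "id" query parameter
    ids = parse_qs(parsed.query).get("id")
    if ids:
        return ids[0]

    raise ValueError(f"no file ID found in link: {link!r}")


def direct_download_url(file_id: str) -> str:
    """Build the uc?export=download URL used in the wget commands."""
    return f"https://drive.google.com/uc?export=download&id={file_id}"
```

For the link in the question, `extract_file_id("https://drive.google.com/uc?id=1-Dqk6fZzDiFKTqnnQ2yqW48uJk-CPqrB")` gives the value to paste in place of both occurrences of FILEID.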


You may also want to take a look at gdown, available here: https://github.com/wkentaro/gdown. Downloading files can be done either via command line, or from within Python code.
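
A minimal sketch of the Python route, assuming gdown has been installed with `pip install gdown` and the file is shared as "anyone with the link". The function names `download_model` and `looks_like_html` and the output name `model.bin` are hypothetical. The HTML check is there because a 2241-byte text/html result like the one in the question means a web page was saved rather than the model.

```python
def looks_like_html(path: str, probe_bytes: int = 512) -> bool:
    """Heuristic: True if the file starts like an HTML page."""
    with open(path, "rb") as f:
        head = f.read(probe_bytes).lstrip().lower()
    return head.startswith((b"<!doctype html", b"<html"))


def download_model(file_id: str, output: str) -> str:
    """Download a Drive file with gdown and sanity-check the result."""
    # Imported here so the helper above works without gdown installed.
    import gdown  # third-party package: pip install gdown

    url = f"https://drive.google.com/uc?id={file_id}"
    saved = gdown.download(url=url, output=output, quiet=False)
    if saved is None:
        raise RuntimeError("gdown could not download the file; check sharing settings")
    if looks_like_html(saved):
        raise RuntimeError("got an HTML page instead of the model file")
    return saved
```

Usage would look like `download_model("1-Dqk6fZzDiFKTqnnQ2yqW48uJk-CPqrB", "model.bin")`, using the file ID from the question.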

AWS, answered 2 years ago
