Cannot install external Python packages in an AWS Glue Spark script


I cannot install Python packages such as opencv-python (cv2) and scikit-image in an AWS Glue script. I tried supplying both .whl and .zip files through the job parameters, but the packages still fail to install.

Any help would be great. I am also having trouble converting a list of tuples to an RDD using parallelize, so any help with that would be appreciated as well.

  • There is not enough information here. How did you attempt to include the packages?

asked 2 years ago, 585 views
1 Answer

Hi,

If your job can access the internet, the easiest way to add external libraries is to follow the steps described on this documentation page. Although the page refers to Glue 2.0, it applies to Glue 3.0 as well.

Under Job parameters, do the following:
For Key, enter --additional-python-modules.
For Value, enter a comma-separated list of modules, for example opencv-python,scikit-image
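
If you start the job programmatically rather than from the console, the same --additional-python-modules key can be passed as a job run argument. The following is only a minimal sketch, assuming boto3 is installed and AWS credentials are configured; the helper names build_glue_arguments and start_job and the job name my-glue-job are hypothetical and not part of the Glue API.

```python
def build_glue_arguments(modules):
    """Build the job-argument dict for a list of pip module names.

    Hypothetical helper: joins the names into one comma-separated
    string, dropping surrounding whitespace and empty entries.
    """
    cleaned = [m.strip() for m in modules if m.strip()]
    return {"--additional-python-modules": ",".join(cleaned)}


def start_job(job_name, modules):
    """Start a Glue job run with the extra modules (hypothetical helper)."""
    import boto3  # imported lazily so the helper above works without boto3

    glue = boto3.client("glue")
    response = glue.start_job_run(
        JobName=job_name,
        Arguments=build_glue_arguments(modules),
    )
    return response["JobRunId"]


if __name__ == "__main__":
    # Show the arguments that would be sent; no AWS call is made here.
    print(build_glue_arguments(["opencv-python", "scikit-image"]))
    # To actually start a run (assumes a job named "my-glue-job" exists):
    # start_job("my-glue-job", ["opencv-python", "scikit-image"])
```

Arguments passed this way apply to that single run; setting the key under Job parameters, as described above, makes it the default for every run of the job.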

If your job runs in a VPC with no internet access (not even through a NAT gateway), please review this blog post.

Hope this helps,

AWS
EXPERT
answered 2 years ago
