Cannot Install external python packages in AWS Glue spark script?

0

Cannot install python packages like opencv-python(cv2),scikit-image in aws glue script. Tried whl and zip files still cant in the job parameters still facing issues in installing the python packages.Enter image description here
Enter image description here
Enter image description here

Any help would be great and also facing issues in converting list of tuples to rdd using parallelize any help in this regard also would be really great

  • There is not enough information here. How did you attempt to include the packages?

已提問 2 年前檢視次數 585 次
1 個回答
0

Hi,

if you can access internet from your job the easiest way to add external libraries is to follow the steps described at this documentation page; while the page refers to Glue 2.0 it is applicable to Glue 3.0 as well.

Under Job parameters, do the following:
For Key, enter --additional-python-modules.
For Value, enter opencv-python, scikit-image

If your job will execute in a VPC with no internet access (not even via a NAT Gateway) please review this blog post.

hope this helps,

AWS
專家
已回答 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南