Cannot install external Python packages in an AWS Glue Spark script?


I cannot install Python packages such as opencv-python (cv2) and scikit-image in an AWS Glue script. I tried supplying .whl and .zip files in the job parameters, but the packages still fail to install.

Any help would be great. I am also having trouble converting a list of tuples to an RDD using parallelize, so any help with that would be appreciated as well.

  • There is not enough information here. How did you attempt to include the packages?

Asked 2 years ago · 585 views
1 Answer

Hi,

If your job can access the internet, the easiest way to add external libraries is to follow the steps described in this documentation page. Although the page refers to Glue 2.0, it applies to Glue 3.0 as well.

Under Job parameters, do the following:
For Key, enter --additional-python-modules.
For Value, enter opencv-python,scikit-image (a comma-separated list of module names).
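
If you prefer to set the parameter from code rather than the console, the same key and value can be passed as job arguments. The following is only a minimal sketch: the helper names `build_glue_arguments` and `start_run_with_modules` and the job name `my-glue-job` are hypothetical, and it assumes boto3 is installed and AWS credentials are already configured.

```python
def build_glue_arguments(modules):
    """Build the job-parameter dict that asks Glue to pip-install `modules`.

    Hypothetical helper: it only joins the module names into the single
    comma-separated string used as the --additional-python-modules value.
    """
    cleaned = [m.strip() for m in modules if m.strip()]
    return {"--additional-python-modules": ",".join(cleaned)}


def start_run_with_modules(job_name, modules):
    """Start one run of an existing Glue job with the extra modules.

    Assumes a job called `job_name` already exists in the default region.
    """
    import boto3  # imported lazily so build_glue_arguments works without boto3

    glue = boto3.client("glue")
    response = glue.start_job_run(
        JobName=job_name,
        Arguments=build_glue_arguments(modules),
    )
    return response["JobRunId"]


if __name__ == "__main__":
    # Prints the parameter dict only; no AWS call is made here.
    print(build_glue_arguments(["opencv-python", "scikit-image"]))
    # To actually start a run (hypothetical job name):
    # start_run_with_modules("my-glue-job", ["opencv-python", "scikit-image"])
```

Arguments passed to `start_job_run` apply to that run only. To make the setting permanent, add the same key and value to the job's default parameters in the console as described above. Once the modules are installed, the script imports them under their usual names, `cv2` and `skimage`.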

If your job runs in a VPC with no internet access (not even through a NAT gateway), please review this blog post.

Hope this helps,

AWS
EXPERT
Answered 2 years ago
