bootstrap action python libraries are getting over written by Amazon EMR default libraries

0

I am trying to provision EMR with python libraries like numpy=1.19.5 and pandas. I am doing it via bootstrap script. But some how the bootstrap values are getting over written. To avoid this i have followed this particular https://repost.aws/knowledge-center/emr-install-python-libraries. Now i am using the script as the bootstrap action file. Now the NODEPROVISIONSTATE forever in PENDING state. I have gone through logs /var/log/bootstrap-actions/1/ I don't see any errors or warnings there.

But later cluster got terminated with the below error.

On the master instance (i-****), bootstrap action 1 timed out executing

  • My bad i am able to fix the issue, I have miss read the doc. I am not bootstrap file in Back ground hence getting above error.

preguntada hace un año302 visualizaciones
2 Respuestas
0

Hi,

Thank you for reaching to re:Post with your concerns.

I understand that you were able to resolve the issue after placing the bootstrap action file in the background as mentioned in the documentation - https://repost.aws/knowledge-center/emr-install-python-libraries.

Please feel free to update this post if you are still facing issue related to this.

Thank you!

AWS
INGENIERO DE SOPORTE
respondido hace un año
0

A way to go may be to use the script as a Step in EMR instead of a bootstrap script. I had also an issue with files getting overriden and it helped. Some clues on steps and https://docs.aws.amazon.com/emr/latest/ManagementGuide/emr-add-steps-console.html

Michal
respondido hace 10 meses

No has iniciado sesión. Iniciar sesión para publicar una respuesta.

Una buena respuesta responde claramente a la pregunta, proporciona comentarios constructivos y fomenta el crecimiento profesional en la persona que hace la pregunta.

Pautas para responder preguntas