EMR Serverless Application Disk Space?

0

When configuring an EMR Serverless application, you can choose the disk size for your preinitialized capacity and a maximum disk limit for the application:

Disk size

However, at least for Hive jobs, EMR Serverless stores both the scratch and warehouse directories on S3. So what is this configured disk space used for? I've searched the documentation and there's no mention of how this space is used, or why it's needed.

Patrick
demandé il y a 2 ans1108 vues
1 réponse
1
Réponse acceptée

EMR Serverless uses disk size for the following purposes.

  1. Storing the log files ( Hive, Tez AM logs, Tez Task logs etc)
  2. Shuffle data is stored on the local disk.
  3. Resources such as aux libraries, user jar etc.

Hope it answers your question.

AWS
répondu il y a un an
AWS
INGÉNIEUR EN ASSISTANCE TECHNIQUE
vérifié il y a 23 jours
  • This does answer the question, thanks. But we needed this answer 3 months ago. We've since moved our company's infrastructure to a new cloud provider.

Vous n'êtes pas connecté. Se connecter pour publier une réponse.

Une bonne réponse répond clairement à la question, contient des commentaires constructifs et encourage le développement professionnel de la personne qui pose la question.

Instructions pour répondre aux questions