EMR Serverless Application Disk Space?

0

When configuring an EMR Serverless application, you can choose the disk size for your preinitialized capacity and a maximum disk limit for the application:

Disk size

However, at least for Hive jobs, EMR Serverless stores both the scratch and warehouse directories on S3. So what is this configured disk space used for? I've searched the documentation and there's no mention of how this space is used, or why it's needed.

Patrick
질문됨 2년 전1107회 조회
1개 답변
1
수락된 답변

EMR Serverless uses disk size for the following purposes.

  1. Storing the log files ( Hive, Tez AM logs, Tez Task logs etc)
  2. Shuffle data is stored on the local disk.
  3. Resources such as aux libraries, user jar etc.

Hope it answers your question.

AWS
답변함 일 년 전
AWS
지원 엔지니어
검토됨 23일 전
  • This does answer the question, thanks. But we needed this answer 3 months ago. We've since moved our company's infrastructure to a new cloud provider.

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠