Maximum number of workers vs glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors

0

I have mentioned 4 as the Maximum number of workers when defining the Glue Spark Job of G1X worker type . If I check the Cloudwatch Job monitors , I see some these below metrics touching 20 + in the line graph .

  1. glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors
  2. glue.driver.ExecutorAllocationManager.executors.numberAllExecutors

My understanding is, for G1X worker type , there is only 1 executor per worker . Can someone explain if I am reading the chart right or if there is something missing ?

This is the line graph for 3 jobs , each having just 4 workers configured. Enter image description here

질문됨 2년 전259회 조회
1개 답변
0
수락된 답변

Please note that the metric- glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors does not depend upon the number of workers configured by you for the job. It is dynamically calculated as the job progresses. Generally this metric has a higher value at the beginning of the job. It just shows the maximum number of executors that are required in current moment to satisfy the workload. This is a spark related metric and it is calculated by making use of number of partitions, number of tasks per executor.

Please refer this documentation for more details regarding various metrics of a Glue job.

You can have a look at this documentation which shows how various calculations involving executors.

profile pictureAWS
지원 엔지니어
Chaitu
답변함 2년 전
AWS
전문가
검토됨 2년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인