Maximum number of workers vs glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors

0

I have mentioned 4 as the Maximum number of workers when defining the Glue Spark Job of G1X worker type . If I check the Cloudwatch Job monitors , I see some these below metrics touching 20 + in the line graph .

  1. glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors
  2. glue.driver.ExecutorAllocationManager.executors.numberAllExecutors

My understanding is, for G1X worker type , there is only 1 executor per worker . Can someone explain if I am reading the chart right or if there is something missing ?

This is the line graph for 3 jobs , each having just 4 workers configured. Enter image description here

已提問 2 年前檢視次數 259 次
1 個回答
0
已接受的答案

Please note that the metric- glue.driver.ExecutorAllocationManager.executors.numberMaxNeededExecutors does not depend upon the number of workers configured by you for the job. It is dynamically calculated as the job progresses. Generally this metric has a higher value at the beginning of the job. It just shows the maximum number of executors that are required in current moment to satisfy the workload. This is a spark related metric and it is calculated by making use of number of partitions, number of tasks per executor.

Please refer this documentation for more details regarding various metrics of a Glue job.

You can have a look at this documentation which shows how various calculations involving executors.

profile pictureAWS
支援工程師
Chaitu
已回答 2 年前
AWS
專家
已審閱 2 年前

您尚未登入。 登入 去張貼答案。

一個好的回答可以清楚地回答問題並提供建設性的意見回饋,同時有助於提問者的專業成長。

回答問題指南