Skip to content

Glue cost analysis

0

I have a glue etl with version 4.0 and I need to know how the cost is calculated for G.1X and G.2X worker type. Need the sample cost analysis as well for understanding

asked 8 months ago328 views
1 Answer
0

AWS Glue costs are calculated based on the Data Processing Units (DPUs) used and the duration of your job runs. Here's how the pricing works for G.1X and G.2X worker types:

For G.1X workers:

  • Each G.1X worker equals 1 DPU
  • 4 vCPUs and 16 GB of memory per worker

For G.2X workers:

  • Each G.2X worker equals 2 DPUs
  • 8 vCPUs and 32 GB of memory per worker

AWS Glue charges an hourly rate of $0.44 per DPU-hour, billed by the second. There's a minimum billing duration of 1 minute for each job run, and after that, you're charged per second.

Sample cost analysis:

Example 1: G.1X with 2 workers running for 10 minutes

  • 2 workers × 1 DPU per worker = 2 DPUs
  • 10 minutes = 0.17 hours
  • 2 DPUs × 0.17 hours × $0.44 per DPU-hour = $0.15 per job run

Example 2: G.2X with 2 workers running for 10 minutes

  • 2 workers × 2 DPUs per worker = 4 DPUs
  • 10 minutes = 0.17 hours
  • 4 DPUs × 0.17 hours × $0.44 per DPU-hour = $0.30 per job run

If you're using AWS Glue Studio for development, note that interactive sessions (used for data preview) are also billed at the same DPU-hour rate. For example, a typical interactive session with 5 DPUs running for 24 minutes would cost: 5 DPUs × (24/60) hours × $0.44 per DPU-hour = $0.88.

To monitor your actual costs, you can use AWS Cost Explorer and filter for Glue services and specific usage types like "ETL-DPU-Hour" for your region.
Sources
Glue pricing for visual ETL jobs | AWS re:Post
AWS Glue worker types - AWS Glue
Monitor and optimize cost on AWS Glue for Apache Spark | AWS re:Post
AWS Glue Pricing

answered 8 months ago

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.