- Newest
- Most votes
- Most comments
To run a scheduled AWS Glue notebook job from the console with job bookmarking enabled, you need to use the AWS CLI or SDK to start the job run and pass the --job-bookmark-option
parameter to enable job bookmarking.
Simply saving and running the notebook job from the console does not carry over the job bookmarking option that was enabled when running the notebook interactively.
Yes, you can write a Lambda function triggered by EventBridge on a schedule to start a Glue job run with job bookmark enabled by using the AWS CLI command.
To enable job bookmarking when starting a Glue job run via the CLI, you need to pass the --job-bookmark-option parameter with the value job-bookmark-enable, as shown in your example command. This will tell Glue to track state and prevent reprocessing of old data each time the job runs.
So your proposed approach of using EventBridge to trigger a Lambda function on a schedule that then executes that CLI command would work to run your ETL job on a cron and leverage Glue job bookmarks.
Relevant content
- asked 18 days ago
- asked a month ago
- AWS OFFICIALUpdated 3 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 2 years ago
- AWS OFFICIALUpdated 18 days ago