Batch and Spot Interruptions
Customer is asking if their Batch job that execute on spot is interrupted due to spot price increasing beyond their threshold, will Batch resubmit the job when the spot price falls again, or will the customer have to manually resubmit?
AWS Batch has a mechanism for job retry. Documentations here: https://docs.aws.amazon.com/batch/latest/userguide/job_retries.html
Some quote from the doc:
You can apply a retry strategy to your jobs and job definitions that allows your jobs to be automatically retried if they fail. Possible failure scenarios include:
Any non-zero exit code from a container job
Amazon EC2 instance failure or termination
Internal AWS service error or outage
When a job is submitted to a job queue and placed into the RUNNING state, that is considered an attempt. By default, each job is given one attempt to move to either the SUCCEEDED or FAILED job state. However, both the job definition and the job submission workflows allow you to specify a retry strategy with between 1 and 10 attempts. For more information, see Retry Strategy.
Relevant questions
What happen if no spot instance are available ?
asked a month agoHibernating Spot Instances upon interruption in Amazon EKS
Accepted Answerasked 2 years agoDoes AWS Batch support EC2 Spot stop/start and hibernation
Accepted Answerasked 4 years agoDoes using SPOT_CAPACITY _OPTIMIZED launch spot instances into an auto-scaling group in AWS Batch?
asked 5 days agoECS + Spot Integration - Multiple ASGs vs SpotFleet
Accepted Answerasked 4 years agoBatch and Spot Interruptions
Accepted Answerasked 4 years agoCan I know the size of spot capacity pools?
asked a month agoAWS Spot Pricing Specifics
asked a month agoSpot Fleet Instance could not be stopped
asked 22 days agoHow to do game session migration to handle spot interruption?
Accepted Answerasked 4 years ago