- Neueste
- Die meisten Stimmen
- Die meisten Kommentare
This service will need to be available at all points, but is unlikely to be called more than a few times a week.
I take it this means that the application needs to be available at all times (needs at least one instance running) and able to scale on the times that it is called within the week.
For this, Capacity Providers with a mix of Spot and on-demand instances would be ideal for your infrastructure:
https://aws.amazon.com/blogs/containers/optimize-cost-for-container-workloads-with-ecs-capacity-providers-and-ec2-spot-instances/
A capacity provider is linked to an Auto Scaling group, hence the Auto Scaling group can be configured to only utilize instances which contain GPUs.
If the application needs to scale (i.e. you would need more than one ECS task when the application is called), you can look at implementing Service Auto Scaling:
https://docs.aws.amazon.com/AmazonECS/latest/developerguide/service-auto-scaling.html
This would allow your ECS service to scale as needed (i.e. launch and stop tasks based on demand). The Capacity Provider would launch new instances (if required) as the services scales up, and terminate any instances not in use as the service scales down.
Relevanter Inhalt
- AWS OFFICIALAktualisiert vor 2 Jahren
- AWS OFFICIALAktualisiert vor 2 Jahren
- AWS OFFICIALAktualisiert vor 2 Jahren