Resilience
Resilience focus areas include the Well-Architected Reliability Pillar, Operational Readiness Review, Disaster Recovery, and Chaos Engineering.
Recent questions
see all1 / 14
- I have a web applications in AWS and I am considering enabling Cloudfront, WAF and R53. Before I was using multivendor DNS with hidden primary as best practice. Does it still make sense when using R53...
- We are using **AWS OpenSearch Serverless** for our search workloads and have observed intermittent HTTP 4xx and 5xx errors over the past few months. While the scale of the issue is low, we are reachin...
- I have a ECS cluster running Fargate profile and ECS service associated to public ALB (2 AZs), running one task. I´m trying to simulate an AZ failure blocking all traffic in one AZ using Network ACL. ...
- IAM Identity Center controls access to its permission sets and applications from its primary Region only. Does this mean if the primary region is down, Nobody will be able to sign in? or the services...
- Hi, I have AWS environment which uses IAM Identity Centre. Users are created in Active Directory and synced across AWS and they can access AWS. I want to create emergency access account to access AWS...
- I have several EB clusters, with capacity scaling based on CPU usage. For each cluster, the idle state is just 1 instance, and it will scale-up from there based on load. But what about if an instance...
- Why does Auto-Recovery Behavior not occur even when System Checked Failed occurs despite setting the Auto-Recovery Behavior option to Default (On) on the EC2(T2.micro, Ubuntu(22.04)) instance? And, Ho...
- We have 2 AWS regions in active mode. Services in Region-1 and Region-2 have health checks registered with Route53 which is setup for 'latency based routing'. The Route53 TTL is 30 seconds. Here is t...
- In reading up on Transfer Family resiliency [here](https://docs.aws.amazon.com/transfer/latest/userguide/disaster-recovery-resiliency.html), it says "Transfer Family supports up to 3 Availability Zone...
- Hi AWS, there is a question: A company runs a website that uses a content management system (CMS) on Amazon EC2. The CMS runs on a single EC2 instance and uses an Amazon Aurora MySQL Multi-AZ DB inst...
- When building an AWS site to site VPN each tunnel of the VPN connection gives me a different outside IP address for the AWS Virtual Private Gateway, which is a good practice for redundancy reasons, as...
- If the Availability Zone has been wiped off the map and is never coming back, any guaranteed RTO for all services?
- 1. How many availability zones SQS uses for replicating the messages? 2. Replicating the messages to multiple availability zones is automatically activated or a manual process? 3. How many regions SQS...
- Let's say I want to extract entities (like address) from invoices from many companies and countries around the world. In many cases, I would want to pass Amazon Textract to Amazon Translate to Amazon ...
Recent articles
see all1 / 12
- Bhanusree VadlamudiEXPERTpublished 6 days ago1 votes83 viewsIn this post, we'll explore how organizations can overcome the common challenge of creating and validating effective disaster recovery plans. We'll introduce AWS's entitlement for ES customers, The Dr...
- AWS OFFICIALUpdated a month ago1 votes708 viewsThis article explains how to use Simulated Conditions Response and Management (SCRaM) to enhance your incident response readiness. The article includes best practices and proactive activities that you...
- Vanessa AuEXPERTpublished 3 months ago3 votes417 viewsLearn how you can use Application Recovery Controller for automated multi-Region application recovery, even across AWS accounts
- Sandhya KhanderiaEXPERTpublished 7 months ago0 votes432 viewsIn the world of big data processing, ensuring data consistency and fault tolerance is crucial. While AWS Glue provides built-in job bookmarks, sometimes we need more fine-grained control over our proc...
- Sandhya KhanderiaEXPERTpublished 7 months ago0 votes174 viewsData protection is the cornerstone of any enterprise storage solution. With Amazon FSx becoming increasingly popular for Linux workloads, implementing robust data protection strategies is crucial. In ...
- Henrique SantanaEXPERTpublished 8 months ago0 votes310 viewsThis blog post summarizes key highlights from the AWS re:Invent 2024 session "Building production-grade resilient architectures with Amazon EKS" presented by Carlos Santana and Niall Thomson from AWS....
- Sobhan ArchakamEXPERTpublished 8 months ago1 votes452 viewsThe context of the article is the use case where customers use DRS as a solution to setup Disaster Recovery. The article talks about how the time taken for a failback operation (after a failover) can ...
- Ed GummettEXPERTpublished 9 months ago2 votes360 viewsAs legal hold has no expiration date, users may wish to use this mode to apply an indefinite lock on objects they wish to protect from accidental or malicious deletion. In this scenario, it may be des...
- AWS OFFICIALUpdated 9 months ago1 votes415 viewsThis article is the second part of a series on resilience best practices and key design principles that can minimize business disruptions during outages.
- Henrique SantanaEXPERTpublished 9 months ago0 votes477 viewsThis blog post summarizes key highlights from the AWS re:Invent 2024 session "Deep dive into Amazon ECS resilience and availability" presented by Maish Saidel-Keesing and Malcolm Featonby. We'll explo...
- AWS OFFICIALUpdated 7 months ago2 votes1.2K viewsThis article is the first part of a series on resilience best practices and key design principles that can minimize business disruptions during outages.
- Vanessa AuEXPERTpublished a year ago0 votes4.3K viewsThis article lists AWS Services with multi-Region capabilities
Recent selections
see all1 / 3
- AWS OFFICIALUpdated 2 years ago0 votes57 viewsDesign your contact center for highly resilient operations at any scale with Amazon Connect.
- Jonathan_DEXPERTpublished 2 years ago4 votes11.7K viewsDo you have critical workloads running in AWS? Review these handpicked resources to find ways to ensure your applications are resilient to failures.
- AWS OFFICIALUpdated 2 years ago0 votes68 viewsPrepare and protect your applications from disruptions
1 / 18
Giovanni Lauria
EXPERTOsvaldo Marte
EXPERTAdeleke Adebowale .J.
EXPERTAWS-User-alantam
EXPERTGunasekaran, Makendran
EXPERTkranthi putti
EXPERTMina Gobrial
EXPERTJonathan_D
EXPERTAndreas Seemueller
EXPERTJisoo_K
SUPPORT ENGINEERSrikanth_N
SUPPORT ENGINEERMojgan-Toth
EXPERT
