Back to Design Solutions for Organizational Complexity

Warm standby disaster recovery

5 minutes 5 Questions

Warm standby disaster recovery is a strategy that maintains a scaled-down but fully functional version of your production environment running continuously in a secondary AWS region. This approach strikes a balance between cost efficiency and rapid recovery time, making it ideal for organizations re…

Warm Standby Disaster Recovery

What is Warm Standby Disaster Recovery?

Warm standby is a disaster recovery (DR) strategy where a scaled-down but fully functional version of your production environment runs continuously in another AWS Region. Unlike pilot light, which only keeps critical core components running, warm standby maintains a minimal yet active deployment that can handle a reduced workload and can be rapidly scaled up during a disaster.

Why is Warm Standby Important?

Warm standby provides a balance between cost and recovery time, making it essential for organizations that:
• Require Recovery Time Objectives (RTO) measured in minutes rather than hours
• Need Recovery Point Objectives (RPO) that are near real-time
• Cannot afford extended downtime but also need to manage DR costs
• Want an environment that can be tested and validated regularly
• Need to handle some read traffic or non-critical workloads in the DR region

How Warm Standby Works

1. Active Secondary Environment: A smaller-scale version of your production environment runs continuously in a secondary Region. This includes web servers, application servers, and databases.

2. Data Replication: Continuous data replication occurs between primary and secondary regions using services like:
• Amazon RDS Multi-AZ with cross-region read replicas
• Amazon Aurora Global Database
• Amazon S3 Cross-Region Replication
• AWS Database Migration Service for ongoing replication

3. Reduced Capacity: The warm standby environment typically runs at a fraction of production capacity, perhaps 10-20% of the full size, keeping costs lower while maintaining readiness.

4. Scaling During Failover: When disaster strikes, the environment scales up using:
• Amazon EC2 Auto Scaling to increase instance counts
• Modifying instance sizes to larger types
• Promoting read replicas to primary databases
• DNS failover using Amazon Route 53

5. Traffic Routing: Route 53 health checks detect failures and route traffic to the secondary region using failover routing policies.

Key AWS Services for Warm Standby

• Amazon Route 53: DNS failover and health checking
• Elastic Load Balancing: Distributes traffic in both regions
• Amazon EC2 Auto Scaling: Scales capacity during failover
• Amazon RDS/Aurora: Database replication and failover
• AWS CloudFormation: Infrastructure as code for consistent deployments
• AWS Systems Manager: Automation of scaling operations

Warm Standby vs Other DR Strategies

Backup and Restore: Lowest cost, highest RTO (hours)
Pilot Light: Core services only, moderate RTO (tens of minutes)
Warm Standby: Scaled-down active environment, low RTO (minutes)
Multi-Site Active/Active: Full redundancy, lowest RTO (near-zero), highest cost

Exam Tips: Answering Questions on Warm Standby

1. Identify RTO/RPO Requirements: When questions mention RTO in minutes and RPO near real-time, warm standby is often the answer. If RTO must be seconds or zero, consider multi-site active/active.

2. Look for Cost Considerations: If the scenario mentions balancing cost with recovery speed, warm standby fits well. It costs more than pilot light but less than active/active.

3. Scaled-Down Keywords: Questions mentioning a smaller or reduced-capacity environment running in another region point toward warm standby.

4. Distinguish from Pilot Light: Pilot light keeps only critical core elements running. Warm standby runs a complete but smaller version of the application stack. If the question mentions functional systems handling some traffic, choose warm standby.

5. Scaling Language: Questions about environments that need to scale up during failover align with warm standby architecture.

6. Testing Requirements: If the scenario emphasizes regular DR testing with a live environment, warm standby supports this since the environment is always running.

7. Remember the Recovery Order: Route 53 health check fails, DNS failover occurs, Auto Scaling increases capacity, database replica promotes to primary.

8. Common Distractors: Do not confuse warm standby with Multi-AZ deployments, which provide high availability within a single region, not cross-region disaster recovery.

Test mode:

Exam (Timed)

Practice (With explanations)

Start practice test

Unlock Premium Access

AWS Certified Solutions Architect - Professional

Access to ALL Certifications: Study for any certification on our platform with one subscription
8734 Superior-grade AWS Certified Solutions Architect - Professional practice questions
Unlimited practice tests across all certifications
Detailed explanations for every question
SAP-C02: 5 full exams plus all other certification exams
100% Satisfaction Guaranteed: Full refund if unsatisfied
Risk-Free: 7-day free trial with all premium features!

More Warm standby disaster recovery questions

30 questions (total)

Start 30 question test