Back to Design Solutions for Organizational Complexity

Pilot light disaster recovery

5 minutes 5 Questions

Pilot light disaster recovery is a cost-effective AWS strategy that maintains a minimal version of your production environment in a secondary region, ready to scale up when disaster strikes. The term comes from the small flame in gas heaters that can quickly ignite the full system when needed. In …

Pilot Light Disaster Recovery - Complete Guide

What is Pilot Light Disaster Recovery?

Pilot Light is a disaster recovery (DR) strategy where a minimal version of your environment is always running in a secondary AWS region. The term comes from the pilot light on a gas furnace - a small flame that is always on and can quickly ignite the full furnace when needed.

In this approach, only the most critical core elements of your system are kept running, typically including:
- Database servers with continuous replication
- Core application configurations
- Essential AMIs and launch templates

Why is Pilot Light Important?

Pilot Light occupies a strategic middle ground in the DR spectrum:

Cost Efficiency: It costs less than maintaining a fully scaled environment (Warm Standby or Multi-Site) while providing faster recovery than Backup and Restore.

Reduced RTO: Recovery Time Objective is typically measured in minutes to hours rather than hours to days, since core components are already running.

Data Protection: Continuous database replication means minimal data loss (low RPO - Recovery Point Objective).

Business Continuity: Organizations can meet compliance requirements and SLAs for critical workloads.

How Pilot Light Works

Normal Operations:
1. Primary region handles all production traffic
2. Database replication occurs continuously to the DR region (using services like RDS Read Replicas, Aurora Global Database, or database-native replication)
3. AMIs and configurations are kept synchronized
4. Minimal compute resources run in DR region (or none at all for true pilot light)

During Failover:
1. Detect the disaster or trigger manual failover
2. Promote the replicated database to become the primary
3. Scale up compute resources (launch EC2 instances, scale Auto Scaling groups)
4. Update DNS records (Route 53) to point to the DR region
5. Verify application functionality

Key AWS Services for Pilot Light

- Amazon RDS: Cross-region read replicas for database replication
- Amazon Aurora Global Database: Sub-second replication across regions
- Amazon S3: Cross-region replication for static assets
- AWS CloudFormation/Terraform: Infrastructure as Code for rapid provisioning
- Amazon Route 53: DNS failover and health checks
- AWS Auto Scaling: Rapid scaling of compute resources
- AWS Systems Manager: Automation for failover procedures

Pilot Light vs Other DR Strategies

Backup and Restore: Lower cost but higher RTO (hours to days). No running resources in DR region.

Pilot Light: Core systems running, moderate cost, RTO in minutes to hours.

Warm Standby: Scaled-down but fully functional environment running. Higher cost, lower RTO.

Multi-Site Active-Active: Full production capacity in multiple regions. Highest cost, near-zero RTO.

Exam Tips: Answering Questions on Pilot Light Disaster Recovery

Identify Pilot Light Scenarios:
- Questions mentioning 'minimal running resources' with 'database replication'
- Requirements for RTO of 10 minutes to a few hours
- Cost-conscious organizations needing faster recovery than backup/restore
- Scenarios requiring 'core elements' to be maintained

Key Differentiators to Remember:
- Pilot Light = databases replicated + minimal/no compute running
- Warm Standby = scaled-down but complete environment running
- If the question mentions 'always running at reduced capacity' it is likely Warm Standby, not Pilot Light

Common Exam Patterns:
- Watch for RTO/RPO requirements - Pilot Light offers low RPO due to replication but moderate RTO due to scaling time
- Cost optimization questions where Backup/Restore is too slow
- Questions about promoting read replicas during failover

Red Flags in Answer Choices:
- If an answer suggests no database replication, it is Backup and Restore
- If an answer mentions full capacity running in both regions, it is Multi-Site
- If compute resources are described as 'scaled down but operational,' consider Warm Standby

Remember the Analogy: Like a pilot light on a furnace - always burning minimally, ready to ignite the full system when needed. The 'flame' is your replicated database; the 'furnace' is your full application stack.

Test mode:

Exam (Timed)

Practice (With explanations)

Start practice test

Unlock Premium Access

AWS Certified Solutions Architect - Professional

Access to ALL Certifications: Study for any certification on our platform with one subscription
8734 Superior-grade AWS Certified Solutions Architect - Professional practice questions
Unlimited practice tests across all certifications
Detailed explanations for every question
SAP-C02: 5 full exams plus all other certification exams
100% Satisfaction Guaranteed: Full refund if unsatisfied
Risk-Free: 7-day free trial with all premium features!

More Pilot light disaster recovery questions

29 questions (total)

Start 29 question test