Back to Reliability and Business Continuity

Amazon EC2 Auto Scaling

5 minutes 5 Questions

Amazon EC2 Auto Scaling is a powerful AWS service that automatically adjusts the number of EC2 instances in your application fleet based on demand, ensuring optimal performance and cost efficiency while maintaining high availability. Key Components: 1. **Launch Templates/Configurations**: Define …

Amazon EC2 Auto Scaling: Complete Guide for AWS SysOps Administrator Associate Exam

Why Amazon EC2 Auto Scaling is Important

Amazon EC2 Auto Scaling is a critical service for maintaining application availability and optimizing costs. It ensures that you have the right number of EC2 instances available to handle the load for your application. For the AWS SysOps Administrator Associate exam, this topic is essential because it directly relates to reliability, business continuity, and operational excellence.

Key benefits include:
• High Availability - Automatically replaces unhealthy instances
• Cost Optimization - Scale down during low demand periods
• Fault Tolerance - Distribute instances across multiple Availability Zones
• Elasticity - Respond to changing demand patterns

What is Amazon EC2 Auto Scaling?

Amazon EC2 Auto Scaling is a service that automatically adjusts the number of EC2 instances in your fleet based on conditions you define. It consists of several key components:

1. Auto Scaling Groups (ASG)
A logical grouping of EC2 instances that share similar characteristics and are treated as a single unit for scaling and management purposes. You define minimum, maximum, and desired capacity settings.

2. Launch Templates or Launch Configurations
Launch Templates (recommended) or Launch Configurations define the instance configuration including AMI ID, instance type, key pair, security groups, and block device mappings.

3. Scaling Policies
Rules that determine when and how to scale:
• Target Tracking Scaling - Maintains a specific metric at a target value (e.g., CPU at 50%)
• Step Scaling - Scales based on a set of scaling adjustments based on alarm breach size
• Simple Scaling - Adds or removes instances based on a single scaling adjustment
• Scheduled Scaling - Scales based on predictable load patterns at specific times
• Predictive Scaling - Uses machine learning to forecast load and schedule scaling actions

How Amazon EC2 Auto Scaling Works

Scaling Process:
1. CloudWatch monitors metrics (CPU, network, custom metrics)
2. When thresholds are breached, alarms trigger
3. Auto Scaling evaluates scaling policies
4. Instances are launched or terminated based on the policy
5. New instances register with load balancers (if configured)

Health Checks:
• EC2 Health Checks - Default check based on EC2 instance status
• ELB Health Checks - Checks if instance is healthy according to the load balancer
• Custom Health Checks - Using the SetInstanceHealth API

Cooldown Periods:
Prevent Auto Scaling from launching or terminating additional instances before previous scaling activities take effect. Default is 300 seconds.

Instance Warm-up:
Time for a newly launched instance to warm up before contributing to aggregated metrics.

Termination Policies:
Determine which instances to terminate first during scale-in:
• Default - Balances across AZs, then oldest launch configuration
• OldestInstance
• NewestInstance
• OldestLaunchConfiguration
• OldestLaunchTemplate
• ClosestToNextInstanceHour
• AllocationStrategy

Lifecycle Hooks:
Allow you to perform custom actions when instances launch or terminate. Instances enter a wait state where you can run scripts, install software, or pull logs before completion.

Instance Refresh:
Updates instances in an ASG to match a new launch template version. You can configure minimum healthy percentage and warm-up time.

Exam Tips: Answering Questions on Amazon EC2 Auto Scaling

Common Exam Scenarios:

Scenario 1: Application not scaling as expected
• Check CloudWatch alarms and metrics
• Verify scaling policy configurations
• Review cooldown periods (may be too long)
• Check if maximum capacity has been reached

Scenario 2: Instances launching but failing health checks
• Extend health check grace period
• Verify security group rules allow health check traffic
• Check application startup time
• Review user data scripts for errors

Scenario 3: Cost optimization requirements
• Use Target Tracking with appropriate metrics
• Implement Scheduled Scaling for predictable patterns
• Consider Predictive Scaling for variable workloads
• Mix On-Demand and Spot instances

Scenario 4: High availability requirements
• Distribute instances across multiple AZs
• Enable ELB health checks
• Set appropriate minimum capacity
• Use multiple instance types for Spot capacity

Key Points to Remember:
• Launch Templates are preferred over Launch Configurations (more features, versioning)
• Target Tracking is the simplest and most common scaling policy
• Cooldown periods apply to Simple Scaling but not Step Scaling or Target Tracking
• Lifecycle hooks have a default timeout of 3600 seconds (1 hour)
• Auto Scaling can span multiple AZs but not multiple regions
• Suspended processes can prevent scaling actions
• Instance protection prevents specific instances from being terminated during scale-in

Metrics to Know:
• ASGAverageCPUUtilization
• ASGAverageNetworkIn/Out
• ALBRequestCountPerTarget
• Custom metrics via CloudWatch

Troubleshooting Tips for Exam:
• If instances keep launching and terminating, check health check configuration
• If scaling is slow, reduce cooldown periods or use Step Scaling
• If costs are high during low traffic, verify scale-in policies are configured
• If capacity is unbalanced across AZs, check for AZ rebalancing settings
• Review Activity History in the console for scaling event details

Integration Points:
• Elastic Load Balancing - Automatic registration and deregistration
• Amazon SNS - Notifications for scaling events
• AWS CloudFormation - Infrastructure as code deployment
• AWS Systems Manager - Patch management and automation
• Amazon EventBridge - Event-driven scaling actions

Test mode:

Exam (Timed)

Practice (With explanations)

Start practice test

Unlock Premium Access

AWS Certified SysOps Administrator - Associate

Access to ALL Certifications: Study for any certification on our platform with one subscription
4584 Superior-grade AWS Certified SysOps Administrator - Associate practice questions
Unlimited practice tests across all certifications
Detailed explanations for every question
SOA-C02: 5 full exams plus all other certification exams
100% Satisfaction Guaranteed: Full refund if unsatisfied
Risk-Free: 7-day free trial with all premium features!