SlideShare una empresa de Scribd logo
1 de 32
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:INVENT
Auto Scaling
T h e F l e e t M a n a g e m e n t S o l u t i o n f o r P l a n e t E a r t h
A n d r é D u f o u r , S e n i o r M a n a g e r , A W S
H o o k H u a , D a t a S c i e n t i s t , J e t P r o p u l s i o n L a b o r a t o r y , C a l i f o r n i a
I n s t i t u t e o f T e c h n o l o g y
C M P 2 0 1
N o v e m b e r 2 9 , 2 0 1 7
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
These are a few of my favorite (and least
favorite) things
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Availability with fleet management
Keeping up with demand with dynamic scaling
Future of Auto Scaling
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Myth Fact
My application does not scale,
so I don’t need Auto Scaling
It’s hard to use
My instances are stateful or unique;
I can’t use Auto Scaling
It works well
with stateful instances
You can get started in minutes
It monitors and heals instances
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Auto Scaling groupAuto Scaling group
Auto Scaling
Fleet management Dynamic scaling
ELB
EC2 instances
ELB
CPU
Utilization
EC2 instances
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Is fleet management for you?
“I’ve got instances serving a
business-impacting application”
“If my instances become unhealthy,
I’d like them replaced automatically”
“I would like my instances distributed
to maximize resilience”
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Demo
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Dynamic scaling
Demo & Details: CMP311 “Auto Scaling Prime Time:
Target Tracking Hits the Bullseye at Netflix”
Thursday 12:15 p.m. - Venetian
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Scaling on a schedule
Recurring scaling
events
Schedule individual
events
Auto Scaling group
ELB
EC2 instances
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Reactive scaling with target tracking
1. Pick your
metric
2. Set the
target value
4. Profit
…
You do this part Auto Scaling manages your capacity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Dynamic scaling for fun and profit
Collect metrics
Auto Scaling
Scaling policy:
keep my CPU at 50%
ELB
Auto Scaling group
50%50%65%
Add capacity
50%
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
NASA JPL: Auto
Scaling Advantage
Advanced Rapid Imaging and Analysis
© 2017 California Institute of Technology. Government sponsorship acknowledged.
Reference herein to any specific commercial product, or service by trade name, trademark, manufacturer, or otherwise, does not constitute or
imply its endorsement by the United States Government or the Jet Propulsion Laboratory, California Institute of Technology.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Challenge: Automated and rapid remote
sensing for urgent disaster response
Automated data system are required to analyze large
quantities of data from NASA NISAR, other satellite missions,
and rapidly expanding GPS networks
Mean Access Time (Day)
∞ 4 2 1.3 1
Going from artisan to automation: Use system
engineering approach to translate specialized data
analysis into operational capability
Orbit Data Coherence
Library
AZO / MAI
EQ
Kinema c
Modeling
Level-0 Data
Kinema c
Model
Catalog &
Waveform
Sta c
Model
InSAR Processing
Time Series
Con guous Pairs Sta c
Modeling
Int. Phase
Stress Change
III. Rou ne Data Processing
Coherence
Change Maps
Seismometer
Radar Sensor
I. Geode c
Sensors
II. Data Providers / Archives
Global CMT
USGS NEIC
IRIS (Waveform)
Fault
Geometry
Mw>5.5 Mw>6.5
Yes
Yes
No Small EQ
Modeling
Amp / Arr. T
correc on
IV. Triggered
Legend
Ready to be used ( > 70 % readiness )
Need modifica on ( 30 – 70 % readiness)
Doesn’t exist ( < 30 % readiness) Latency / Bo leneck
Out of scope of R&TD
Customer Products
Level-0 Data
Op cal Im.
Processing
Offset Image
Event Driven
Op cal Sensor
SPOT Image (SPOT 1-5,
Formosat 2)
NSPO (SPOT)
DigitalGlobe (Quickbird,
Worldview 1)
RINEX Data
GPS Processing
Daily
Point Pos.
1 Hz (GDGPS)
Hourly
Daily
GPS Sta on
Orbit Data
Sub-daily
Point Pos.
SOPAC
+ many others
GPS Constella on
NASA - JPL
V. Customer
Products
Damage
Assessment
3D Surface
Disp. Maps
Ground
Mo on
3D Surface
Disp. Maps
EQ catalog
CTBT
correc on
surface
Coseismic
Interferogram
Coseismic GPS
vector map
Coseismic
Interferogram
Coseismic GPS
vector map
Temporal records
of ground deformation
Spatial maps of ground
deformation
Earthquake
models
Coseismic ground
deformation
Demonstrate response to hazards with standardized set of data products for decision and policy
makers
Coseismic damage
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Automated
data collection
& processing
Radar
Sensors
Building damage
and inundation
Radar
GPS Seismology
Permanent
ground deformation
High-resolution
hazard assessment
from fault models
Monitoring &
near real-time
assessment
Examples from the 2011 M9.0
Tohoku-Oki (Japan) earthquake
GPS
Networks
Seismic
Networks
Optical
Sensors
Advanced Rapid Imaging and Analysis
(ARIA)
Amatrice, Italy earthquake
(August 23, 2016)
Automated Urgent
Response Interferogram
Urgent response
processing of ESA’s
Sentinel-1A data to
interferograms were
automatically
processed—all in
AWS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hurricane Harvey Response
August 2017
Flood Proxy Map
Hurricane Maria Response
September 2017
Damage Proxy Map
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Urgent Response Analysis in AWS Cloud
M7.1 Earthquake near Puebla, Mexico (September 9, 2017)
Reactive Auto Scaling of
satellite analysis based on
increased data acquisitions
Urgent Response Analysis in AWS Cloud
SAR-based Damage Proxy Map
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Dynamic Scaling in Earth Science Data
System
The size of the science data system compute nodes can automatically
grow/shrink based on processing demand
Auto Scaling group policies
Target tracking scaling policies
Auto Scaling enabling runs
of over 100,000 vCPUs
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Earth Science Data System in AWS
NASA OCO-2 L2 full physics processing operational in AWS
• Processing of L2 full physics data products in Amazon cloud across multiple regions
• Scaled up thousands of compute nodes
• Demonstrated capability of higher internal data throughput rates than NISAR needs
Number of
compute
nodes over
time
Per node
transfer rate
over time
Scalable internal
data throughput
@ 32,000 full-physics
processing on 1,000 nodes
ASG max set to 1000 instances x 32 vCPUs
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Considerations for Scaling In/Out Events
• Target tracking scaling policies
• Scaling up in batches + rest periods
Scaling up (scale out)
• What policy to set to scale down?
CPU/network utilization
• Potential stateful domain knowledge only
known within the instances
• Instance protection
Scaling down (scale in)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Auto Scaling and the Amazon EC2
Spot market
• Auto Scaling works well with Spot Instances
• Major cost savings (75%–90% savings over on-demand)…if you can use Spot Instances
• Compute instances terminated if market prices exceed your bid threshold
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fleet Management for High Resiliency
X
X
X X
X
Availability Zone a Availability Zone b Availability Zone c
Running in Spot market forces the data
system to be more resilient to failures
Compute fleet instances
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Auto Scaling and the “Market Maker”
This OCO-2 data production run
of 1000 x 32vCPUs affected the
market prices
Strategy:
• Mitigate impact on spot market
• Diversification of resources
• “Spot fleet”
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
“Thundering Herd”
Fleet of ASG compute instances calling
same services at same time
• “API rate limit exceeded”
“Jittering” the API calls
• Introduce randomizations to API calls
• Distributes load on infrastructure
Service
Compute fleet instances
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Next generation NASA missions
• The volume of data
produced is larger
than previous missions
• Data storage,
processing,
movement, and costs
are the biggest
challenges
0
10
20
30
40
50
60
70
80
90
100
OCO-2 SMAP NISAR
Estimated Daily “Keep Up” Volume (TB)
(2021)(2009) (2015)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The future of Auto Scaling:
Application Auto Scaling
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Application Auto Scaling: Databases
Amazon
Aurora
Amazon
DynamoDB
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Application Auto Scaling: Compute
EMR Cluster
Task
Task
Task
ECS Service EC2 Spot Fleet
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Learn more
CMP301
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix
Thursday, November 30, 12:15 p.m. - Venetian, Level 5, Palazzo P
aws.amazon.com/autoscaling/getting-started
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you!

Más contenido relacionado

La actualidad más candente

AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017Amazon Web Services
 
DAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudDAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudAmazon Web Services
 
GPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWS
GPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWSGPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWS
GPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWSAmazon Web Services
 
DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...
DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...
DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...Amazon Web Services
 
DEV205_Developing Applications on AWS in the JVM
DEV205_Developing Applications on AWS in the JVMDEV205_Developing Applications on AWS in the JVM
DEV205_Developing Applications on AWS in the JVMAmazon Web Services
 
GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...
GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...
GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...Amazon Web Services
 
Deep Dive: AWS X-Ray London Summit 2017
Deep Dive: AWS X-Ray London Summit 2017Deep Dive: AWS X-Ray London Summit 2017
Deep Dive: AWS X-Ray London Summit 2017Randall Hunt
 
Real world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWSReal world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWSAmazon Web Services
 
MCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon RekognitionMCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon RekognitionAmazon Web Services
 
DAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC OnlineDAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC OnlineAmazon Web Services
 
DEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at Scale
DEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at ScaleDEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at Scale
DEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at ScaleAmazon Web Services
 
Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...
Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...
Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...Amazon Web Services
 
Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...
Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...
Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...Amazon Web Services
 
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...Amazon Web Services
 
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise WorkloadsDAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise WorkloadsAmazon Web Services
 
Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017
Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017
Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017Amazon Web Services
 
클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)
클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)
클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)Amazon Web Services Korea
 
DAT202_Getting started with Amazon Aurora
DAT202_Getting started with Amazon AuroraDAT202_Getting started with Amazon Aurora
DAT202_Getting started with Amazon AuroraAmazon Web Services
 

La actualidad más candente (20)

AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017AWS Commercial Management and Cost Optimisation - Dec 2017
AWS Commercial Management and Cost Optimisation - Dec 2017
 
DAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the CloudDAT317_Migrating Databases and Data Warehouses to the Cloud
DAT317_Migrating Databases and Data Warehouses to the Cloud
 
GPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWS
GPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWSGPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWS
GPSWKS407-Strategies for Migrating Microsoft SQL Databases to AWS
 
DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...
DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...
DAT321_How Careem Used Amazon ElastiCache for Redis to Accelerate Their Ride ...
 
DEV205_Developing Applications on AWS in the JVM
DEV205_Developing Applications on AWS in the JVMDEV205_Developing Applications on AWS in the JVM
DEV205_Developing Applications on AWS in the JVM
 
GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...
GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...
GPSTEC313_GPS Real-Time Data Processing with AWS Lambda Quickly, at Scale, an...
 
Deep Dive: AWS X-Ray London Summit 2017
Deep Dive: AWS X-Ray London Summit 2017Deep Dive: AWS X-Ray London Summit 2017
Deep Dive: AWS X-Ray London Summit 2017
 
Real world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWSReal world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWS
 
MCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon RekognitionMCL314_Unlocking Media Workflows Using Amazon Rekognition
MCL314_Unlocking Media Workflows Using Amazon Rekognition
 
EC2 and VPC Workshop
EC2 and VPC WorkshopEC2 and VPC Workshop
EC2 and VPC Workshop
 
DAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC OnlineDAT322_The Nanoservices Architecture That Powers BBC Online
DAT322_The Nanoservices Architecture That Powers BBC Online
 
DEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at Scale
DEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at ScaleDEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at Scale
DEV333_Using Amazon CloudWatch for Amazon ECS Resource Monitoring at Scale
 
Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...
Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...
Migrating Massive Databases and Data Warehouses to the Cloud - ENT327 - re:In...
 
Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...
Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...
Run Your CI/CD Pipeline at Scale for a Fraction of the Cost - AWS Online Tech...
 
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
DAT339_Replicate, Analyze, and Visualize Datasets Using AWS Database Migratio...
 
Amazon Aurora 深度探討
Amazon Aurora 深度探討Amazon Aurora 深度探討
Amazon Aurora 深度探討
 
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise WorkloadsDAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
DAT332_How Verizon is Adopting Amazon Aurora PostgreSQL for Enterprise Workloads
 
Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017
Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017
Running Oracle Databases on Amazon RDS - DAT313 - re:Invent 2017
 
클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)
클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)
클라우드 기반 데이터 분석 및 인공 지능을 위한 비지니스 혁신 - 윤석찬 (AWS 테크에반젤리스트)
 
DAT202_Getting started with Amazon Aurora
DAT202_Getting started with Amazon AuroraDAT202_Getting started with Amazon Aurora
DAT202_Getting started with Amazon Aurora
 

Similar a Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:Invent 2017

Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...
Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...
Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...Amazon Web Services
 
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...Amazon Web Services
 
ABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSightABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSightAmazon Web Services
 
The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...
The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...
The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...Amazon Web Services
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAmazon Web Services
 
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017Amazon Web Services
 
CMP207_High Performance Computing on AWS
CMP207_High Performance Computing on AWSCMP207_High Performance Computing on AWS
CMP207_High Performance Computing on AWSAmazon Web Services
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAmazon Web Services
 
ATC301-Big Data & Analytics for Manufacturing Operations
ATC301-Big Data & Analytics for Manufacturing OperationsATC301-Big Data & Analytics for Manufacturing Operations
ATC301-Big Data & Analytics for Manufacturing OperationsAmazon Web Services
 
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAmazon Web Services
 
Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...
Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...
Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...Amazon Web Services
 
The Non-Relational Revolution
The Non-Relational RevolutionThe Non-Relational Revolution
The Non-Relational RevolutionMikhail Prudnikov
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...Amazon Web Services
 
ARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureAmazon Web Services
 
How to Design a Multi-Region Active-Active Architecture
How to Design a Multi-Region Active-Active ArchitectureHow to Design a Multi-Region Active-Active Architecture
How to Design a Multi-Region Active-Active ArchitectureAmazon Web Services
 
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Amazon Web Services
 
AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...
AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...
AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...Amazon Web Services
 
Building Multiregion Serverless Backends
Building Multiregion Serverless BackendsBuilding Multiregion Serverless Backends
Building Multiregion Serverless BackendsAmazon Web Services
 

Similar a Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:Invent 2017 (20)

Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...
Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...
Set it and Forget it: Auto Scaling Target Tracking Policies - AWS Online Tech...
 
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
What's New for AWS Purpose Built, Non-relational Databases - DAT204 - re:Inve...
 
ABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSightABD206-Building Visualizations and Dashboards with Amazon QuickSight
ABD206-Building Visualizations and Dashboards with Amazon QuickSight
 
The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...
The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...
The Gronk Effect: Efficiently Handling Huge Spikes in Traffic Using Predictiv...
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the Union
 
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
AWS Database and Analytics State of the Union - 2017 - DAT201 - re:Invent 2017
 
CMP207_High Performance Computing on AWS
CMP207_High Performance Computing on AWSCMP207_High Performance Computing on AWS
CMP207_High Performance Computing on AWS
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the Union
 
ATC301-Big Data & Analytics for Manufacturing Operations
ATC301-Big Data & Analytics for Manufacturing OperationsATC301-Big Data & Analytics for Manufacturing Operations
ATC301-Big Data & Analytics for Manufacturing Operations
 
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
 
Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...
Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...
Keys to Successfully Monitoring and Optimizing Innovative and Sophisticated C...
 
The Non-Relational Revolution
The Non-Relational RevolutionThe Non-Relational Revolution
The Non-Relational Revolution
 
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
FINRA's Managed Data Lake: Next-Gen Analytics in the Cloud - ENT328 - re:Inve...
 
ARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active ArchitectureARC319_Multi-Region Active-Active Architecture
ARC319_Multi-Region Active-Active Architecture
 
How to Design a Multi-Region Active-Active Architecture
How to Design a Multi-Region Active-Active ArchitectureHow to Design a Multi-Region Active-Active Architecture
How to Design a Multi-Region Active-Active Architecture
 
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
 
STG401_This Is My Architecture
STG401_This Is My ArchitectureSTG401_This Is My Architecture
STG401_This Is My Architecture
 
Non-Relational Revolution
Non-Relational RevolutionNon-Relational Revolution
Non-Relational Revolution
 
AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...
AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...
AWS re:Invent 2016: Auto Scaling – the Fleet Management Solution for Planet E...
 
Building Multiregion Serverless Backends
Building Multiregion Serverless BackendsBuilding Multiregion Serverless Backends
Building Multiregion Serverless Backends
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Auto Scaling: The Fleet Management Solution for Planet Earth - CMP201 - re:Invent 2017

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:INVENT Auto Scaling T h e F l e e t M a n a g e m e n t S o l u t i o n f o r P l a n e t E a r t h A n d r é D u f o u r , S e n i o r M a n a g e r , A W S H o o k H u a , D a t a S c i e n t i s t , J e t P r o p u l s i o n L a b o r a t o r y , C a l i f o r n i a I n s t i t u t e o f T e c h n o l o g y C M P 2 0 1 N o v e m b e r 2 9 , 2 0 1 7
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. These are a few of my favorite (and least favorite) things
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Availability with fleet management Keeping up with demand with dynamic scaling Future of Auto Scaling
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Myth Fact My application does not scale, so I don’t need Auto Scaling It’s hard to use My instances are stateful or unique; I can’t use Auto Scaling It works well with stateful instances You can get started in minutes It monitors and heals instances
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Auto Scaling groupAuto Scaling group Auto Scaling Fleet management Dynamic scaling ELB EC2 instances ELB CPU Utilization EC2 instances
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Is fleet management for you? “I’ve got instances serving a business-impacting application” “If my instances become unhealthy, I’d like them replaced automatically” “I would like my instances distributed to maximize resilience”
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Demo
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Dynamic scaling Demo & Details: CMP311 “Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix” Thursday 12:15 p.m. - Venetian
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Scaling on a schedule Recurring scaling events Schedule individual events Auto Scaling group ELB EC2 instances
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Reactive scaling with target tracking 1. Pick your metric 2. Set the target value 4. Profit … You do this part Auto Scaling manages your capacity
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Dynamic scaling for fun and profit Collect metrics Auto Scaling Scaling policy: keep my CPU at 50% ELB Auto Scaling group 50%50%65% Add capacity 50%
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. NASA JPL: Auto Scaling Advantage Advanced Rapid Imaging and Analysis © 2017 California Institute of Technology. Government sponsorship acknowledged. Reference herein to any specific commercial product, or service by trade name, trademark, manufacturer, or otherwise, does not constitute or imply its endorsement by the United States Government or the Jet Propulsion Laboratory, California Institute of Technology.
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Challenge: Automated and rapid remote sensing for urgent disaster response Automated data system are required to analyze large quantities of data from NASA NISAR, other satellite missions, and rapidly expanding GPS networks Mean Access Time (Day) ∞ 4 2 1.3 1 Going from artisan to automation: Use system engineering approach to translate specialized data analysis into operational capability Orbit Data Coherence Library AZO / MAI EQ Kinema c Modeling Level-0 Data Kinema c Model Catalog & Waveform Sta c Model InSAR Processing Time Series Con guous Pairs Sta c Modeling Int. Phase Stress Change III. Rou ne Data Processing Coherence Change Maps Seismometer Radar Sensor I. Geode c Sensors II. Data Providers / Archives Global CMT USGS NEIC IRIS (Waveform) Fault Geometry Mw>5.5 Mw>6.5 Yes Yes No Small EQ Modeling Amp / Arr. T correc on IV. Triggered Legend Ready to be used ( > 70 % readiness ) Need modifica on ( 30 – 70 % readiness) Doesn’t exist ( < 30 % readiness) Latency / Bo leneck Out of scope of R&TD Customer Products Level-0 Data Op cal Im. Processing Offset Image Event Driven Op cal Sensor SPOT Image (SPOT 1-5, Formosat 2) NSPO (SPOT) DigitalGlobe (Quickbird, Worldview 1) RINEX Data GPS Processing Daily Point Pos. 1 Hz (GDGPS) Hourly Daily GPS Sta on Orbit Data Sub-daily Point Pos. SOPAC + many others GPS Constella on NASA - JPL V. Customer Products Damage Assessment 3D Surface Disp. Maps Ground Mo on 3D Surface Disp. Maps EQ catalog CTBT correc on surface Coseismic Interferogram Coseismic GPS vector map Coseismic Interferogram Coseismic GPS vector map Temporal records of ground deformation Spatial maps of ground deformation Earthquake models Coseismic ground deformation Demonstrate response to hazards with standardized set of data products for decision and policy makers Coseismic damage
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Automated data collection & processing Radar Sensors Building damage and inundation Radar GPS Seismology Permanent ground deformation High-resolution hazard assessment from fault models Monitoring & near real-time assessment Examples from the 2011 M9.0 Tohoku-Oki (Japan) earthquake GPS Networks Seismic Networks Optical Sensors Advanced Rapid Imaging and Analysis (ARIA)
  • 15. Amatrice, Italy earthquake (August 23, 2016) Automated Urgent Response Interferogram Urgent response processing of ESA’s Sentinel-1A data to interferograms were automatically processed—all in AWS
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hurricane Harvey Response August 2017 Flood Proxy Map Hurricane Maria Response September 2017 Damage Proxy Map
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Urgent Response Analysis in AWS Cloud M7.1 Earthquake near Puebla, Mexico (September 9, 2017) Reactive Auto Scaling of satellite analysis based on increased data acquisitions
  • 19. Urgent Response Analysis in AWS Cloud SAR-based Damage Proxy Map
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Dynamic Scaling in Earth Science Data System The size of the science data system compute nodes can automatically grow/shrink based on processing demand Auto Scaling group policies Target tracking scaling policies Auto Scaling enabling runs of over 100,000 vCPUs
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Earth Science Data System in AWS NASA OCO-2 L2 full physics processing operational in AWS • Processing of L2 full physics data products in Amazon cloud across multiple regions • Scaled up thousands of compute nodes • Demonstrated capability of higher internal data throughput rates than NISAR needs Number of compute nodes over time Per node transfer rate over time Scalable internal data throughput @ 32,000 full-physics processing on 1,000 nodes ASG max set to 1000 instances x 32 vCPUs
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Considerations for Scaling In/Out Events • Target tracking scaling policies • Scaling up in batches + rest periods Scaling up (scale out) • What policy to set to scale down? CPU/network utilization • Potential stateful domain knowledge only known within the instances • Instance protection Scaling down (scale in)
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Auto Scaling and the Amazon EC2 Spot market • Auto Scaling works well with Spot Instances • Major cost savings (75%–90% savings over on-demand)…if you can use Spot Instances • Compute instances terminated if market prices exceed your bid threshold
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Fleet Management for High Resiliency X X X X X Availability Zone a Availability Zone b Availability Zone c Running in Spot market forces the data system to be more resilient to failures Compute fleet instances
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Auto Scaling and the “Market Maker” This OCO-2 data production run of 1000 x 32vCPUs affected the market prices Strategy: • Mitigate impact on spot market • Diversification of resources • “Spot fleet”
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. “Thundering Herd” Fleet of ASG compute instances calling same services at same time • “API rate limit exceeded” “Jittering” the API calls • Introduce randomizations to API calls • Distributes load on infrastructure Service Compute fleet instances
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Next generation NASA missions • The volume of data produced is larger than previous missions • Data storage, processing, movement, and costs are the biggest challenges 0 10 20 30 40 50 60 70 80 90 100 OCO-2 SMAP NISAR Estimated Daily “Keep Up” Volume (TB) (2021)(2009) (2015)
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. The future of Auto Scaling: Application Auto Scaling
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Application Auto Scaling: Databases Amazon Aurora Amazon DynamoDB
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Application Auto Scaling: Compute EMR Cluster Task Task Task ECS Service EC2 Spot Fleet
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Learn more CMP301 Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix Thursday, November 30, 12:15 p.m. - Venetian, Level 5, Palazzo P aws.amazon.com/autoscaling/getting-started
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank you!