SlideShare una empresa de Scribd logo
1 de 49
Descargar para leer sin conexión
CCA - NoDerivs 3.0 Unported License - Usage OK, no modifications, full attribution*!
* All unlicensed or borrowed works retain their original licenses
Pets vs. Cattle:!
The Elastic Cloud Story
!
DevOps Chicago Meetup!
February 26, 2014
@randybias
A Tale of Two Clouds
2
Enterprise Computing Approach
3
GUI Driven!
Ticket-Based!
Hand-Crafted!
Reserved !
Scale-up!
Smart Hardware!
Proprietary!
Traditional Dev!
…
Cloud Computing Approach
4
API Driven!
Self-Service!
Automated!
On-demand!
Scale-out!
Smart Apps!
Open Source!
Agile DevOps!
…
Elastic Cloud Shifts Uptime Responsibility
5
Enterprise Model Cloud Model
99.9%!
Applications!
(8h46m down)
99.999%!
Infrastructure!
($$$$)
99.999%
Applications!
(5m down)
99%
Infrastructure!
($$)
Elastic Cloud Origins
6
Elastic !
Private Cloud
Enterprise Virtualization!
Private Cloud
Elastic &
Virtualization
2.0 Clouds are 

very different.!
!
Different
workloads.!
!
Different !
architectures.!
!
Different !
skills.!
!
Different
economics.
≠
Virtual Infrastructure


Standardization, Automation,!
Chargeback, Self-Service!
Designed for Server Consolidation !
IT Admins manage Infrastructure!
Ticket-based manual provisioning!
Improves virtualization value
=
+
Elastic Public Cloud


On-premise 

Deployment!
Designed for Agility!
Cloud Admins manage Services!
Self-service automated provisioning!
Delivers cloud value on-premise
=
+
What Companies Care About?
7
Cloud
Computing!
Agile
Development!
Business !
Agility!
Operational
Discipline!
ACCELERATING!
TIME TO VALUE!
Continuous
Integration
Continuous
Testing &
Delivery
Agile
Methodologies
IaaS / PaaS
!
!
Public / Private /
Hybrid
!
!
Big Data /
Analytics
!
!
Public APIs
Continuous
Deployment
DevOps Data Center & 

App Automation
Line of
Business
Enablement
New App
Initiatives
(Mobile, SaaS,
etc.)
Data Center
Modernization
Elastic Cloud is a Mindset Change
8
Attribution: Bill Baker, Distinguished Engineer, Microsoft
bowzer.company.com!
(scale-up)
web001.company.com!
(scale-out)
(Virtual) Servers *are* cattle
Pets vs. Cattle Takes Off
9
Microsoft
Cloudscaling
CERN
IBM
ScalrRackspace
Red Hat
Scale-out, not UP in Cloud
(Some) Elastic Cloud Patterns
!
!
!
What follows are *some* Elastic Cloud Patterns!
There are many more, but these are mine!
Input, ideas, & other thoughts welcome via twitter / email
10
Big Failure Domains !
Make Big Craters
11
Big Failure Domains !
Make Big Craters
12
Anti-Pattern
Anti-Pattern
Smaller Failure Domains
13
Would you rather have the whole cloud down !
or just a small bit of it for a short time?
vs
Loose Coupling
14
Synchronous, blocking
calls mean cascading
failures.
Async, non-block calls
mean failure in
isolation.
Open Source Software
15
Excessive software
taxation is the past.
Black boxes
create lock-in.
You can !
always fork.
Uptime in Software Self-management
16
Hardware fails.!
Software fails.!
People fail.
Only software can
measure itself &
respond to failure in
near real-time.
Applications designed
for 99.999% uptime can
run anywhere
Scale Out vs Scale up
17
Vertical Scaling 

Make boxes bigger 

(usually an HA pair)
Horizontal Scaling

Make more boxes

A
A
➔
➔
B
B ...A B C N
Circuit Breaker Pattern
18
Fallback mechanisms (e.g.
cached data) 

ensure uninterrupted service
while giving service time to
recover
When failing service
detected, stop calling that
API and serve fallback
responses
Buy from ODMs
19
ODMs operate their
businesses on 3-10%
margins.
AMZN, GOOG, and
Facebook buy direct
without a middleman.
Only a few enterprise
vendors are pivoting to
compete.
Less Enterprise “Value” in x86 Servers
20
Generic servers rule. Full
stop. Nothing is better
because nothing else is
*generic*.
“... a data center full of vanity free servers
... more efficient ... less expensive to build
and run ... “ - OCP
Fully Routed (L3) Networking
21
The largest cloud operators
all run layer-3 routed,
networks with no VLANs.
Cloud-ready apps don’t
need or want VLANs.
Enterprise apps can be
supported on elastic clouds
using Software-defined
Networking (SDN)
Software-defined Networking (SDN)
22
• x86 server is the new Linecard"
• network switch is the new ASIC"
• VXLAN (or NVGRE) is the new Chassis"
• SDN Controller is the new SUP Engine
“Network Virtualization”
Flat Networking + SDNs
23
Flat + SDN co-exist
& thrive together
Standard
Security
Group
1 2
Availability
Zone
VM VM
VM
VM
VM
VM
Virtual L2 Network
VM
VMVM
Virtual Private
Cloud
Networking
VPC
Security
Group
Internet
VPC
Gateway
Physical
Node
RAIS instead of HA Pairs/Clusters
Redundant arrays of inexpensive services (RAIS)!
Load balanced with no state sharing!
Active … active … active … active … !
On failure, connections are lost, but failures are rare!
Rolling upgrades are easier, because each server is an island!
Think: scale-out + fault isolation (sharding)!
Ridiculously simple & scalable!
Hardware failures are infrequent & impact subset of traffic!
(N-F)/N, where N = total, F = failed!
10 RAIS servers - 1 failure == 90% capacity!
Most things retry anyway!
Cascade failures are unlikely and failure domains are small
24
Service Array (RAIS) Example
25
Backbone Routers
Cloud Access Switches
AZ (Spine) Switches
RAIS (NAT, LB, VPN)
OSPF Route Announcements
Return Traffic (default or source
NAT)
API
Public IP
Blocks
Cloud
Control
Plane
Lots of Inexpensive 1RU Switches
26
1RU: 6K-30K VMs / AZ
Simple spine-and-leaf flat routed network
Rack 1 Rack 2 Rack 3
Modular: 40K-200K VMs / AZ
Rack 1
Rack 2
Multiple
Racks
Rack 1
Rack 2
Multiple
Racks
Rack 1
Rack 2
Multiple
Racks
Direct-attached Storage (DAS)
27
Cloud-ready apps
manage their own
data replication.
DAS is the smallest failure
domain possible with
reasonable storage I/O.
SAN == massive failure
domain.
SSDs will be the great
equalizer.
Elastic Block Device Services
28
EBS/EBD is a crutch
Bigger failure domains 

(AWS outage anyone?), complex,
sets high expectations
Sometimes you need a crutch.
When you do, overbuild the network, and make sure
you have a smart scheduler.
AWS EBS Outage!
http://aws.amazon.com/message/65648/
More Servers == More Storage I/O
29
>1M writes/second, triple-
redundancy w/ Cassandra on AWS
Linear scale-out == linear costs for performance
Hypervisors are a Commodity
30
Cloud end-users want OS
of choice, not HVs.
Level up! Managing iron is for
mainframe operators."
… hypervisors are bare metal APIs
Hypervisor of the future is open
source, easily modifiable, &
extensible.
The Hypervisor of the Future 

May Be NO Hypervisor
31
LXC
ironic
Bare Metal Cloud
Quiz Time
32
Quiz Time
33
Pets Cattle
NIC bonding?
Quiz Time
34
Pets Cattle
NIC bonding ➔
Quiz Time
35
Pets Cattle
NIC bonding
Managing a Server 

at a Time?
Quiz Time
36
Pets Cattle
NIC bonding
Managing a Server

at a Time ➔
Quiz Time
37
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling?
Quiz Time
38
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling➔
Quiz Time
39
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure?
Quiz Time
40
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure➔
Quiz Time
41
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals?
Quiz Time
42
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals ➔
Quiz Time
43
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals
HA pairs for redundancy?
Quiz Time
44
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals
HA pairs for redundancy ➔
Quiz Time
45
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals
HA pairs for redundancy
Shared Nothing
Architecture?
Quiz Time
46
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals
HA pairs for redundancy
Shared Nothing
Architecture➔
Quiz Time
47
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals
HA pairs for redundancy
Shared Nothing
Architecture
Persistent Block Storage?
Quiz Time
48
Pets Cattle
NIC bonding
Managing Server at a
Time
Auto-scaling
Design-for-Failure
100% Uptime Goals
HA pairs for redundancy
Shared Nothing
Architecture
Persistent Block Storage
➔
Q & A
49
Randy Bias!
Founder & CEO, Cloudscaling!
Director, OpenStack Foundation!
@randybias

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Chaos Engineering
Chaos EngineeringChaos Engineering
Chaos Engineering
 
Microsoft Azure
Microsoft AzureMicrosoft Azure
Microsoft Azure
 
A Roadmap to Cloud Center of Excellence Adoption
A Roadmap to Cloud Center of Excellence AdoptionA Roadmap to Cloud Center of Excellence Adoption
A Roadmap to Cloud Center of Excellence Adoption
 
DevOps Best Practices
DevOps Best PracticesDevOps Best Practices
DevOps Best Practices
 
[WhaTap DevOps Day] 세션 6 : 와탭랩스 DevOps 이야기
[WhaTap DevOps Day] 세션 6 : 와탭랩스 DevOps 이야기[WhaTap DevOps Day] 세션 6 : 와탭랩스 DevOps 이야기
[WhaTap DevOps Day] 세션 6 : 와탭랩스 DevOps 이야기
 
Chaos Engineering: Why the World Needs More Resilient Systems
Chaos Engineering: Why the World Needs More Resilient SystemsChaos Engineering: Why the World Needs More Resilient Systems
Chaos Engineering: Why the World Needs More Resilient Systems
 
DevSecOps, An Organizational Primer - AWS Security Week at the SF Loft
DevSecOps, An Organizational Primer - AWS Security Week at the SF LoftDevSecOps, An Organizational Primer - AWS Security Week at the SF Loft
DevSecOps, An Organizational Primer - AWS Security Week at the SF Loft
 
20170919 AWS Black Belt Online Seminar AWS Database Migration Service
20170919 AWS Black Belt Online Seminar AWS Database Migration Service20170919 AWS Black Belt Online Seminar AWS Database Migration Service
20170919 AWS Black Belt Online Seminar AWS Database Migration Service
 
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
 
Serverless時代のJavaについて
Serverless時代のJavaについてServerless時代のJavaについて
Serverless時代のJavaについて
 
Microsoft azure overview
Microsoft azure overviewMicrosoft azure overview
Microsoft azure overview
 
Microservices Architectures: Become a Unicorn like Netflix, Twitter and Hailo
Microservices Architectures: Become a Unicorn like Netflix, Twitter and HailoMicroservices Architectures: Become a Unicorn like Netflix, Twitter and Hailo
Microservices Architectures: Become a Unicorn like Netflix, Twitter and Hailo
 
[最新バージョンの情報がDescription欄にございます]AWS Black Belt Online Seminar 2018 Amazon Connect
[最新バージョンの情報がDescription欄にございます]AWS Black Belt Online Seminar 2018 Amazon Connect[最新バージョンの情報がDescription欄にございます]AWS Black Belt Online Seminar 2018 Amazon Connect
[最新バージョンの情報がDescription欄にございます]AWS Black Belt Online Seminar 2018 Amazon Connect
 
Databases on AWS Workshop.pdf
Databases on AWS Workshop.pdfDatabases on AWS Workshop.pdf
Databases on AWS Workshop.pdf
 
Nutanix
NutanixNutanix
Nutanix
 
Pivotal Cloud Foundry: A Technical Overview
Pivotal Cloud Foundry: A Technical OverviewPivotal Cloud Foundry: A Technical Overview
Pivotal Cloud Foundry: A Technical Overview
 
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
Operational Excellence with Containerized Workloads Using AWS Fargate (CON320...
 
Cost Optimisation on AWS
Cost Optimisation on AWSCost Optimisation on AWS
Cost Optimisation on AWS
 
Chaos engineering
Chaos engineering Chaos engineering
Chaos engineering
 
微服務基礎建設 - Message Queue
微服務基礎建設 - Message Queue微服務基礎建設 - Message Queue
微服務基礎建設 - Message Queue
 

Similar a Pets vs. Cattle: The Elastic Cloud Story

Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Kai Wähner
 

Similar a Pets vs. Cattle: The Elastic Cloud Story (20)

[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
[Capitole du Libre] #serverless -  mettez-le en oeuvre dans votre entreprise...
 
An introduction to Serverless
An introduction to ServerlessAn introduction to Serverless
An introduction to Serverless
 
AWS re:Invent 2016: Serverless IoT Back Ends (IOT401)
AWS re:Invent 2016: Serverless IoT Back Ends (IOT401)AWS re:Invent 2016: Serverless IoT Back Ends (IOT401)
AWS re:Invent 2016: Serverless IoT Back Ends (IOT401)
 
Internet Scale Architecture
Internet Scale ArchitectureInternet Scale Architecture
Internet Scale Architecture
 
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
Apache Kafka vs. Traditional Middleware (Kai Waehner, Confluent) Frankfurt 20...
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB) - Friends, Enemies or ...
 
Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud
Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your CloudCloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud
Cloud Expo Silicon Valley 2013 | Why Lease When You Can Buy Your Cloud
 
Serverless solutions - AWS Summit SG 2017
Serverless solutions - AWS Summit SG 2017 Serverless solutions - AWS Summit SG 2017
Serverless solutions - AWS Summit SG 2017
 
Different I/O Access Methods for Linux, What We Chose for ScyllaDB, and Why
Different I/O Access Methods for Linux, What We Chose for ScyllaDB, and WhyDifferent I/O Access Methods for Linux, What We Chose for ScyllaDB, and Why
Different I/O Access Methods for Linux, What We Chose for ScyllaDB, and Why
 
Battery Ventures: Simulating and Visualizing Large Scale Cassandra Deployments
Battery Ventures: Simulating and Visualizing Large Scale Cassandra DeploymentsBattery Ventures: Simulating and Visualizing Large Scale Cassandra Deployments
Battery Ventures: Simulating and Visualizing Large Scale Cassandra Deployments
 
Devoxx university - Kafka de haut en bas
Devoxx university - Kafka de haut en basDevoxx university - Kafka de haut en bas
Devoxx university - Kafka de haut en bas
 
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
Learning Rust the Hard Way for a Production Kafka + ScyllaDB PipelineLearning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
 
ServerlessConf Tokyo キーノート
ServerlessConf Tokyo キーノートServerlessConf Tokyo キーノート
ServerlessConf Tokyo キーノート
 
The Best of Both Worlds: Implementing Hybrid IT with AWS
The Best of Both Worlds: Implementing Hybrid IT with AWSThe Best of Both Worlds: Implementing Hybrid IT with AWS
The Best of Both Worlds: Implementing Hybrid IT with AWS
 
AWS Serverless Community Day Keynote and Vendia Launch 6-26-2020
AWS Serverless Community Day Keynote and Vendia Launch 6-26-2020AWS Serverless Community Day Keynote and Vendia Launch 6-26-2020
AWS Serverless Community Day Keynote and Vendia Launch 6-26-2020
 
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
Architecture patterns for distributed, hybrid, edge and global Apache Kafka d...
 
AWS Serverless patterns & best-practices in AWS
AWS Serverless  patterns & best-practices in AWSAWS Serverless  patterns & best-practices in AWS
AWS Serverless patterns & best-practices in AWS
 
Running serverless at scale
Running serverless at scaleRunning serverless at scale
Running serverless at scale
 
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
Apache Kafka vs. Integration Middleware (MQ, ETL, ESB)
 
Getting Started with AWS Lambda and the Serverless Cloud
Getting Started with AWS Lambda and the Serverless CloudGetting Started with AWS Lambda and the Serverless Cloud
Getting Started with AWS Lambda and the Serverless Cloud
 

Más de Randy Bias

Más de Randy Bias (20)

Services are the New Cloud Platform (Services-as-a-Platform)
Services are the New Cloud Platform (Services-as-a-Platform)Services are the New Cloud Platform (Services-as-a-Platform)
Services are the New Cloud Platform (Services-as-a-Platform)
 
Rebooting the OpenContrail Community
Rebooting the OpenContrail CommunityRebooting the OpenContrail Community
Rebooting the OpenContrail Community
 
State of the Stack v4 - OpenStack in All It's Glory
State of the Stack v4 - OpenStack in All It's GloryState of the Stack v4 - OpenStack in All It's Glory
State of the Stack v4 - OpenStack in All It's Glory
 
Connect Expo 2015 - Australia - Bringing OpenStack into the Enterprise
Connect Expo 2015 - Australia - Bringing OpenStack into the EnterpriseConnect Expo 2015 - Australia - Bringing OpenStack into the Enterprise
Connect Expo 2015 - Australia - Bringing OpenStack into the Enterprise
 
The Cloud Revolution - Philippines Cloud Summit
The Cloud Revolution - Philippines Cloud SummitThe Cloud Revolution - Philippines Cloud Summit
The Cloud Revolution - Philippines Cloud Summit
 
The Lie of a Benevolent Dictator; the Truth of a Working Democratic Meritocracy
The Lie of a Benevolent Dictator; the Truth of a Working Democratic MeritocracyThe Lie of a Benevolent Dictator; the Truth of a Working Democratic Meritocracy
The Lie of a Benevolent Dictator; the Truth of a Working Democratic Meritocracy
 
OpenStack Architected Like AWS (and GCP)
OpenStack Architected Like AWS (and GCP)OpenStack Architected Like AWS (and GCP)
OpenStack Architected Like AWS (and GCP)
 
OpenStack Scale-out Networking Architecture
OpenStack Scale-out Networking ArchitectureOpenStack Scale-out Networking Architecture
OpenStack Scale-out Networking Architecture
 
SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013
SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013
SFBay OpenStack Meetup // Neutron and SDN in Production – Dec 3 2013
 
AWS Repatriation: Bring Your Apps Back
AWS Repatriation: Bring Your Apps BackAWS Repatriation: Bring Your Apps Back
AWS Repatriation: Bring Your Apps Back
 
State of the Stack v2
State of the Stack v2State of the Stack v2
State of the Stack v2
 
Networking is NOT Free: Lessons in Network Design
Networking is NOT Free: Lessons in Network DesignNetworking is NOT Free: Lessons in Network Design
Networking is NOT Free: Lessons in Network Design
 
Scale-Out Block Storage
Scale-Out Block StorageScale-Out Block Storage
Scale-Out Block Storage
 
State of the Stack April 2013
State of the Stack April 2013State of the Stack April 2013
State of the Stack April 2013
 
Open Cloud System Networking Vision
Open Cloud System Networking VisionOpen Cloud System Networking Vision
Open Cloud System Networking Vision
 
OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"
OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"
OpenStack Summit :: Redundancy Doesn't Always Mean "HA" or "Cluster"
 
OpenStack Summit :: Profiling the Nova Scheduler
OpenStack Summit :: Profiling the Nova SchedulerOpenStack Summit :: Profiling the Nova Scheduler
OpenStack Summit :: Profiling the Nova Scheduler
 
OpenStack Summit :: Pimp My Cloud
OpenStack Summit :: Pimp My CloudOpenStack Summit :: Pimp My Cloud
OpenStack Summit :: Pimp My Cloud
 
2012 open storage summit keynote
2012 open storage summit   keynote2012 open storage summit   keynote
2012 open storage summit keynote
 
Distributed RPC in Nova with ZeroMQ
Distributed RPC in Nova with ZeroMQDistributed RPC in Nova with ZeroMQ
Distributed RPC in Nova with ZeroMQ
 

Último

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Último (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 

Pets vs. Cattle: The Elastic Cloud Story

  • 1. CCA - NoDerivs 3.0 Unported License - Usage OK, no modifications, full attribution*! * All unlicensed or borrowed works retain their original licenses Pets vs. Cattle:! The Elastic Cloud Story ! DevOps Chicago Meetup! February 26, 2014 @randybias
  • 2. A Tale of Two Clouds 2
  • 3. Enterprise Computing Approach 3 GUI Driven! Ticket-Based! Hand-Crafted! Reserved ! Scale-up! Smart Hardware! Proprietary! Traditional Dev! …
  • 4. Cloud Computing Approach 4 API Driven! Self-Service! Automated! On-demand! Scale-out! Smart Apps! Open Source! Agile DevOps! …
  • 5. Elastic Cloud Shifts Uptime Responsibility 5 Enterprise Model Cloud Model 99.9%! Applications! (8h46m down) 99.999%! Infrastructure! ($$$$) 99.999% Applications! (5m down) 99% Infrastructure! ($$)
  • 6. Elastic Cloud Origins 6 Elastic ! Private Cloud Enterprise Virtualization! Private Cloud Elastic & Virtualization 2.0 Clouds are 
 very different.! ! Different workloads.! ! Different ! architectures.! ! Different ! skills.! ! Different economics. ≠ Virtual Infrastructure 
 Standardization, Automation,! Chargeback, Self-Service! Designed for Server Consolidation ! IT Admins manage Infrastructure! Ticket-based manual provisioning! Improves virtualization value = + Elastic Public Cloud 
 On-premise 
 Deployment! Designed for Agility! Cloud Admins manage Services! Self-service automated provisioning! Delivers cloud value on-premise = +
  • 7. What Companies Care About? 7 Cloud Computing! Agile Development! Business ! Agility! Operational Discipline! ACCELERATING! TIME TO VALUE! Continuous Integration Continuous Testing & Delivery Agile Methodologies IaaS / PaaS ! ! Public / Private / Hybrid ! ! Big Data / Analytics ! ! Public APIs Continuous Deployment DevOps Data Center & 
 App Automation Line of Business Enablement New App Initiatives (Mobile, SaaS, etc.) Data Center Modernization
  • 8. Elastic Cloud is a Mindset Change 8 Attribution: Bill Baker, Distinguished Engineer, Microsoft bowzer.company.com! (scale-up) web001.company.com! (scale-out) (Virtual) Servers *are* cattle
  • 9. Pets vs. Cattle Takes Off 9 Microsoft Cloudscaling CERN IBM ScalrRackspace Red Hat Scale-out, not UP in Cloud
  • 10. (Some) Elastic Cloud Patterns ! ! ! What follows are *some* Elastic Cloud Patterns! There are many more, but these are mine! Input, ideas, & other thoughts welcome via twitter / email 10
  • 11. Big Failure Domains ! Make Big Craters 11
  • 12. Big Failure Domains ! Make Big Craters 12 Anti-Pattern Anti-Pattern
  • 13. Smaller Failure Domains 13 Would you rather have the whole cloud down ! or just a small bit of it for a short time? vs
  • 14. Loose Coupling 14 Synchronous, blocking calls mean cascading failures. Async, non-block calls mean failure in isolation.
  • 15. Open Source Software 15 Excessive software taxation is the past. Black boxes create lock-in. You can ! always fork.
  • 16. Uptime in Software Self-management 16 Hardware fails.! Software fails.! People fail. Only software can measure itself & respond to failure in near real-time. Applications designed for 99.999% uptime can run anywhere
  • 17. Scale Out vs Scale up 17 Vertical Scaling 
 Make boxes bigger 
 (usually an HA pair) Horizontal Scaling
 Make more boxes
 A A ➔ ➔ B B ...A B C N
  • 18. Circuit Breaker Pattern 18 Fallback mechanisms (e.g. cached data) 
 ensure uninterrupted service while giving service time to recover When failing service detected, stop calling that API and serve fallback responses
  • 19. Buy from ODMs 19 ODMs operate their businesses on 3-10% margins. AMZN, GOOG, and Facebook buy direct without a middleman. Only a few enterprise vendors are pivoting to compete.
  • 20. Less Enterprise “Value” in x86 Servers 20 Generic servers rule. Full stop. Nothing is better because nothing else is *generic*. “... a data center full of vanity free servers ... more efficient ... less expensive to build and run ... “ - OCP
  • 21. Fully Routed (L3) Networking 21 The largest cloud operators all run layer-3 routed, networks with no VLANs. Cloud-ready apps don’t need or want VLANs. Enterprise apps can be supported on elastic clouds using Software-defined Networking (SDN)
  • 22. Software-defined Networking (SDN) 22 • x86 server is the new Linecard" • network switch is the new ASIC" • VXLAN (or NVGRE) is the new Chassis" • SDN Controller is the new SUP Engine “Network Virtualization”
  • 23. Flat Networking + SDNs 23 Flat + SDN co-exist & thrive together Standard Security Group 1 2 Availability Zone VM VM VM VM VM VM Virtual L2 Network VM VMVM Virtual Private Cloud Networking VPC Security Group Internet VPC Gateway Physical Node
  • 24. RAIS instead of HA Pairs/Clusters Redundant arrays of inexpensive services (RAIS)! Load balanced with no state sharing! Active … active … active … active … ! On failure, connections are lost, but failures are rare! Rolling upgrades are easier, because each server is an island! Think: scale-out + fault isolation (sharding)! Ridiculously simple & scalable! Hardware failures are infrequent & impact subset of traffic! (N-F)/N, where N = total, F = failed! 10 RAIS servers - 1 failure == 90% capacity! Most things retry anyway! Cascade failures are unlikely and failure domains are small 24
  • 25. Service Array (RAIS) Example 25 Backbone Routers Cloud Access Switches AZ (Spine) Switches RAIS (NAT, LB, VPN) OSPF Route Announcements Return Traffic (default or source NAT) API Public IP Blocks Cloud Control Plane
  • 26. Lots of Inexpensive 1RU Switches 26 1RU: 6K-30K VMs / AZ Simple spine-and-leaf flat routed network Rack 1 Rack 2 Rack 3 Modular: 40K-200K VMs / AZ Rack 1 Rack 2 Multiple Racks Rack 1 Rack 2 Multiple Racks Rack 1 Rack 2 Multiple Racks
  • 27. Direct-attached Storage (DAS) 27 Cloud-ready apps manage their own data replication. DAS is the smallest failure domain possible with reasonable storage I/O. SAN == massive failure domain. SSDs will be the great equalizer.
  • 28. Elastic Block Device Services 28 EBS/EBD is a crutch Bigger failure domains 
 (AWS outage anyone?), complex, sets high expectations Sometimes you need a crutch. When you do, overbuild the network, and make sure you have a smart scheduler. AWS EBS Outage! http://aws.amazon.com/message/65648/
  • 29. More Servers == More Storage I/O 29 >1M writes/second, triple- redundancy w/ Cassandra on AWS Linear scale-out == linear costs for performance
  • 30. Hypervisors are a Commodity 30 Cloud end-users want OS of choice, not HVs. Level up! Managing iron is for mainframe operators." … hypervisors are bare metal APIs Hypervisor of the future is open source, easily modifiable, & extensible.
  • 31. The Hypervisor of the Future 
 May Be NO Hypervisor 31 LXC ironic Bare Metal Cloud
  • 35. Quiz Time 35 Pets Cattle NIC bonding Managing a Server 
 at a Time?
  • 36. Quiz Time 36 Pets Cattle NIC bonding Managing a Server
 at a Time ➔
  • 37. Quiz Time 37 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling?
  • 38. Quiz Time 38 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling➔
  • 39. Quiz Time 39 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure?
  • 40. Quiz Time 40 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure➔
  • 41. Quiz Time 41 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals?
  • 42. Quiz Time 42 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals ➔
  • 43. Quiz Time 43 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals HA pairs for redundancy?
  • 44. Quiz Time 44 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals HA pairs for redundancy ➔
  • 45. Quiz Time 45 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals HA pairs for redundancy Shared Nothing Architecture?
  • 46. Quiz Time 46 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals HA pairs for redundancy Shared Nothing Architecture➔
  • 47. Quiz Time 47 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals HA pairs for redundancy Shared Nothing Architecture Persistent Block Storage?
  • 48. Quiz Time 48 Pets Cattle NIC bonding Managing Server at a Time Auto-scaling Design-for-Failure 100% Uptime Goals HA pairs for redundancy Shared Nothing Architecture Persistent Block Storage ➔
  • 49. Q & A 49 Randy Bias! Founder & CEO, Cloudscaling! Director, OpenStack Foundation! @randybias