This complementary presentation discusses the use of AWS as a disaster recovery (DR) environment and how the architectural approaches to DR in the AWS cloud make a perfect learning ground for more complex production application deployments.
In this webinar we introduce differing architectural patterns for using AWS as a disaster recovery platform and how AWS can make a cost effective part of your overall business continuity planning. By discussing a customer use-case end to end, we'll highlight key lessons learned to help you on your journey into the AWS cloud.
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Journey Through the Cloud: Disaster Recovery Best Practices and AWS Services
1. Journey through the Cloud:
Disaster Recovery
Ryan Shuttleworth – Technical Evangelist
@ryanAWS
2. Common use cases & stepping stones into the AWS cloud
Learning from customer journeys
Best practices to bootstrap your projects
Journey through the cloud
3. Explore AWS for a ‘non-production’ use case
Phase systems into ‘live’ DR use with relative ease
Choose your success objectives for a cloud project ‘out of band’
Disaster recovery
4. Why AWS for disaster recovery?
AWS services that can be employed
Common DR architectures
Customer example
Where to go next
Agenda
6. AWS is used in a variety of ways…
AWS & Disaster Recovery
Backup and disaster recovery
system for its remote sales offices
Fast, secure and cost effective
backup and DR for Oracle Apps
Disaster recovery solution to backup
and store critical medical image data
DR and testing environment reducing
IT overhead and increasing availability
7. You might be able to:
Business & technical drivers
Reduce costs
Slash DR budgets by up to 50%
Reduce on-premise
Eliminate 30%+ of on-premise
physical equipment
Consolidate sites
Eliminate the need to run a
secondary site
Remove aging
technologies
Eliminate tape for backup and
archive
8. DR is part of a wider set of policies and controls…
DR & business continuity
High availability Backup Disaster recovery
Keep your applications
running 24x7
Make sure your data is safe Get your applications and
data back after a major
disaster
9. DR is part of a wider set of policies and controls…
DR & business continuity
It’s not an all or nothing thing
Choose what needs to failover and what does not
Some things more important than others
Some things will still be working
High availability Backup Disaster recovery
Keep your applications
running 24x7
Make sure your data is safe Get your applications and
data back after a major
disaster
10. Each set of IT assets will have different requirements…
DR & business continuity
Recovery Time
Objective (RTO)
How quickly you need this asset to be
recovered?
e.g. 1min? 15min? 1hr? 4hrs? 1day?
Recovery Point
Objective (RPO)
How ‘fresh’ the recovery must be for the
asset?
e.g. zero data loss, 15mins out of date?
11. Assets will sit on a spectrum of technical complexity…
DR & business continuity
Rebuild when
required from
offsite backup
Run hot-hot
configuration with
auto-failover
12. The fundamental economic model…
Utility, on-demand datacenter
Primary Site
Routers
Firewalls
Network
Application Licenses
Operating Systems
Hypervisor
Servers
SAN
Primary Storage
Backup
Archive
Secondary Site
Routers
Firewalls
Network
Application Licenses
Operating Systems
Hypervisor
Servers
SAN
Primary Storage
Backup
Archive
13. The fundamental economic model…
Utility, on-demand datacenter
Primary Site
Routers
Firewalls
Network
Application Licenses
Operating Systems
Hypervisor
Servers
SAN
Primary Storage
Backup
Archive
AWS
Routers
Firewalls
Network
Application Licenses
Operating Systems
Hypervisor
Servers
SAN
Snapshot Storage
Backup
Archive
14. The fundamental economic model…
Utility, on-demand datacenter
Primary Site
Routers
Firewalls
Network
Application Licenses
Operating Systems
Hypervisor
Servers
SAN
Primary Storage
Backup
Archive
AWS
Routers
Firewalls
Network
Application Licenses
Operating Systems
Hypervisor
Servers
SAN
Snapshot Storage
Backup
Archive
Secondary
site costs
18. Amazon
Simple
Storage
Service (S3)
AWS Import/Export
AWS Storage
Gateway Service
AWS Direct
Connect
Amazon Virtual
Private Cloud
(VPC)
Amazon
Route 53
Amazon Elastic
Compute Cloud
(EC2)
Amazon Relational
Database Service (RDS)
Amazon
Elastic Block
Storage (EBS)
Object storage &
transfer services
Networking services Foundation services
19. S3 and Elastic Block Store
AWS storage is ideal for DR
Simple Storage Service
Highly scalable object storage
1 byte to 5TB in size
99.999999999% durability
Elastic Block Store
High performance block storage device
1GB to 1TB in size
Mount as drives to instances with
snapshot/cloning functionalities
20. Direct Connect
Dedicated connection between your IT
infrastructure and the AWS datacenters
Extend your network infrastructure and
VLANs into AWS
VPN Connection
A Hardware VPN connection connects
amazon environment to your datacenter
Internet Protocol security (IPsec) VPN
connection
Commonly used hardware supported
Virtual Private Cloud
Private, isolated section of the AWS Cloud
Launch resources in a virtual network that you
define
complete control over your virtual networking
environment
Internet
Internet
Networking options
22. 4 main patterns
Common DR architectures
Backup & Restore Pilot light
Warm standby in
AWS
Multi-site solution
in AWS & on-
premise
23. We’ll focus on 2 of them…
Common DR architectures
Backup & Restore Pilot light
Warm standby in
AWS
Multi-site solution
in AWS & on-
premise
24. Let’s start with Backup & Restore
Common DR architectures
Backup & Restore Pilot light
Warm standby in
AWS
Multi-site solution
in AWS & on-
premise
25. Advantages to starting a journey with this pattern
Backup & Restore pattern
Simple to get started
Easy starting point for exploring the
AWS cloud
Low technical barrier to entry
Focus on incorporating cloud into your
DR strategy, not on complex technical
issues related to hot-hot systems
Cost effective
Very high levels of data durability at
low price
Cost of storing snapshots in S3
Archiving possibilities beyond tape
using Glacier
26. The preparation process…
Backup & Restore pattern
Take backups of
current systems
Store backups
in S3
Move to long term
archive in Glacier
27. The process…
Backup & Restore pattern
Take backups of
current systems
Store backups
in S3
Detail how you will restoring from backup or
recover from archive
Move to long term
archive in Glacier
29. Long term Glacier archive
Durable
Designed for 99.999999999%
durability of archives
Cost effective
Write-once, read-never. Cost effective for
long term storage. Pay for accessing data
33. Logs
Txns
✗accessible from S3
Objects
expire and
are deleted
time
accessible from S3
Object
transition to
Glacier invoked
Restoration of
object requested
for x hrs
ExpiryTransition
34. Logs
Txns
✗accessible from S3
Objects
expire and
are deleted
time
accessible from S3
Object
transition to
Glacier invoked
Restoration of
object requested
for x hrs
3-5hrs
Object held in S3
RRS for x hrs
ExpiryTransition
46. Gateway stored
volumes
Data stored locally
Asynchronous backup
EBS snapshots
iSCSI local interface
Up to 1TB volumes
Gateway cached
volumes
Data stored in S3
Recently read data cached
Low latency
iSCSI local interface
Up to 32TB volumes
49. Let’s look at the Pilot Light pattern…
Common DR architectures
Backup & Restore Pilot light
Warm standby in
AWS
Multi-site solution
in AWS & on-
premise
50. Moving along the DR spectrum…
Pilot light architecture
Build resources around
replicated dataset
Keep ‘pilot light’ on by replicating core
databases
Build AWS resources around dataset and
leave in stopped state
51. Moving along the DR spectrum…
Pilot light architecture
Build resources around
replicated dataset
Keep ‘pilot light’ on by replicating core
databases
Build AWS resources around dataset and
leave in stopped state
Scale resources in AWS in
response to a DR event
Start up pool of resources in AWS when
events dictate
Match current production capacity through
auto-scaling polcies
52. Moving along the DR spectrum…
Pilot light architecture
Build resources around
replicated dataset
Keep ‘pilot light’ on by replicating core
databases
Build AWS resources around dataset and
leave in stopped state
Scale resources in AWS in
response to a DR event
Start up pool of resources in AWS when
events dictate
Match current production capacity through
auto-scaling policies
Switch-over to system in AWS
56. EU region DR site for range of business applications
All running in a Virtual Private Cloud (VPC)
DR provision for applications dependent on Oracle and SQL Server databases
Includes DR for Active Directory and Windows file shares
57. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
58. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Dual route
connectivity
59. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Active Directory
Replication
60. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Bastion
Host
61. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Database
replication
62. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Application images
63. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Desktop
environments
64. VPC Subnet B
Region
Availability Zone
Client-to-site VPN Site-to-site VPN
S3 Buckets
with Objects
Bastion Host
Internet
On-premise
Data Centre A
Remote
Desktops
AWS Direct Connect
On-premise
Data Centre B
VPC Subnet D VPC Subnet F
Databases
VPC Subnet E
Applications
VPC Subnet A
SmartSentinel
VPC Subnet G
File
Servers
VPC Subnet C
Active
Directory
Proxy Server
Durable data
backups
69. The cloud makes backup and recovery easy
You can get started for pennies per month
The cloud will scale to accommodate all of your data
You retain visibility and control of your information