In addition to running databases in Amazon EC2, AWS customers can choose among a variety of managed database services. These services save effort, save time, and unlock new capabilities and economies. In this session, we make it easy to understand how they differ, what they have in common, and how to choose one or more. We explain the fundamentals of Amazon DynamoDB, a fully managed NoSQL database service; Amazon RDS, a relational database service in the cloud; and Amazon Redshift, a fully managed, petabyte-scale data-warehouse solution that can be surprisingly economical. We will cover how each service might help support your application and how to get started.
3. If you host your databases on premises
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
4. If you host your databases in Amazon EC2
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
5. If you host your databases in Amazon EC2
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
Scaling
High availability
DB software installs
OS installation
you
App optimization
6. If you choose a managed DB service
Power, HVAC, net
Rack and stack
Server maintenance
OS patches
DB software patches
Database backups
App optimization
High availability
DB software installs
OS installation
you
Scaling
8. A managed service for each major DB type
Amazon
DynamoDB
Document
and key-
value store
Amazon
RDS
Relational
database
engines
Amazon
Redshift
Data
warehouse
12. RDS is simple and fast to scale
Database instance types
offer a range of CPU and
memory selections
Scale up or down among
instance types on demand
Database storage is
scalable on demand
13. RDS offers fast, predictable storage
General Purpose
(SSD) for most
workloads
Provisioned IOPS
(SSD) for OLTP
workloads up to
30,000 IOPS
Magnetic for small
workloads with
infrequent access
14. Fault tolerance with multi-AZ deployments
Enterprise-grade fault tolerance solution for
production databases
15. Choose Read Replicas for scalability and enhanced
data locality
Relieve pressure on your master
node for supporting reads and
writes
Even faster recovery in the event
of disaster
Bring data close to your
customers
Promote to a master for easy
migration
16. How do RDS backups work?
Automated backups
Restore your database to a point in
time
Enabled by default
Choose a retention period, up to 35
days
Manual snapshots
Build a new database instance from a
snapshot when needed
Initiated by you
Persist until you delete them
Stored in S3
18. A bit of history …
Re-imagining relational databases for the cloud era
19. Multiple layers of
functionality all in a
monolithic stack
SQL
Transactions
Caching
Logging
Relational databases were not designed for the
cloud
20. Not much has changed in last 20 years
Even when you scale it out, you’re still replicating the same stack
SQL
Transactions
Caching
Logging
SQL
Transactions
Caching
Logging
Application
SQL
Transactions
Caching
Logging
SQL
Transactions
Caching
Logging
Application
SQL
Transactions
Caching
Logging
SQL
Transactions
Caching
Logging
Storage
Application
22. Meet Amazon Aurora
R Speed and availability of high-end commercial databases
R Simplicity and cost-effectiveness of open source databases
R Drop-in compatibility with MySQL
R Simple pay-as-you-go pricing
Delivered as a managed service
Databases reimagined for the cloud
23. Scale-out, distributed, multi-tenant architecture
§ Storage volume is striped across
hundreds of storage nodes distributed
over 3 different Availability Zones
§ Six copies of data, two copies in each
Availability Zone to protect against
AZ+1 failures
§ 5x faster than MySQL on same
hardware
§ SysBench: 100 K writes/sec and 500 K
reads/sec
§ Designed for 99.99% availability
§ Scale to 64 TB and 15 Read Replicas
Master Replica Replica Replica
Availability
Zone 1
Shared storage volume
Availability
Zone 2
Availability
Zone 3
Storage nodes with SSDs
SQL
Transactions
Caching
SQL
Transactions
Caching
SQL
Transactions
Caching
25. Innovid's cross-platform marketing technology delivers video ads to any device
including desktop, mobile, tablet, and over 25 different connected TV devices
such as over-the-top (OTT) enabled boxes, smart TVs.
About Innovid
220+
ENTERPRISE CLIENTS
500M+
Daily Impressions
26. 26
The World
(High Scale)
Analytics
(High Scale)
RedShift
Innovid Database Usage
Innovid Employees
(Low Scale)
SERVING
CouchBase
STUDIO
MySQL
Backend
(Mid Scale)
MongoDB
Cassandra
Elasticsearch
MySQL
33. DynamoDB: a managed document and key-
value store
• Simple and fast to deploy, easy to scale
• Data is automatically replicated
• Fast, predictable performance
• Helps provide security and control
• Fine-grained access control
• No cost to get started
• Pay only for what you consume
34. DynamoDB: A schemaless database
Attributes
Schemaless
Schema is defined per item
Items
Table
Item
key
35. Writes
Replicated continuously to 3 AZs
Persisted to disk (custom SSD)
Reads
Strongly or eventually consistent
No latency trade-off
Automatic replication for rock-solid durability and
availability
36. Popular use cases
Ad tech IoT Gaming
Mobile
& web
Ad serving, ID
lookup, user
profile
management,
session
tracking, real-
time bidding
Tracking state,
metadata and
readings from
millions of
devices,
real-time
notifications
Recording
game details,
leaderboards,
session
information,
usage history,
and logs
Storing user
profiles,
session details,
personalization
settings,
entity-specific
metadata
38. NoSQL vs. relational DB for a new app: How to choose?
• Schemaless, easy reads
and writes, simple data
model
• Scaling is easy
• Focus on performance and
availability at any scale
• Strong schema, complex
relationships,
transactions and joins
• Scaling is difficult
• Focus on consistency
over scale and availability
NoSQL SQL
40. Amazon
Redshift
a lot faster
a lot cheaper
a whole lot simpler
Relational data warehouse
Massively parallel; petabyte scale
Fully managed
HDD and SSD platforms
$1,000/TB/year; starts at $0.25/hour
41. Amazon Redshift is easy to use
Provisioning in
minutes
Automatic patching SQL - Data loading
Backups are built-in Security is built-in Compression is built-in
42. Security is built in
VPC Access Management Cluster Security
Groups
Cluster Encryption
SSL Connections
Load Data Encryption Security Compliance
43. Amazon Redshift is fast
Dramatically less I/O
Column storage
Data compression
Zone maps
Direct-attached storage
Large data block sizes
10 | 13 | 14 | 26 |…
… | 100 | 245 | 324
375 | 393 | 417…
… 512 | 549 | 623
637 | 712 | 809 …
… | 834 | 921 | 959
10
324
375
623
637
959
ID Age State Amount
123 20 CA 500
345 25 WA 250
678 40 FL 125
957 37 WA 375
44. Fully managed, continuous/incremental backups
Multiple copies within cluster
Continuous and incremental backups
to S3
Continuous and incremental backups
across regions
Streaming restore
S3
S3
Region 1
Region 2
45. Amazon Redshift offers rock-solid fault tolerance
S3
S3
Region 1
Region 2
Disk failures
Node failures
Network failure
Availability Zone or region-level
disasters
46. Popular use cases
10x cheaper
Easy to provision
Greater DBA productivity
Traditional
enterprises
10x faster
No programming
Easily leverage BI tools,
Hadoop, machine
learning, streaming
Companies
with big data
Analysis in line with
process flows
Pay as you go, grow as
you need
Managed availability and
disaster recovery
SaaS
companies
48. Benefits of AWS managed database services
Pay only for what
you use
No upfront cost
Fully managed
services
AWS handles
installs, patching,
restarts
Easy to scale
Grow as you need
Designed for use
with other AWS
services
AWS
Data Pipeline
Amazon
EC2
Amazon
S3
Amazon
CloudWatch
Amazon
SNS
Amazon
VPC