SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Matching the Database to the
Workload
Rick Houlihan
Principal Technologist, NoSQL
AWS
D A T 3 0 1
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda for This Session
• Database workload classifications
• Traditional approaches to scaling RDBMS
• How NoSQL databases compare
• The flavors of NoSQL on AWS
• What database to use when
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Why did you choose this database?
“Because we heard X is the best new thing.”
“Because we have a site license for X.”
“Because X is what we know how to use.”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Why did you choose this database?
“Because this database is purpose built to support
what my application is designed to do.”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Types of Database Workloads
• Online Transaction Processing (OLTP)
Most common type of app
• Online Analytics Processing (OLAP)
BI and ad-hoc data projections
• Decision Support Systems (DSS)
Long running query aggregations and projections
Operations
Analytics
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sizing the Workload
Unbounded problems are harder to solve
“I need a root cause analysis engine to correlate transaction level events to trading patterns
across global markets.”
Problems with limited scope are easier to solve
“I need a system to manage inventory in my store.”
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sizing the Database
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Scaling an RDBMS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sharded Relational DBs?
A B C D
?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
NoSQL Databases
• Denormalize and shard to
provide horizontal scale
• Near unbounded throughput
and storage
Collection 1
1 TB
Shard A
500
GB
Shard B
500
GB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
00 55 A954 FFAA00 FF
Partition Keys in NoSQL
Partition Key uniquely identifies an item
Partition Key is used for building an unordered hash index
Allows table to be partitioned for scale
Id = 1
Name = Jim
Hash (1) = 7B
Id = 2
Name = Andy
Dept = Eng
Hash (2) = 48
Id = 3
Name = Kim
Dept = Ops
Hash (3) = CD
Key Space
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Iron Triangle of Data - All About CAP
C
A
PConsistency:
all clients always have
the same view of data
Partition tolerance:
the system works well despite
physical network partitions
Availability:
all clients can always
read and write
CA
MSSQL
Oracle
DB2
MySQL
Aster Data
Greenplum
Postgres
CP
Big Table
Hypertable
HBase
MongoDB
Terastore
Couchbase
Scalaris
DynamoDB
BerkeleyDB
Memcached
Redis
Pick Two
AP
Voldemort
Tokyo Cabinet
KAI
DynamoDB
Cassandra
SimpleDB
CouchDB
Riak
Data Models:
Relational
Wide Column
Document
Key/Value
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Technology Adoption and the Hype Curve
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DW | Big Data Processing | Ad hoc
AWS Databases and Analytics
Broadest and deepest portfolio purpose-built for builders
Business Intelligence & Machine Learning
Data Movement
Database Migration Service | Snowball | Snowmobile | Kinesis Data Firehose | Kinesis Data Streams
Amazon QuickSight
Relational Databases
RDS
Aurora
Data lake (Batch/ETL)
S3/Glacier
(Storage)
Glue
(ETL & Data Catalog)
Machine Learning
Macie
(Data Protection)
NoSQL Databases
Analytics (OLAP/DSS)
DynamoDB
(Wide Column/Document)
ElastiCache
(Indexed Key Value)
Amazon
Redshift
EMR Athena
Kinesis Data
Analytics
Elasticsearch
Service
Real-time
Opertional (OLTP)
Neptune
(Graph)
QLDB
(Ledger)
Timestream
(TSDB)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon RDS
Managed relational database service with a choice of six popular database engines
Easy to administer Highly flexible Available &
durable
Fast
No need for infrastructure
provisioning, installing and
maintaining database
software.
Scale database compute
and storage with a few
mouse clicks and zero
downtime.
Multi-AZ:
Automatically
replicates data.
Automated backup,
snapshots, failover.
Choose between dual SSD-
backed storage for high-
performance OLTP.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon DynamoDB
Fast and flexible NoSQL database service for any scale
Key-value NoSQL database that supports both document and wide column structures
Fast, consistent
performance
Highly scalable Fully managed
Business-critical
reliability
Consistent single-digit
millisecond latencies at any
scale. DAX speeds up times
to microseconds.
Auto-scaling tables serving
millions of requests per
second, storing hundreds of
terabytes of data.
Automatic provisioning
and infrastructure
management.
Data replicated across
multiple AZs and accessed
with regionally available
APIs.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DynamoDB Schema
Table
Items
Attributes
Partition
Key
Sort
Key
Mandatory
Key-value access pattern
Determines data distribution
Optional
Model 1:N relationships
Enables rich query capabilities
All items for key
==, <, >, >=, <=
“begins with”
“between”
“contains”
“in”
sorted results
counts
top/bottom N values
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
SQL vs. NoSQL Design Pattern
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Neptune
Fully managed graph database
Fast Reliable Open
Query billions of
relationships with
millisecond latency
Six replicas of your
data across three AZs
with full backup and
restore
Build powerful
queries easily with
Gremlin and SPARQL
Supports Apache
TinkerPop & W3C
RDF graph models
Easy
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Graph Workloads
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Normalized Graph Design Pattern
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
De-normalized Graph Design Pattern
Node Relationship Endpoint
Bill
person Bill
visited Eiffel Tower
Alice
person Alice
visited Eiffel Tower
friend Bob
Bob
person Bob
born 7/14/90
friend Alice
Interest Mona Lisa
Leonardo daVinci person Leaonardo daVinci
La Jaconde a Washington
video La Jaconde…
about Mona Lisa
Eiffel Tower
place Eiffel Tower
located Paris
7/14/90 date 7/14/90
Paris place City
The Louvre
place Museum
location Paris
Mona Lisa
painting Mona Lisa
creator Leonardo daVinci
location The Louvre
Nodes are the vertices of a
graph
Relationships are the edges
of a graph
Select nodes to get
edges for an entity
Index Relationship and
Endpoint for edge type
and target aggregations
Follow the edges to
traverse the graph
Bob wants to see the
Mona Lisa. While he is in
Paris he would like to see
other things his friends
have enjoyed.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Graph Query Types
Node Query (Primary)
What entities are in the graph?
Edge Query (Index)
What relationships do graph entities have?
Hybrid Query (Traversal)
How are entities related through each other?
RDBMS, NoSQL,
GraphDB
GraphDB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Redshift – Data Warehousing
Fast, powerful, and simple data warehousing at 1/10 the cost
Massively parallel, petabyte scale
Fast Inexpensive Scalable Secure
Columnar storage technology
to improve I/O efficiency and
parallelize queries. Data load
scales linearly.
As low as $1,000 per
terabyte per year, 1/10
the cost of traditional
data warehouse solutions.
Resize your cluster up and
down as your
performance and capacity
needs change.
Data encrypted at rest and
transit. Isolate clusters with
VPC. Manage your own keys
with AWS KMS.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Athena – Interactive Analysis
Interactive query service to analyze data in Amazon S3 using standard SQL
No infrastructure to set up or manage and no data to load
Ability to run SQL queries on data archived in Amazon Glacier (Coming soon)
Serverless
Zero setup cost. Just
point to Amazon S3,
and start querying.
Pay per query
Pay only for queries run.
Save 30–90% on per-
query costs through
compression.
Open
ANSI SQL interface,
JDBC/ODBC drivers, multiple
formats, compression types,
and complex joins and data
types.
Easy
Serverless. Zero
infrastructure. Zero
administration.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
QLDB (Preview)
Fully managed ledger database
Track and verify history of all changes made to your application’s data
Immutable and
transparent
Cryptographically
verifiable
Easy to useHighly scalable
Append-only, immutable journal
tracks history of all changes
which cannot be deleted or
modified. Get full visibility into
entire data lineage
All changes are
cryptographically
chained and verifiable
Executes 2 – 3X as many
transactions than ledgers in
common blockchain
frameworks
Flexible document model,
query with familiar SQL-
like interface
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Timestream (sign up for the preview today)
Fast, scalable, and fully managed time series database
1,000x faster at 1/10 the
cost of relational databases
Trillions of daily
events
Analytics optimized
for time series data
Serverless
Collect fast moving time-
series data from multiple
sources at the rate of
millions of inserts per
second
Capable of processing
trillions of events daily; the
adaptive query processing
engine maintains steady,
predictable performance
Built-in analytics for
interpolation, smoothing,
and approximation to
identify trends, patterns,
and anomalies
No servers to manage;
time-consuming tasks such
as hardware provisioning,
software patching, setup, &
configuration done for you
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Categories of Database
Optimized for
Storage
Optimized for
Compute
Optimized for
Relationships
Normalized relational
or dimensional DW
Denormalized document,
wide column or key value
Denormalized entity
relationship
Ad hoc queries and
aggregations
Instantiated views and
computed aggregations
Ad hoc
entity/relationship
aggregations
Scale vertically Scale horizontally Hybrid
Great for OLAP and
DSS
Built for OLTP or DSS at
scale
Designed for graph
traversals
SQL NoSQL Graph
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Infinite Scale:
The database can gracefully increase
size and throughput without practical
limits
The Iron Triangle of Purpose (The PIE Theorem)
I
P
E Efficiency:
The database will deliver required
query latency for the workload at all
times
Pattern Flexibility:
The database supports random access
patterns and ad hoc queries
PI
Amazon RDS
Elasticsearch
Aurora Serverless
Neptune
IE
Pick Two
PE
Data Models:
Relational
Wide Column
Document
Graph
Columnar
Unstructured
Amazon DynamoDB
Amazon Redshift
Athena
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hundreds of Thousands of Customers Use DynamoDB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Hundreds of Thousands More Use Amazon RDS
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Purpose Built Database Solutions from AWS
Provisioning
Capacity planning
Monitoring
OS patching
Hardware upgrades
Database upgrades
Security patches
Scaling
Monitoring
Performance tuning
Replication across data centers
Re-replicate on server failureProvision new regions
Infrastructure Software
With Zero Unplanned Downtime
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Time: 15 minutes after this session
Location: Speaker Lounge (ARIA East, Level 1, Willow Lounge)
Duration: 30 min.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Más contenido relacionado

La actualidad más candente

Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...Amazon Web Services
 
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...Amazon Web Services
 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueAmazon Web Services
 
Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링
Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링
Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링Amazon Web Services Korea
 
re:Invent 2022 DAT326 Deep dive into Amazon Aurora and its innovations
re:Invent 2022  DAT326 Deep dive into Amazon Aurora and its innovationsre:Invent 2022  DAT326 Deep dive into Amazon Aurora and its innovations
re:Invent 2022 DAT326 Deep dive into Amazon Aurora and its innovationsGrant McAlister
 
Deep Dive - Amazon Elastic MapReduce (EMR)
Deep Dive - Amazon Elastic MapReduce (EMR)Deep Dive - Amazon Elastic MapReduce (EMR)
Deep Dive - Amazon Elastic MapReduce (EMR)Amazon Web Services
 
AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018
AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018
AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018Amazon Web Services Korea
 
Introduction to Serverless
Introduction to ServerlessIntroduction to Serverless
Introduction to ServerlessNikolaus Graf
 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueAmazon Web Services
 
Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018
Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018
Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018Amazon Web Services
 
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...Amazon Web Services
 
AWS Black Belt Online Seminar 2017 Amazon Kinesis
AWS Black Belt Online Seminar 2017 Amazon KinesisAWS Black Belt Online Seminar 2017 Amazon Kinesis
AWS Black Belt Online Seminar 2017 Amazon KinesisAmazon Web Services Japan
 
Building Data Lakes in the AWS Cloud
Building Data Lakes in the AWS CloudBuilding Data Lakes in the AWS Cloud
Building Data Lakes in the AWS CloudAmazon Web Services
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaReal-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaKai Wähner
 
마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)
마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)
마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)Amazon Web Services Korea
 
Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...
Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...
Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...Amazon Web Services
 
Identity and Access Management: The First Step in AWS Security
Identity and Access Management: The First Step in AWS SecurityIdentity and Access Management: The First Step in AWS Security
Identity and Access Management: The First Step in AWS SecurityAmazon Web Services
 

La actualidad más candente (20)

Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
Amazon DynamoDB Under the Hood: How We Built a Hyper-Scale Database (DAT321) ...
 
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
[REPEAT 1] Deep Dive on Amazon Aurora with MySQL Compatibility (DAT304-R1) - ...
 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS Glue
 
Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링
Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링
Amazon OpenSearch Deep dive - 내부구조, 성능최적화 그리고 스케일링
 
re:Invent 2022 DAT326 Deep dive into Amazon Aurora and its innovations
re:Invent 2022  DAT326 Deep dive into Amazon Aurora and its innovationsre:Invent 2022  DAT326 Deep dive into Amazon Aurora and its innovations
re:Invent 2022 DAT326 Deep dive into Amazon Aurora and its innovations
 
Deep Dive - Amazon Elastic MapReduce (EMR)
Deep Dive - Amazon Elastic MapReduce (EMR)Deep Dive - Amazon Elastic MapReduce (EMR)
Deep Dive - Amazon Elastic MapReduce (EMR)
 
AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018
AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018
AWS KMS를 활용하여 안전한 AWS 환경을 구축하기 위한 전략::임기성::AWS Summit Seoul 2018
 
Introduction to Serverless
Introduction to ServerlessIntroduction to Serverless
Introduction to Serverless
 
Building Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS GlueBuilding Serverless ETL Pipelines with AWS Glue
Building Serverless ETL Pipelines with AWS Glue
 
Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018
Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018
Scaling Up to Your First 10 Million Users (ARC205-R1) - AWS re:Invent 2018
 
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
 
AWS Black Belt Online Seminar 2017 Amazon Kinesis
AWS Black Belt Online Seminar 2017 Amazon KinesisAWS Black Belt Online Seminar 2017 Amazon Kinesis
AWS Black Belt Online Seminar 2017 Amazon Kinesis
 
Azure storage
Azure storageAzure storage
Azure storage
 
Building Data Lakes in the AWS Cloud
Building Data Lakes in the AWS CloudBuilding Data Lakes in the AWS Cloud
Building Data Lakes in the AWS Cloud
 
Introducing Amazon SageMaker
Introducing Amazon SageMakerIntroducing Amazon SageMaker
Introducing Amazon SageMaker
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache KafkaReal-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
 
마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)
마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)
마이크로 서비스를 위한 AWS Cloud Map & App Mesh - Saeho Kim (AWS Solutions Architect)
 
Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...
Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...
Getting Started with Serverless Architectures with Microservices_AWSPSSummit_...
 
BDA311 Introduction to AWS Glue
BDA311 Introduction to AWS GlueBDA311 Introduction to AWS Glue
BDA311 Introduction to AWS Glue
 
Identity and Access Management: The First Step in AWS Security
Identity and Access Management: The First Step in AWS SecurityIdentity and Access Management: The First Step in AWS Security
Identity and Access Management: The First Step in AWS Security
 

Similar a Building with AWS Databases: Match Your Workload to the Right Database (DAT301) - AWS re:Invent 2018

Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitAmazon Web Services
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...Amazon Web Services
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitAmazon Web Services
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAmazon Web Services
 
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAnalyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAmazon Web Services
 
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Amazon Web Services
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Amazon Web Services
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoAmazon Web Services LATAM
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Amazon Web Services
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Amazon Web Services
 
AWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAdir Sharabi
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesAmazon Web Services
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesAmazon Web Services
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCAmazon Web Services LATAM
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSAmazon Web Services
 
Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/MLPreparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/MLAmazon Web Services
 

Similar a Building with AWS Databases: Match Your Workload to the Right Database (DAT301) - AWS re:Invent 2018 (20)

Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Anaheim AWS Summit
 
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ... SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
SRV307 Applying AWS Purpose-Built Database Strategy: Match Your Workload to ...
 
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS SummitApplying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
Applying AWS Purpose-Built Database Strategy - SRV307 - Toronto AWS Summit
 
AWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scaleAWS Data Lake: data analysis @ scale
AWS Data Lake: data analysis @ scale
 
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech TalksAnalyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
Analyze your Data Lake, Fast @ Any Scale - AWS Online Tech Talks
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
Database Freedom. Database migration approaches to get to the Cloud - Marcus ...
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
 
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analíticoImmersion Day - Como simplificar o acesso ao seu ambiente analítico
Immersion Day - Como simplificar o acesso ao seu ambiente analítico
 
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
Using data lakes to quench your analytics fire - AWS Summit Cape Town 2018
 
Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28Building Data Lake on AWS | AWS Floor28
Building Data Lake on AWS | AWS Floor28
 
AWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWSAWS Floor 28 - Building Data lake on AWS
AWS Floor 28 - Building Data lake on AWS
 
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best PracticesBuild Data Lakes and Analytics on AWS: Patterns & Best Practices
Build Data Lakes and Analytics on AWS: Patterns & Best Practices
 
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best PracticesBuild Data Lakes & Analytics on AWS: Patterns & Best Practices
Build Data Lakes & Analytics on AWS: Patterns & Best Practices
 
Builders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LCBuilders' Day - Building Data Lakes for Analytics On AWS LC
Builders' Day - Building Data Lakes for Analytics On AWS LC
 
Data Warehouses and Data Lakes
Data Warehouses and Data LakesData Warehouses and Data Lakes
Data Warehouses and Data Lakes
 
ABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWSABD201-Big Data Architectural Patterns and Best Practices on AWS
ABD201-Big Data Architectural Patterns and Best Practices on AWS
 
Preparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/MLPreparing Your Data for Cloud Analytics & AI/ML
Preparing Your Data for Cloud Analytics & AI/ML
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Building with AWS Databases: Match Your Workload to the Right Database (DAT301) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Matching the Database to the Workload Rick Houlihan Principal Technologist, NoSQL AWS D A T 3 0 1
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda for This Session • Database workload classifications • Traditional approaches to scaling RDBMS • How NoSQL databases compare • The flavors of NoSQL on AWS • What database to use when
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Why did you choose this database? “Because we heard X is the best new thing.” “Because we have a site license for X.” “Because X is what we know how to use.”
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Why did you choose this database? “Because this database is purpose built to support what my application is designed to do.”
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Types of Database Workloads • Online Transaction Processing (OLTP) Most common type of app • Online Analytics Processing (OLAP) BI and ad-hoc data projections • Decision Support Systems (DSS) Long running query aggregations and projections Operations Analytics
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sizing the Workload Unbounded problems are harder to solve “I need a root cause analysis engine to correlate transaction level events to trading patterns across global markets.” Problems with limited scope are easier to solve “I need a system to manage inventory in my store.”
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sizing the Database
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Scaling an RDBMS
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sharded Relational DBs? A B C D ?
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. NoSQL Databases • Denormalize and shard to provide horizontal scale • Near unbounded throughput and storage Collection 1 1 TB Shard A 500 GB Shard B 500 GB
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. 00 55 A954 FFAA00 FF Partition Keys in NoSQL Partition Key uniquely identifies an item Partition Key is used for building an unordered hash index Allows table to be partitioned for scale Id = 1 Name = Jim Hash (1) = 7B Id = 2 Name = Andy Dept = Eng Hash (2) = 48 Id = 3 Name = Kim Dept = Ops Hash (3) = CD Key Space
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Iron Triangle of Data - All About CAP C A PConsistency: all clients always have the same view of data Partition tolerance: the system works well despite physical network partitions Availability: all clients can always read and write CA MSSQL Oracle DB2 MySQL Aster Data Greenplum Postgres CP Big Table Hypertable HBase MongoDB Terastore Couchbase Scalaris DynamoDB BerkeleyDB Memcached Redis Pick Two AP Voldemort Tokyo Cabinet KAI DynamoDB Cassandra SimpleDB CouchDB Riak Data Models: Relational Wide Column Document Key/Value
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Technology Adoption and the Hype Curve
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DW | Big Data Processing | Ad hoc AWS Databases and Analytics Broadest and deepest portfolio purpose-built for builders Business Intelligence & Machine Learning Data Movement Database Migration Service | Snowball | Snowmobile | Kinesis Data Firehose | Kinesis Data Streams Amazon QuickSight Relational Databases RDS Aurora Data lake (Batch/ETL) S3/Glacier (Storage) Glue (ETL & Data Catalog) Machine Learning Macie (Data Protection) NoSQL Databases Analytics (OLAP/DSS) DynamoDB (Wide Column/Document) ElastiCache (Indexed Key Value) Amazon Redshift EMR Athena Kinesis Data Analytics Elasticsearch Service Real-time Opertional (OLTP) Neptune (Graph) QLDB (Ledger) Timestream (TSDB)
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon RDS Managed relational database service with a choice of six popular database engines Easy to administer Highly flexible Available & durable Fast No need for infrastructure provisioning, installing and maintaining database software. Scale database compute and storage with a few mouse clicks and zero downtime. Multi-AZ: Automatically replicates data. Automated backup, snapshots, failover. Choose between dual SSD- backed storage for high- performance OLTP.
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon DynamoDB Fast and flexible NoSQL database service for any scale Key-value NoSQL database that supports both document and wide column structures Fast, consistent performance Highly scalable Fully managed Business-critical reliability Consistent single-digit millisecond latencies at any scale. DAX speeds up times to microseconds. Auto-scaling tables serving millions of requests per second, storing hundreds of terabytes of data. Automatic provisioning and infrastructure management. Data replicated across multiple AZs and accessed with regionally available APIs.
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. DynamoDB Schema Table Items Attributes Partition Key Sort Key Mandatory Key-value access pattern Determines data distribution Optional Model 1:N relationships Enables rich query capabilities All items for key ==, <, >, >=, <= “begins with” “between” “contains” “in” sorted results counts top/bottom N values
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. SQL vs. NoSQL Design Pattern
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Neptune Fully managed graph database Fast Reliable Open Query billions of relationships with millisecond latency Six replicas of your data across three AZs with full backup and restore Build powerful queries easily with Gremlin and SPARQL Supports Apache TinkerPop & W3C RDF graph models Easy
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Graph Workloads
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Normalized Graph Design Pattern
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. De-normalized Graph Design Pattern Node Relationship Endpoint Bill person Bill visited Eiffel Tower Alice person Alice visited Eiffel Tower friend Bob Bob person Bob born 7/14/90 friend Alice Interest Mona Lisa Leonardo daVinci person Leaonardo daVinci La Jaconde a Washington video La Jaconde… about Mona Lisa Eiffel Tower place Eiffel Tower located Paris 7/14/90 date 7/14/90 Paris place City The Louvre place Museum location Paris Mona Lisa painting Mona Lisa creator Leonardo daVinci location The Louvre Nodes are the vertices of a graph Relationships are the edges of a graph Select nodes to get edges for an entity Index Relationship and Endpoint for edge type and target aggregations Follow the edges to traverse the graph Bob wants to see the Mona Lisa. While he is in Paris he would like to see other things his friends have enjoyed.
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Graph Query Types Node Query (Primary) What entities are in the graph? Edge Query (Index) What relationships do graph entities have? Hybrid Query (Traversal) How are entities related through each other? RDBMS, NoSQL, GraphDB GraphDB
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Redshift – Data Warehousing Fast, powerful, and simple data warehousing at 1/10 the cost Massively parallel, petabyte scale Fast Inexpensive Scalable Secure Columnar storage technology to improve I/O efficiency and parallelize queries. Data load scales linearly. As low as $1,000 per terabyte per year, 1/10 the cost of traditional data warehouse solutions. Resize your cluster up and down as your performance and capacity needs change. Data encrypted at rest and transit. Isolate clusters with VPC. Manage your own keys with AWS KMS.
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Athena – Interactive Analysis Interactive query service to analyze data in Amazon S3 using standard SQL No infrastructure to set up or manage and no data to load Ability to run SQL queries on data archived in Amazon Glacier (Coming soon) Serverless Zero setup cost. Just point to Amazon S3, and start querying. Pay per query Pay only for queries run. Save 30–90% on per- query costs through compression. Open ANSI SQL interface, JDBC/ODBC drivers, multiple formats, compression types, and complex joins and data types. Easy Serverless. Zero infrastructure. Zero administration.
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. QLDB (Preview) Fully managed ledger database Track and verify history of all changes made to your application’s data Immutable and transparent Cryptographically verifiable Easy to useHighly scalable Append-only, immutable journal tracks history of all changes which cannot be deleted or modified. Get full visibility into entire data lineage All changes are cryptographically chained and verifiable Executes 2 – 3X as many transactions than ledgers in common blockchain frameworks Flexible document model, query with familiar SQL- like interface
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Timestream (sign up for the preview today) Fast, scalable, and fully managed time series database 1,000x faster at 1/10 the cost of relational databases Trillions of daily events Analytics optimized for time series data Serverless Collect fast moving time- series data from multiple sources at the rate of millions of inserts per second Capable of processing trillions of events daily; the adaptive query processing engine maintains steady, predictable performance Built-in analytics for interpolation, smoothing, and approximation to identify trends, patterns, and anomalies No servers to manage; time-consuming tasks such as hardware provisioning, software patching, setup, & configuration done for you
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Categories of Database Optimized for Storage Optimized for Compute Optimized for Relationships Normalized relational or dimensional DW Denormalized document, wide column or key value Denormalized entity relationship Ad hoc queries and aggregations Instantiated views and computed aggregations Ad hoc entity/relationship aggregations Scale vertically Scale horizontally Hybrid Great for OLAP and DSS Built for OLTP or DSS at scale Designed for graph traversals SQL NoSQL Graph
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Infinite Scale: The database can gracefully increase size and throughput without practical limits The Iron Triangle of Purpose (The PIE Theorem) I P E Efficiency: The database will deliver required query latency for the workload at all times Pattern Flexibility: The database supports random access patterns and ad hoc queries PI Amazon RDS Elasticsearch Aurora Serverless Neptune IE Pick Two PE Data Models: Relational Wide Column Document Graph Columnar Unstructured Amazon DynamoDB Amazon Redshift Athena
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hundreds of Thousands of Customers Use DynamoDB
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Hundreds of Thousands More Use Amazon RDS
  • 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Purpose Built Database Solutions from AWS Provisioning Capacity planning Monitoring OS patching Hardware upgrades Database upgrades Security patches Scaling Monitoring Performance tuning Replication across data centers Re-replicate on server failureProvision new regions Infrastructure Software With Zero Unplanned Downtime
  • 34. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 35. Time: 15 minutes after this session Location: Speaker Lounge (ARIA East, Level 1, Willow Lounge) Duration: 30 min.
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.