SlideShare una empresa de Scribd logo
1 de 41
Descargar para leer sin conexión
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Neptune Deep Dive
Brad Bebee
Principal Product Manager
Amazon Web Services
D A T 4 0 3
Bruce McGaughy
Sr. Manager, Software Development
Amazon Web Services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
Building applications on highly connected data
Different types of graph models
Amazon Neptune overview
Delivering high availability and enterprise features
Getting started
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Related breakouts
Wednesday, November 28
DAT360 - Neptune Performance Tuning: Get the Best out of Amazon Neptune
5:30 – 6:30PM | Mirage, Grand Ballroom D, Table 3
Wednesday, November 28
DAT359 - Getting Started with Amazon Neptune and Amazon SageMaker Jupyter Notebooks
2:30PM – 3:30PM | Aria West, Level 3, Starvine 10, Table 7
Wednesday, November 28
SRV307-R1 Building Serverless Applications Using AWS AppSync and Amazon Neptune
3:15PM – 5:30PM | MGM, Level 1, Grand Ballroom 120
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Relationships enable new applications
Retail fraud detectionRestaurant recommendationsSocial networks
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Use cases for highly connected data
Social networking
Life Sciences Network & IT operationsFraud detection
Recommendations Knowledge graphs
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Dave
Whom might I know? What product should I buy?
Bill
Bob
Alice
Dave
Sara
Bill
Bob
Alice
Dave
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Understanding who, what, when, and where…
What museums should
Alice visit while in Paris?
Who painted the
Mona Lisa?
What artists have
paintings in The Louvre?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Navigate a web of global tax policies
“Our customers are increasingly required to navigate a complex web of global tax policies and
regulations. We need an approach to model the sophisticated corporate structures of our largest
clients and deliver an end-to-end tax solution. We use a microservices architecture approach for
our platforms and are beginning to leverage Amazon Neptune as a graph-based system to
quickly create links within the data.”
said Tim Vanderham, chief technology officer, Thomson Reuters Tax & Accounting
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Challenges Building Apps with Highly Connected DataThe challenges of building apps with highly
connected data using a relational database
Unnatural for
querying graph
Inefficient
graph processing
Rigid schema inflexible
for changing data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Different approaches for highly connected data
Purpose-built for a business process
Purpose-built to answer questions about
relationships
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
A graph database is optimized for efficient storage
and retrieval of highly connected data
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Leading graph models and frameworks
Open Source Apache TinkerPop™
Gremlin Traversal Language
W3C Standard
SPARQL Query Language
RESOURCE DESCRIPTION
FRAMEWORK (RDF)PROPERTY GRAPH
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
A highly connected university example
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Find all of the graduate students who received an
undergraduate degree from the same university
Undergraduate Degree
From
name: ?
name: ?
University
Graduate Student
name: ?
Department
Member Of
subOrganizationOf
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Challenges of existing graph databases
Difficult to maintain
high availability
Difficult to scale
Limited support for
open standards
Too expensive
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Neptune
Fully managed graph database
FAST RELIABLE OPEN
Query billions of
relationships with
millisecond latency
6 replicas of your data
across 3 AZs with full
backup and restore
Build powerful
queries easily with
Gremlin and SPARQL
Supports Apache
TinkerPop & W3C
RDF graph models
EASY
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon Neptune high level architecture
Bulk load
from S3
Database
Mgmt.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Fully managed service
Easily configurable via the console
Multi-AZ high availability
Support for up to 15 read replicas
Supports encryption at rest
Supports encryption in transit (TLS)
Backup and restore, point-in-time
recovery
B E N E F I T S
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Security
• Network isolation via Virtual Private Cloud
• Use security groups to control ingress
• HTTPS encrypted client connections using TLS 1.2
• Encryption at rest using AWS Key Management
Service (KMS)
• AWS Identity and Access Management (IAM)
Policies to secure creation of Neptune resources
• IAM-based Authentication for Access control
• Each request is signed with AWS Signature Version 4
• Libraries provided for Gremlin and SPARQL clients
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Neptune GA customers
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Neptune general availability
• Announced on 5/30/2018
• Regions
• US East (No. Virginia), US East
(Ohio), US West (Oregon), EU
(Ireland), EU (London), EU
(Frankfurt)
• https://aws.amazon.com/about-
aws/whats-new/2018/05/amazon-
neptune-is-now-generally-available/
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Neptune: Distributed storage architecture
 Performance, availability, durability
 Scale-out replica architecture
 Shared storage volume with 10 GB
segments striped across hundreds of nodes
 Data is replicated 6 times across 3 AZs
 Hotspot rebalance, Fast database recovery
 Log applicator embedded in storage layer
Master Replica Replica Replica
Primary
Shared storage volume
Replica Replica
Gremlin /
Sparql
Transactions
Caching
Gremlin /
Sparql
Transactions
Caching
Gremlin /
Sparql
Transactions
Caching
Delivered as a managed service
AZ1 AZ2 AZ3
 Ship only the log
 Less work on engine
 Minimizes network traffic
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Six copies across three availability zones
4 out 6 write quorum; 3 out of 6 read quorum
Many failures possible: Disk (segment loss), Node, AZ network, AZ power, etc..
Continuous monitoring for failures
Automatic repair by peer-to-peer gossiping and replication
Gremlin /
Sparql
Transaction
AZ 1 AZ 2 AZ 3
Caching
Gremlin /
Sparql
Transaction
AZ 1 AZ 2 AZ 3
Caching
Read and write availabilityRead availability
6-way replicated storage to survive “AZ+1” failure
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Why are 6 copies necessary?
 You need replication across 3 AZs
to tolerate an AZ failure.
 Why not just 1 copy per AZ?
 An AZ + 1 node failure would break
the quorum
 Also important for performance
 Hides long tail network latencies
 Only 3/6 needed to ack reads
 Only 4/6 needed to ack writes
AZ 1 AZ 2 AZ 3
Quorum
break on
AZ failure
2/3 read
2/3 write
AZ 1 AZ 2 AZ 3
Quorum
survives
AZ failure
3/6 read
4/6 write
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Continuous backup
Segment snapshot Log records
Recovery point
Segment 1
Segment 2
Segment 3
Time
• Neptune takes periodic snapshots of each segment in parallel
• Continuously streams the redo logs to Amazon Simple Storage Service (Amazon S3)
• Backup happens continuously without performance or availability impact
• At restore, retrieve the appropriate segment snapshots and log streams to storage nodes
• Apply log streams to segment snapshots in parallel and asynchronously
Traditional Database
Have to replay logs since the last
checkpoint
Typically 5 minutes between checkpoints
Often single threaded
Amazon Neptune
Normal reads also replay the logs in the
storage layer
Parallel, distributed, asynchronous
No replay for startup
Checkpointed Data Redo Log
Crash at T0 requires
a re-application of the
redo log since
last checkpoint
T0 T0
Crash at T0 will result in redo logs being
applied to each segment on demand, in
parallel, asynchronously
Instant crash recovery
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Database backtrack
Backtrack brings the database to a point in time without requiring restore from backups
• Backtracking from an unintentional insert or delete
• Backtrack is not destructive. You can backtrack multiple times to find the right point in time.
t0 t1 t2
t0 t1
t2
t3 t4
t3
t4
Rewind to t1
Rewind to t3
Invisible Invisible
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Simplified storage management
 Automatic storage scaling up to 64 TB—no performance impact
 Instantly create user snapshots—no performance impact
Up to 64TB of storage – auto-incremented in 10GB units
up to 64 TB
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Neptune read replicas
PAGE CACHE
UPDATE
Neptune Primary
30% Read
70% Write
Neptune Replica
100% New Reads
Shared Multi-AZ Storage
Amazon Neptune read scaling
Performance
• Applications can scale out read traffic
across up to 15 read replicas
Low Replica Lag
• Typically < 10ms
• Master ships redo logs to replica
• Cached pages have redo applied
• Un-cached pages from shared storage
Availability
• Failing database nodes are
automatically detected and replaced
• If primary fails, a replica replaces it
(typically < 60s failover time)
• Primary upgrade by forced failover
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Monitoring
AWS CloudTrail
• Log all Neptune API calls to S3 bucket
Event Notifications
• Create Amazon Simple Notification Service
(Amazon SNS) subscription via AWS
Command Line Interface (AWS CLI) or AWS
SDK
Amazon CloudWatch
CPUUtilization GremlinRequestsPerSec Http429 SparqlErrors
ClusterReplicaLag Http100 Http500 SparqlRequests
ClusterReplicaLagMaximum Http101 Http501 SparqlRequestsPerSec
ClusterReplicaLagMinimum Http200 LoaderErrors StatusErrors
EngineUptime Http400 LoaderRequests StatusRequests
FreeableMemory Http403 NetworkReceiveThroughput VolumeBytesUsed
GremlinErrors Http405 NetworkThroughput VolumeReadIOPs
GremlinRequests Http413 NetworkTransmitThroughput VolumeWriteIOPs
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
View our blog posts on
using Neptune and
Jupyter Notebooks
https://aws.amazon.com/blogs
/database/analyze-amazon-
neptune-graphs-using-amazon-
sagemaker-jupyter-notebooks/
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Launch Amazon Neptune via AWS CloudFormation
https://docs.aws.amazon.com/neptune/latest/userguide/quickstart.html
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Check out Amazon Neptune samples on Github
https://github.com/aws-samples/amazon-neptune-samples
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Check out Amazon Neptune tools on Github
https://github.com/awslabs/amazon-neptune-tools
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Brad Bebee
beebs@amazon.com
Bruce McGaughy
mcgaughy@amazon.com
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

Más contenido relacionado

La actualidad más candente

Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...Amazon Web Services Korea
 
Graph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SFGraph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SFAmazon Web Services
 
ABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS GlueABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS GlueAmazon Web Services
 
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018Amazon Web Services
 
20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-
20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-
20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-Amazon Web Services Japan
 
20191001 AWS Black Belt Online Seminar AWS Lake Formation
20191001 AWS Black Belt Online Seminar AWS Lake Formation 20191001 AWS Black Belt Online Seminar AWS Lake Formation
20191001 AWS Black Belt Online Seminar AWS Lake Formation Amazon Web Services Japan
 
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...Amazon Web Services Korea
 
AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례
AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례
AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례Amazon Web Services Korea
 
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...Amazon Web Services
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSightAmazon Web Services
 
SRV308 Deep Dive on Amazon Aurora
SRV308 Deep Dive on Amazon AuroraSRV308 Deep Dive on Amazon Aurora
SRV308 Deep Dive on Amazon AuroraAmazon Web Services
 
있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS
있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS
있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWSAmazon Web Services Korea
 
Data Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & AthenaData Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & AthenaAmazon Web Services
 
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Amazon Web Services
 
Deploy and Govern at Scale with AWS Control Tower
Deploy and Govern at Scale with AWS Control TowerDeploy and Govern at Scale with AWS Control Tower
Deploy and Govern at Scale with AWS Control TowerAmazon Web Services
 
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...Amazon Web Services
 

La actualidad más candente (20)

Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
Amazon Elasticache - Fully managed, Redis & Memcached Compatible Service (Lev...
 
Graph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SFGraph & Amazon Neptune: Database Week SF
Graph & Amazon Neptune: Database Week SF
 
ABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS GlueABD315_Serverless ETL with AWS Glue
ABD315_Serverless ETL with AWS Glue
 
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
On-Ramp to Graph Databases and Amazon Neptune (DAT335) - AWS re:Invent 2018
 
20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-
20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-
20210330 AWS Black Belt Online Seminar AWS Glue -Glue Studioを使ったデータ変換のベストプラクティス-
 
20191001 AWS Black Belt Online Seminar AWS Lake Formation
20191001 AWS Black Belt Online Seminar AWS Lake Formation 20191001 AWS Black Belt Online Seminar AWS Lake Formation
20191001 AWS Black Belt Online Seminar AWS Lake Formation
 
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
Amazon OpenSearch - Use Cases, Security/Observability, Serverless and Enhance...
 
AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례
AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례
AWS Summit Seoul 2023 | Amazon EKS 데이터 전송 비용 절감 및 카오스 엔지니어링 적용 사례
 
Amazon Aurora
Amazon AuroraAmazon Aurora
Amazon Aurora
 
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
 
Visualization with Amazon QuickSight
Visualization with Amazon QuickSightVisualization with Amazon QuickSight
Visualization with Amazon QuickSight
 
SRV308 Deep Dive on Amazon Aurora
SRV308 Deep Dive on Amazon AuroraSRV308 Deep Dive on Amazon Aurora
SRV308 Deep Dive on Amazon Aurora
 
있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS
있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS
있는 그대로 저장하고, 바로 분석 가능한, 새로운 관점의 데이터 애널리틱 플랫폼 - 정세웅 애널리틱 스페셜리스트, AWS
 
Data Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & AthenaData Catalog & ETL - Glue & Athena
Data Catalog & ETL - Glue & Athena
 
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
Building Advanced Workflows with AWS Glue (ANT372) - AWS re:Invent 2018
 
Deep Dive: AWS CloudFormation
Deep Dive: AWS CloudFormationDeep Dive: AWS CloudFormation
Deep Dive: AWS CloudFormation
 
Deep Dive on Amazon Aurora
Deep Dive on Amazon AuroraDeep Dive on Amazon Aurora
Deep Dive on Amazon Aurora
 
Amazon Aurora: Under the Hood
Amazon Aurora: Under the HoodAmazon Aurora: Under the Hood
Amazon Aurora: Under the Hood
 
Deploy and Govern at Scale with AWS Control Tower
Deploy and Govern at Scale with AWS Control TowerDeploy and Govern at Scale with AWS Control Tower
Deploy and Govern at Scale with AWS Control Tower
 
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
Amazon Aurora Storage Demystified: How It All Works (DAT363) - AWS re:Invent ...
 

Similar a Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018

Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018
Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018
Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018Amazon Web Services
 
Amazon Aurora - Rajeev Chakrabarti
Amazon Aurora - Rajeev ChakrabartiAmazon Aurora - Rajeev Chakrabarti
Amazon Aurora - Rajeev ChakrabartiAmazon Web Services
 
How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...
How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...
How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...Amazon Web Services
 
Amazon Aurora: Deep Dive - SRV308 - Chicago AWS Summit
Amazon Aurora: Deep Dive - SRV308 - Chicago AWS SummitAmazon Aurora: Deep Dive - SRV308 - Chicago AWS Summit
Amazon Aurora: Deep Dive - SRV308 - Chicago AWS SummitAmazon Web Services
 
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech TalksIntroducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech TalksAmazon Web Services
 
Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...
Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...
Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...Amazon Web Services
 
Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Amazon Web Services
 
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)Amazon Web Services
 
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...Amazon Web Services
 
What's new in Amazon Aurora - ADB203 - Chicago AWS Summit
What's new in Amazon Aurora - ADB203 - Chicago AWS SummitWhat's new in Amazon Aurora - ADB203 - Chicago AWS Summit
What's new in Amazon Aurora - ADB203 - Chicago AWS SummitAmazon Web Services
 
How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...
How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...
How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...Amazon Web Services
 
Loading Data into Redshift with Lab
Loading Data into Redshift with LabLoading Data into Redshift with Lab
Loading Data into Redshift with LabAmazon Web Services
 
Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...
Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...
Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...Amazon Web Services
 

Similar a Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018 (20)

Amazon Aurora: Database Week SF
Amazon Aurora: Database Week SFAmazon Aurora: Database Week SF
Amazon Aurora: Database Week SF
 
Amazon Aurora
Amazon AuroraAmazon Aurora
Amazon Aurora
 
Amazon Aurora_Deep Dive
Amazon Aurora_Deep DiveAmazon Aurora_Deep Dive
Amazon Aurora_Deep Dive
 
Amazon Aurora
Amazon AuroraAmazon Aurora
Amazon Aurora
 
Managed Relational Databases
Managed Relational DatabasesManaged Relational Databases
Managed Relational Databases
 
Amazon Aurora
Amazon AuroraAmazon Aurora
Amazon Aurora
 
Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018
Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018
Build on Amazon Aurora with MySQL Compatibility (DAT348-R4) - AWS re:Invent 2018
 
Amazon Aurora - Rajeev Chakrabarti
Amazon Aurora - Rajeev ChakrabartiAmazon Aurora - Rajeev Chakrabarti
Amazon Aurora - Rajeev Chakrabarti
 
How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...
How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...
How to Build Multi-Region Applications in the Cloud: AWS Developer Workshop -...
 
Amazon Aurora: Deep Dive - SRV308 - Chicago AWS Summit
Amazon Aurora: Deep Dive - SRV308 - Chicago AWS SummitAmazon Aurora: Deep Dive - SRV308 - Chicago AWS Summit
Amazon Aurora: Deep Dive - SRV308 - Chicago AWS Summit
 
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech TalksIntroducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
Introducing Amazon Aurora with PostgreSQL Compatibility - AWS Online Tech Talks
 
Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...
Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...
Accelerate Database Development and Testing with Amazon Aurora (DAT313) - AWS...
 
Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]Databases - EBC on the road Brazil Edition [Portuguese]
Databases - EBC on the road Brazil Edition [Portuguese]
 
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
 
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
Connecting the dots - How Amazon Neptune and Graph Databases can transform yo...
 
What's new in Amazon Aurora - ADB203 - Chicago AWS Summit
What's new in Amazon Aurora - ADB203 - Chicago AWS SummitWhat's new in Amazon Aurora - ADB203 - Chicago AWS Summit
What's new in Amazon Aurora - ADB203 - Chicago AWS Summit
 
How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...
How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...
How Fannie Mae Processes over a Quarter Million Loans per Day with Amazon S3 ...
 
Loading Data into Redshift with Lab
Loading Data into Redshift with LabLoading Data into Redshift with Lab
Loading Data into Redshift with Lab
 
Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...
Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...
Achieving Global Consistency Using AWS CloudFormation StackSets - AWS Online ...
 
Loading Data into Redshift
Loading Data into RedshiftLoading Data into Redshift
Loading Data into Redshift
 

Más de Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

Más de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Deep Dive on Amazon Neptune (DAT403) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Neptune Deep Dive Brad Bebee Principal Product Manager Amazon Web Services D A T 4 0 3 Bruce McGaughy Sr. Manager, Software Development Amazon Web Services
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda Building applications on highly connected data Different types of graph models Amazon Neptune overview Delivering high availability and enterprise features Getting started
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Related breakouts Wednesday, November 28 DAT360 - Neptune Performance Tuning: Get the Best out of Amazon Neptune 5:30 – 6:30PM | Mirage, Grand Ballroom D, Table 3 Wednesday, November 28 DAT359 - Getting Started with Amazon Neptune and Amazon SageMaker Jupyter Notebooks 2:30PM – 3:30PM | Aria West, Level 3, Starvine 10, Table 7 Wednesday, November 28 SRV307-R1 Building Serverless Applications Using AWS AppSync and Amazon Neptune 3:15PM – 5:30PM | MGM, Level 1, Grand Ballroom 120
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Relationships enable new applications Retail fraud detectionRestaurant recommendationsSocial networks
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Use cases for highly connected data Social networking Life Sciences Network & IT operationsFraud detection Recommendations Knowledge graphs
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Dave Whom might I know? What product should I buy? Bill Bob Alice Dave Sara Bill Bob Alice Dave
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Understanding who, what, when, and where… What museums should Alice visit while in Paris? Who painted the Mona Lisa? What artists have paintings in The Louvre?
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Navigate a web of global tax policies “Our customers are increasingly required to navigate a complex web of global tax policies and regulations. We need an approach to model the sophisticated corporate structures of our largest clients and deliver an end-to-end tax solution. We use a microservices architecture approach for our platforms and are beginning to leverage Amazon Neptune as a graph-based system to quickly create links within the data.” said Tim Vanderham, chief technology officer, Thomson Reuters Tax & Accounting
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Challenges Building Apps with Highly Connected DataThe challenges of building apps with highly connected data using a relational database Unnatural for querying graph Inefficient graph processing Rigid schema inflexible for changing data
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Different approaches for highly connected data Purpose-built for a business process Purpose-built to answer questions about relationships
  • 13. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. A graph database is optimized for efficient storage and retrieval of highly connected data
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Leading graph models and frameworks Open Source Apache TinkerPop™ Gremlin Traversal Language W3C Standard SPARQL Query Language RESOURCE DESCRIPTION FRAMEWORK (RDF)PROPERTY GRAPH
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. A highly connected university example
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Find all of the graduate students who received an undergraduate degree from the same university Undergraduate Degree From name: ? name: ? University Graduate Student name: ? Department Member Of subOrganizationOf
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Challenges of existing graph databases Difficult to maintain high availability Difficult to scale Limited support for open standards Too expensive
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Neptune Fully managed graph database FAST RELIABLE OPEN Query billions of relationships with millisecond latency 6 replicas of your data across 3 AZs with full backup and restore Build powerful queries easily with Gremlin and SPARQL Supports Apache TinkerPop & W3C RDF graph models EASY
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Neptune high level architecture Bulk load from S3 Database Mgmt.
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Fully managed service Easily configurable via the console Multi-AZ high availability Support for up to 15 read replicas Supports encryption at rest Supports encryption in transit (TLS) Backup and restore, point-in-time recovery B E N E F I T S
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Security • Network isolation via Virtual Private Cloud • Use security groups to control ingress • HTTPS encrypted client connections using TLS 1.2 • Encryption at rest using AWS Key Management Service (KMS) • AWS Identity and Access Management (IAM) Policies to secure creation of Neptune resources • IAM-based Authentication for Access control • Each request is signed with AWS Signature Version 4 • Libraries provided for Gremlin and SPARQL clients
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Neptune GA customers
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Neptune general availability • Announced on 5/30/2018 • Regions • US East (No. Virginia), US East (Ohio), US West (Oregon), EU (Ireland), EU (London), EU (Frankfurt) • https://aws.amazon.com/about- aws/whats-new/2018/05/amazon- neptune-is-now-generally-available/
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Neptune: Distributed storage architecture  Performance, availability, durability  Scale-out replica architecture  Shared storage volume with 10 GB segments striped across hundreds of nodes  Data is replicated 6 times across 3 AZs  Hotspot rebalance, Fast database recovery  Log applicator embedded in storage layer Master Replica Replica Replica Primary Shared storage volume Replica Replica Gremlin / Sparql Transactions Caching Gremlin / Sparql Transactions Caching Gremlin / Sparql Transactions Caching Delivered as a managed service AZ1 AZ2 AZ3  Ship only the log  Less work on engine  Minimizes network traffic
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Six copies across three availability zones 4 out 6 write quorum; 3 out of 6 read quorum Many failures possible: Disk (segment loss), Node, AZ network, AZ power, etc.. Continuous monitoring for failures Automatic repair by peer-to-peer gossiping and replication Gremlin / Sparql Transaction AZ 1 AZ 2 AZ 3 Caching Gremlin / Sparql Transaction AZ 1 AZ 2 AZ 3 Caching Read and write availabilityRead availability 6-way replicated storage to survive “AZ+1” failure
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Why are 6 copies necessary?  You need replication across 3 AZs to tolerate an AZ failure.  Why not just 1 copy per AZ?  An AZ + 1 node failure would break the quorum  Also important for performance  Hides long tail network latencies  Only 3/6 needed to ack reads  Only 4/6 needed to ack writes AZ 1 AZ 2 AZ 3 Quorum break on AZ failure 2/3 read 2/3 write AZ 1 AZ 2 AZ 3 Quorum survives AZ failure 3/6 read 4/6 write
  • 28. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Continuous backup Segment snapshot Log records Recovery point Segment 1 Segment 2 Segment 3 Time • Neptune takes periodic snapshots of each segment in parallel • Continuously streams the redo logs to Amazon Simple Storage Service (Amazon S3) • Backup happens continuously without performance or availability impact • At restore, retrieve the appropriate segment snapshots and log streams to storage nodes • Apply log streams to segment snapshots in parallel and asynchronously
  • 29. Traditional Database Have to replay logs since the last checkpoint Typically 5 minutes between checkpoints Often single threaded Amazon Neptune Normal reads also replay the logs in the storage layer Parallel, distributed, asynchronous No replay for startup Checkpointed Data Redo Log Crash at T0 requires a re-application of the redo log since last checkpoint T0 T0 Crash at T0 will result in redo logs being applied to each segment on demand, in parallel, asynchronously Instant crash recovery
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Database backtrack Backtrack brings the database to a point in time without requiring restore from backups • Backtracking from an unintentional insert or delete • Backtrack is not destructive. You can backtrack multiple times to find the right point in time. t0 t1 t2 t0 t1 t2 t3 t4 t3 t4 Rewind to t1 Rewind to t3 Invisible Invisible
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Simplified storage management  Automatic storage scaling up to 64 TB—no performance impact  Instantly create user snapshots—no performance impact Up to 64TB of storage – auto-incremented in 10GB units up to 64 TB
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Neptune read replicas PAGE CACHE UPDATE Neptune Primary 30% Read 70% Write Neptune Replica 100% New Reads Shared Multi-AZ Storage Amazon Neptune read scaling Performance • Applications can scale out read traffic across up to 15 read replicas Low Replica Lag • Typically < 10ms • Master ships redo logs to replica • Cached pages have redo applied • Un-cached pages from shared storage Availability • Failing database nodes are automatically detected and replaced • If primary fails, a replica replaces it (typically < 60s failover time) • Primary upgrade by forced failover
  • 33. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Monitoring AWS CloudTrail • Log all Neptune API calls to S3 bucket Event Notifications • Create Amazon Simple Notification Service (Amazon SNS) subscription via AWS Command Line Interface (AWS CLI) or AWS SDK Amazon CloudWatch CPUUtilization GremlinRequestsPerSec Http429 SparqlErrors ClusterReplicaLag Http100 Http500 SparqlRequests ClusterReplicaLagMaximum Http101 Http501 SparqlRequestsPerSec ClusterReplicaLagMinimum Http200 LoaderErrors StatusErrors EngineUptime Http400 LoaderRequests StatusRequests FreeableMemory Http403 NetworkReceiveThroughput VolumeBytesUsed GremlinErrors Http405 NetworkThroughput VolumeReadIOPs GremlinRequests Http413 NetworkTransmitThroughput VolumeWriteIOPs
  • 34. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 35. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. View our blog posts on using Neptune and Jupyter Notebooks https://aws.amazon.com/blogs /database/analyze-amazon- neptune-graphs-using-amazon- sagemaker-jupyter-notebooks/
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Launch Amazon Neptune via AWS CloudFormation https://docs.aws.amazon.com/neptune/latest/userguide/quickstart.html
  • 37. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Check out Amazon Neptune samples on Github https://github.com/aws-samples/amazon-neptune-samples
  • 38. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Check out Amazon Neptune tools on Github https://github.com/awslabs/amazon-neptune-tools
  • 39. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 40. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Brad Bebee beebs@amazon.com Bruce McGaughy mcgaughy@amazon.com
  • 41. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.