SlideShare una empresa de Scribd logo
1 de 37
Descargar para leer sin conexión
Highly available, scalable and secure data with
Cassandra and DataStax Enterprise
GOTO Berlin
27th February 2014
DataStax
•  Founded in April 2010
•  We drive Apache Cassandra™
•  400+ customers (24 of the Fortune 100)
•  220+ employees
•  Contribute approximately 80% of the code to Cassandra
•  Home to Apache Cassandra Chair & most committers
•  Headquartered in San Francisco Bay area
•  European headquarters established in London
Our Goal
To be the first and best database choice for online applications
©2014 DataStax Confidential. Do not distribute without consent. 2
Training
•  Checkout the DataStax academy for free online virtual training!
•  http://www.datastax.com/virtual-training
•  Public courses
•  http://www.datastax.com/course-catalog
•  On-site training
http://www.datastax.com/training
©2014 DataStax Confidential. Do not distribute without consent. 3
DataStax Enterprise for start-ups
•  DataStax gives qualifying start-ups access to DataStax Enterprise for free!
•  For more information:
•  http://www.datastax.com/startup
©2014 DataStax Confidential. Do not distribute without consent. 4
DataStax
•  DataStax supports both the open source community and enterprises.
©2014 DataStax Confidential. Do not distribute without consent. 5
Open Source/Community Enterprise Software
•  Apache Cassandra (employ
Cassandra chair and 80+% of
the committers)
•  DataStax Community Edition
•  DataStax OpsCenter
•  DataStax DevCenter
•  DataStax Drivers/Connectors
•  Online Documentation
•  Online Training
•  Mailing lists and forums
•  DataStax Enterprise Edition
•  Certified Cassandra
•  Built-in Analytics
•  Built-in Enterprise Search
•  Enterprise Security
•  DataStax OpsCenter
•  Expert Support
•  Consultative Help
•  Professional Training
DataStax Enterprise
•  DataStax Enterprise: LOB* applications, with analytics and search for online/real-time
application data.
©2014 DataStax Confidential. Do not distribute without consent. 6
LOB
App
NoSQL
LOB
App
NoSQL
LOB
App
NoSQL
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
C*
Transactions:
•  LOB Style
•  Tunable
consistency
Analytics:
•  MapReduce
•  Hive
•  Pig
•  Mahout
Search
•  Solr
Data Warehouse
Hadoop
Transactions:
•  None
Analytics:
•  MapReduce
•  Hive
•  Pig
•  Mahout
Search
•  Solr
*Line of business
•  Hadoop: data warehouse applications with analytics and search for data warehouse.
Availability and Speed Matters for online apps!
•  UK retailers lost 8.5 billion last year to slow web sites, which is 1 million
for every 10 million in online sales
•  Over half of all web users expect a response time of 2 seconds or less
•  A 1 second delay causes a nearly 10% reduction in customer interactions
•  A 1 second decrease in Amazon page load time costs the company $1.6
billion in sales
©2014 DataStax Confidential. Do not distribute without consent. 7
Apache Cassandra™
•  Apache Cassandra™ is a massively scalable, open source, NoSQL, distributed
database built for modern, mission-critical online applications.
•  Written in Java and is a hybrid of Amazon Dynamo and Google BigTable
•  Masterless with no single point of failure
•  Distributed and data centre aware
•  100% uptime
•  Predictable scaling
©2014 DataStax Confidential. Do not distribute without consent. 8
Dynamo
BigTable
BigTable: http://research.google.com/archive/bigtable-osdi06.pdf
Dynamo: http://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf
CASSANDRA"
Cassandra – Core Values
•  Ease of use
•  Massive scalability
•  High performance
•  Always Available
©2014 DataStax Confidential. Do not distribute without consent. 9
Cassandra – Performance and Scale
©2014 DataStax Confidential. Do not distribute without consent. 10
“In terms of scalability, there is a clear winner throughout our
experiments. Cassandra achieves the highest throughput for the
maximum number of nodes in all experiments with a linear increasing
throughput.”
Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et
al., August 2012. Benchmark paper presented at the Very Large Database Conference, 2012.
http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2012.pdf
End Point Independent NoSQL Benchmark
Lowest in latency…
http://techblog.netflix.com/2011/11/benchmarking-
cassandra-scalability-on.html
http://www.datastax.com/wp-content/uploads/2013/02/WP-
Benchmarking-Top-NoSQL-Databases.pdf
Netflix Cloud Benchmark…
Highest in throughput…
Cassandra - Performance and Scale
•  Cassandra works for small to huge deployments.
©2014 DataStax Confidential. Do not distribute without consent. 11
•  Cassandra Footprint @ Netflix
•  80+ Clusters
•  2500+ nodes
•  4 Data Centres (Amazon Regions)
•  > 1 Trillion transactions per day
http://planetcassandra.org/functional-use-cases/
Cassandra –Overview
•  Cassandra was designed with the understanding that system/hardware failures can and do
occur
•  Peer-to-peer, distributed system
•  All nodes the same
•  Data partitioned among all nodes in the cluster
•  Custom data replication to ensure fault tolerance
•  Read/Write-anywhere and across data centres
©2014 DataStax Confidential. Do not distribute without consent. 12
Node 1
Node 2
Node 3Node 4
Node 5
Cassandra – More Than One Server
•  All nodes participate in a cluster
•  Add or remove as needed
•  All nodes the same – masterless with no single point of failure
•  Each node communicates with each other through the Gossip
protocol, which exchanges information across the cluster every
second
•  Data partitioned among all nodes in the cluster
•  A commit log is used on each node to capture write activity.
Data durability is assured
•  Data also written to an in-memory structure (memtable) and
then to disk once the memory structure is full (an SStable)
•  More capacity? Add a server!
•  More throughput? Add a server!
©2014 DataStax Confidential. Do not distribute without consent. 13
Node 1
Node 2
Node 3Node 4
Node 5
•  Replication factor (RF): How many copies of your
data?
•  RF = 3 in this example
•  Each node is storing 60% of the clusters total data
i.e. 3/5
Handy Calculator: http://www.ecyrd.com/cassandracalculator/
Cassandra – Locally Distributed
©2014 DataStax Confidential. Do not distribute without consent. 14
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
•  Client reads or writes to any node
•  Node coordinates with others
•  Data read or replicated in parallel
Cassandra – Rack Aware
•  Cassandra is aware of which rack (or
availability zone) each node resides in.
•  It will attempt to place each data copy in a
different rack.
•  RF = 3 in this example
©2014 DataStax Confidential. Do not distribute without consent. 15
Node 1
1st copy
Node 4
Node 2
Node 3
2nd copy
Rack 1
Rack 2Rack 2
Rack 3
Rack 1
Node 5
3rd copy
Cassandra – Data Centre Aware
©2014 DataStax Confidential. Do not distribute without consent. 16
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
DC: EUROPEDC: USA•  Active Everywhere – reads/writes in
multiple data centres
•  Client writes local
•  Data syncs across WAN
•  Replication Factor per DC
•  Different number of nodes per
data centre
Cassandra – Tunable Consistency
•  Consistency Level (CL)
•  Client specifies per read or write
•  Handles multi-data center operations
•  ALL = All replicas ack
•  QUORUM = > 51% of replicas ack
•  LOCAL_QUORUM = > 51% in local DC ack
•  ONE = Only one replica acks
•  Plus more…. (see docs)
•  Blog: Eventual Consistency != Hopeful Consistency
http://planetcassandra.org/blog/post/a-netflix-experiment-eventual-consistency-
hopeful-consistency-by-christos-kalantzis/
©2014 DataStax Confidential. Do not distribute without consent. 17
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Parallel
Write
Write
CL=QUORUM
5 μs ack
12 μs ack
500 μs ack
12 μs ack
Node Failure
•  A single node failure shouldn’t bring failure.
•  Replication Factor + Consistency Level = Success
•  This example:
•  RF = 3
•  CL = QUORUM
©2014 DataStax Confidential. Do not distribute without consent. 18
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Parallel
Write
Write
CL=QUORUM
5 μs ack
12 μs ack
12 μs ack
>51% ack – so request is a success
Node Recovery
•  When a write is performed and a replica node for the row is
unavailable the coordinator will store a hint locally.
•  When the node recovers, the coordinator replays the missed
writes.
•  Note: a hinted write does not count towards the consistency
level
•  Note: you should still run repairs across your cluster
http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/
dml_about_hh_c.html
©2014 DataStax Confidential. Do not distribute without consent. 19
Node 1
1st copy
Node 4
Node 5
Node 2
2nd copy
Node 3
3rd copy
Stores Hints while Node 3 is offline
Rack Failure
•  Cassandra will place the data in as many different
racks or availability zones as it can.
•  This example:
•  RF = 3
•  CL = QUORUM
•  Rack 2 fails
•  Data copies still available in Node 1 and Node 5
•  Quorum can be honored i.e. > 51% ack
©2014 DataStax Confidential. Do not distribute without consent. 20
Node 1
1st copy
Node 4
Node 2
Node 3
2nd copy
Rack 1
Rack 2Rack 2
Rack 3
Rack 1
Node 5
3rd copy
request is a success
Don’t be afraid of Weak Consistency
•  More tolerant to failure
•  Consistency Level of 1 is the most popular (I think)
•  If you want stronger consistency go for LOCAL_QUORUM i.e. quorum is
honored in the local data centre.
•  If you go stronger than LOCAL_QUORUM – understand what this means
and why you are doing it.
•  Remember – you can have different consistency levels for reads and
writes e.g. write with CL:1, read with CL:LOCAL_QUORUM
©2014 DataStax Confidential. Do not distribute without consent. 21
Example Application
©2014 DataStax Confidential. Do not distribute without consent. 22
Cassandra
Replication
Cassandra
Replication
•  Active-Active-Active
•  Service based DNS routing
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Example Application - Uptime
©2014 DataStax Confidential. Do not distribute without consent. 23
Cassandra
Replication
Cassandra
Replication
•  Normal service maintenance
•  Application is unaware
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Example Application – DC Failure
©2014 DataStax Confidential. Do not distribute without consent. 24
Cassandra
Replication
Cassandra
Replication
•  Data is safe.
•  Route Traffic
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Tier Failure
©2014 DataStax Confidential. Do not distribute without consent. 25
Cassandra
Replication
•  App Tier is aware of the other DC
•  Switches to access remote DC automatically
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
WAN Failure
©2014 DataStax Confidential. Do not distribute without consent. 26
Cassandra
Replication
Cassandra
Replication
DC: USA DC: Europe DC: Asia
Web Tier Web Tier Web Tier
Web Tier
App TierApp TierApp Tier
Consistency level?
Cassandra Clients - Native Driver
•  Clients that use the native driver also have access to various policies that enable the client to
intelligently route requests as required.
•  This includes:
•  Load Balancing
•  Data Centre Aware
•  Latency Aware
•  Token Aware
•  Reconnection policies
•  Retry policies
•  Downgrading Consistency
•  Plus others..
•  http://www.datastax.com/download/clientdrivers
©2014 DataStax Confidential. Do not distribute without consent. 27
Quotes
•  “Cassandra, our distributed cloud persistence store which is
distributed across all zones and regions, dealt with the loss of one
third of its regional nodes without any loss of data or availability”.
http://techblog.netflix.com/2012/07/lessons-netflix-learned-from-aws-storm.html
•  “During Hurricane Sandy, we lost an entire data center. Completely.
Lost. It. Our application fail-over resulted in us losing just a few
moments of serving requests for a particular region of the country, but
our data in Cassandra never went offline.”
http://planetcassandra.org/blog/post/outbrain-touches-over-80-of-all-us-online-users-with-help-from-cassandra/
©2014 DataStax Confidential. Do not distribute without consent. 28
Ring-fenced resources
•  If you need to isolate resources for different uses, Cassandra is a great fit.
•  You can create separate virtual data centres optimised as required – different workloads,
hardware, availability etc..
•  Cassandra will replicate the data for you – no ETL is necessary
©2014 DataStax Confidential. Do not distribute without consent. 29
Cassandra
Replication
Customer
Facing
Analytics
Hybrid Cloud
•  DataStax Enterprise and Cassandra are multi-data centre and cloud capable
•  Data written to any node is automatically and transparently written to all other nodes in
multiple data centres i.e. no etl
©2014 DataStax Confidential. Do not distribute without consent. 30
Data Centre 1 Data Centre2
Public Cloud
Security in Cassandra
FEATURESBENEFITS
Internal Authentication Manages
login IDs and passwords inside
the database
+  Ensures only authorized users
can access a database system
using internal validation
+  Simple to implement and easy
to understand
+  No learning curve from the
relational world
Object Permission Management
controls who has access to what
and who can do what in the
database
+  Provides granular based control
over who can add/change/
delete/read data
+  Uses familiar GRANT/REVOKE
from relational systems
+  No learning curve
Client to Node Encryption
protects data in flight to and from
a database cluster
+  Ensures data cannot be
captured/stolen in route to a
server
+  Data is safe both in flight from/
to a database and on the
database; complete coverage is
ensured
Advanced Security in DataStax Enterprise
FEATURESBENEFITS
External Authentication uses
external security software
packages to control security
+  Only authorized users have
access to a database system
using external validation
+  Uses most trusted external
security packages (Kerberos),
mainstays in government and
finance
+  Single sign on to all data
domains
Transparent Data Encryption
encrypts data at rest
+  Protects sensitive data at rest
from theft and from being read
at the file system level
+  No changes needed at
application level
Data Auditing
provides trail of who did and
looked at what/when
+  Supplies admins with an audit
trail of all accesses and
changes
+  Granular control to audit only
what’s needed
+  Uses log4j interface to ensure
performance and efficient audit
operations
Data Replication Security in Cassandra
•  A popular feature from a data security perspective is the ability to control at a keyspace/schema level which data centres
data should be replicated to.
•  What this means is that in a multi-data centre (both physical and virtual) cluster you can ensure that data is not shipped
anywhere it shouldn’t be and access to that data can be controlled.
•  This is very simple to set-up and is extremely useful when you need to share some of your data, but not all of you data or if
you have requirements around where your data is permitted to reside.
DC 1 DC 2
Shared Data
DataStax Enterprise 4.0, OpsCentre 4.1
•  DataStax Enterprise 4.0
•  New in-memory option. Brings all of the goodness of Cassandra to an in-memory
database
•  Production-certified version of Apache Cassandra (2.0)
•  Enterprise search enhancements
•  OpsCenter 4.1
•  Capacity planning updates
•  Better insight into node performance
•  More information:
http://www.datastax.com/wp-content/uploads/2014/02/WP-WhatsNewDSE40.pdf
http://www.datastax.com/download
©2014 DataStax Confidential. Do not distribute without consent. 34
How does in-memory work?
•  Developers can create new tables to be in-memory or alter existing tables
to be in-memory
•  Writes are durable
•  10-100x improvement
©2014 DataStax Confidential. Do not distribute without consent. 35
Find Out More
•  DataStax: http://www.datastax.com
•  Getting Started: http://www.datastax.com/documentation/gettingstarted/index.html
•  Training: http://www.datatstax.com/training
•  Downloads: http://www.datastax.com/download
•  Documentation: http://www.datastax.com/docs
•  Developer Blog: http://www.datastax.com/dev/blog
•  Community Site: http://planetcassandra.org
•  Webinars: http://planetcassandra.org/Learn/CassandraCommunityWebinars
•  Summit Talks: http://planetcassandra.org/Learn/CassandraSummit
©2014 DataStax Confidential. Do not distribute without consent. 36
Thank You
We power the big data
apps that transform business.
©2014 DataStax Confidential. Do not distribute without consent. 37

Más contenido relacionado

La actualidad más candente

DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...
DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...
DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...DataStax
 
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...DataStax
 
Multi-Region Cassandra Clusters
Multi-Region Cassandra ClustersMulti-Region Cassandra Clusters
Multi-Region Cassandra ClustersInstaclustr
 
Hindsight is 20/20: MySQL to Cassandra
Hindsight is 20/20: MySQL to CassandraHindsight is 20/20: MySQL to Cassandra
Hindsight is 20/20: MySQL to CassandraMichael Kjellman
 
Webinar - VPS New Range
Webinar - VPS New RangeWebinar - VPS New Range
Webinar - VPS New RangeOVHcloud
 
Introduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developersIntroduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developersJulien Anguenot
 
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...DataStax
 
Cassandra and security
Cassandra and securityCassandra and security
Cassandra and securityBen Bromhead
 
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...DataStax Academy
 
Cassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
Cassandra Day SV 2014: Designing Commodity Storage in Apache CassandraCassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
Cassandra Day SV 2014: Designing Commodity Storage in Apache CassandraDataStax Academy
 
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...DataStax
 
Mesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraMesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraDataStax Academy
 
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...DataStax
 
Run Cloud Native MySQL NDB Cluster in Kubernetes
Run Cloud Native MySQL NDB Cluster in KubernetesRun Cloud Native MySQL NDB Cluster in Kubernetes
Run Cloud Native MySQL NDB Cluster in KubernetesBernd Ocklin
 
Client Drivers and Cassandra, the Right Way
Client Drivers and Cassandra, the Right WayClient Drivers and Cassandra, the Right Way
Client Drivers and Cassandra, the Right WayDataStax Academy
 
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...DataStax
 

La actualidad más candente (20)

DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...
DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...
DataStax | Best Practices for Securing DataStax Enterprise (Matt Kennedy) | C...
 
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
What We Learned About Cassandra While Building go90 (Christopher Webster & Th...
 
Multi-Region Cassandra Clusters
Multi-Region Cassandra ClustersMulti-Region Cassandra Clusters
Multi-Region Cassandra Clusters
 
Hindsight is 20/20: MySQL to Cassandra
Hindsight is 20/20: MySQL to CassandraHindsight is 20/20: MySQL to Cassandra
Hindsight is 20/20: MySQL to Cassandra
 
BigData Developers MeetUp
BigData Developers MeetUpBigData Developers MeetUp
BigData Developers MeetUp
 
Advanced Operations
Advanced OperationsAdvanced Operations
Advanced Operations
 
Webinar - VPS New Range
Webinar - VPS New RangeWebinar - VPS New Range
Webinar - VPS New Range
 
Introduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developersIntroduction to Cassandra and CQL for Java developers
Introduction to Cassandra and CQL for Java developers
 
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
 
Cassandra and security
Cassandra and securityCassandra and security
Cassandra and security
 
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
San Francisco Cassadnra Meetup - March 2014: I/O Performance tuning on AWS fo...
 
Cassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
Cassandra Day SV 2014: Designing Commodity Storage in Apache CassandraCassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
Cassandra Day SV 2014: Designing Commodity Storage in Apache Cassandra
 
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
Leveraging Docker and CoreOS to provide always available Cassandra at Instacl...
 
Mesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run CassandraMesosphere and Contentteam: A New Way to Run Cassandra
Mesosphere and Contentteam: A New Way to Run Cassandra
 
Storage for VDI
Storage for VDIStorage for VDI
Storage for VDI
 
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
Clock Skew and Other Annoying Realities in Distributed Systems (Donny Nadolny...
 
Run Cloud Native MySQL NDB Cluster in Kubernetes
Run Cloud Native MySQL NDB Cluster in KubernetesRun Cloud Native MySQL NDB Cluster in Kubernetes
Run Cloud Native MySQL NDB Cluster in Kubernetes
 
Client Drivers and Cassandra, the Right Way
Client Drivers and Cassandra, the Right WayClient Drivers and Cassandra, the Right Way
Client Drivers and Cassandra, the Right Way
 
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
The Promise and Perils of Encrypting Cassandra Data (Ameesh Divatia, Baffle, ...
 
Advanced Cassandra
Advanced CassandraAdvanced Cassandra
Advanced Cassandra
 

Destacado

How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)DataStax Academy
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraDataStax
 
An Overview of Apache Cassandra
An Overview of Apache CassandraAn Overview of Apache Cassandra
An Overview of Apache CassandraDataStax
 
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...DataStax
 
CrowdStrike: Real World DTCS For Operators
CrowdStrike: Real World DTCS For OperatorsCrowdStrike: Real World DTCS For Operators
CrowdStrike: Real World DTCS For OperatorsDataStax Academy
 
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...DataStax Academy
 
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...COIICV
 
NGCC 2016 - Support large partitions
NGCC 2016 - Support large partitionsNGCC 2016 - Support large partitions
NGCC 2016 - Support large partitionsRobert Stupp
 
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...DataStax Academy
 
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...DataStax
 
Cassandra: Two data centers and great performance
Cassandra: Two data centers and great performanceCassandra: Two data centers and great performance
Cassandra: Two data centers and great performanceDATAVERSITY
 
3800 die-bonder overview
3800 die-bonder overview3800 die-bonder overview
3800 die-bonder overviewfastbr
 
Cassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For OperatorsCassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For OperatorsJeff Jirsa
 
Securing Cassandra
Securing CassandraSecuring Cassandra
Securing CassandraInstaclustr
 
Cassandra multi-datacenter operations essentials
Cassandra multi-datacenter operations essentialsCassandra multi-datacenter operations essentials
Cassandra multi-datacenter operations essentialsJulien Anguenot
 
Rethinking Topology In Cassandra (ApacheCon NA)
Rethinking Topology In Cassandra (ApacheCon NA)Rethinking Topology In Cassandra (ApacheCon NA)
Rethinking Topology In Cassandra (ApacheCon NA)Eric Evans
 
Time Series Data with Apache Cassandra
Time Series Data with Apache CassandraTime Series Data with Apache Cassandra
Time Series Data with Apache CassandraEric Evans
 
Cassandra architecture
Cassandra architectureCassandra architecture
Cassandra architectureT Jake Luciani
 
Time Series Data with Apache Cassandra
Time Series Data with Apache CassandraTime Series Data with Apache Cassandra
Time Series Data with Apache CassandraEric Evans
 

Destacado (20)

How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)How to size up an Apache Cassandra cluster (Training)
How to size up an Apache Cassandra cluster (Training)
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache Cassandra
 
An Overview of Apache Cassandra
An Overview of Apache CassandraAn Overview of Apache Cassandra
An Overview of Apache Cassandra
 
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
Spectator to Participant. Contributing to Cassandra (Patrick McFadin, DataSta...
 
CrowdStrike: Real World DTCS For Operators
CrowdStrike: Real World DTCS For OperatorsCrowdStrike: Real World DTCS For Operators
CrowdStrike: Real World DTCS For Operators
 
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
iland Internet Solutions: Leveraging Cassandra for real-time multi-datacenter...
 
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
Carlos Santa María - Hiperconvergencia, el futuro del Data Center - semanainf...
 
NGCC 2016 - Support large partitions
NGCC 2016 - Support large partitionsNGCC 2016 - Support large partitions
NGCC 2016 - Support large partitions
 
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
Cassandra Summit 2014: Novel Multi-Region Clusters — Cassandra Deployments Sp...
 
Cassandra at scale
Cassandra at scaleCassandra at scale
Cassandra at scale
 
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
Apache Cassandra Multi-Datacenter Essentials (Julien Anguenot, iLand Internet...
 
Cassandra: Two data centers and great performance
Cassandra: Two data centers and great performanceCassandra: Two data centers and great performance
Cassandra: Two data centers and great performance
 
3800 die-bonder overview
3800 die-bonder overview3800 die-bonder overview
3800 die-bonder overview
 
Cassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For OperatorsCassandra Summit 2015: Real World DTCS For Operators
Cassandra Summit 2015: Real World DTCS For Operators
 
Securing Cassandra
Securing CassandraSecuring Cassandra
Securing Cassandra
 
Cassandra multi-datacenter operations essentials
Cassandra multi-datacenter operations essentialsCassandra multi-datacenter operations essentials
Cassandra multi-datacenter operations essentials
 
Rethinking Topology In Cassandra (ApacheCon NA)
Rethinking Topology In Cassandra (ApacheCon NA)Rethinking Topology In Cassandra (ApacheCon NA)
Rethinking Topology In Cassandra (ApacheCon NA)
 
Time Series Data with Apache Cassandra
Time Series Data with Apache CassandraTime Series Data with Apache Cassandra
Time Series Data with Apache Cassandra
 
Cassandra architecture
Cassandra architectureCassandra architecture
Cassandra architecture
 
Time Series Data with Apache Cassandra
Time Series Data with Apache CassandraTime Series Data with Apache Cassandra
Time Series Data with Apache Cassandra
 

Similar a Highly available, scalable and secure data with Cassandra and DataStax Enterprise

Cassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideCassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideMohammed Fazuluddin
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Johnny Miller
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Apache Cassandra and The Multi-Cloud by Amanda Moran
Apache Cassandra and The Multi-Cloud by Amanda MoranApache Cassandra and The Multi-Cloud by Amanda Moran
Apache Cassandra and The Multi-Cloud by Amanda MoranData Con LA
 
Using Apache Cassandra and Apache Kafka to Scale Next Gen Applications
Using Apache Cassandra and Apache Kafka to Scale Next Gen ApplicationsUsing Apache Cassandra and Apache Kafka to Scale Next Gen Applications
Using Apache Cassandra and Apache Kafka to Scale Next Gen ApplicationsData Con LA
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real WorldJeremy Hanna
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real WorldJeremy Hanna
 
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...Data Con LA
 
Spark Streaming & Kafka-The Future of Stream Processing
Spark Streaming & Kafka-The Future of Stream ProcessingSpark Streaming & Kafka-The Future of Stream Processing
Spark Streaming & Kafka-The Future of Stream ProcessingJack Gudenkauf
 
The MySQL High Availability Landscape and where Galera Cluster fits in
The MySQL High Availability Landscape and where Galera Cluster fits inThe MySQL High Availability Landscape and where Galera Cluster fits in
The MySQL High Availability Landscape and where Galera Cluster fits inSakari Keskitalo
 
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...DataStax Academy
 
Hacking apache cloud stack
Hacking apache cloud stackHacking apache cloud stack
Hacking apache cloud stackNitin Mehta
 
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDiscoSD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDiscoBig Data Joe™ Rossi
 
Real Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark StreamingReal Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark StreamingHari Shreedharan
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and AkkaStreaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and AkkaHelena Edelson
 
Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Johnny Miller
 
Introduction to Apache Geode (Cork, Ireland)
Introduction to Apache Geode (Cork, Ireland)Introduction to Apache Geode (Cork, Ireland)
Introduction to Apache Geode (Cork, Ireland)Anthony Baker
 
Aruman Cassandra database
Aruman Cassandra databaseAruman Cassandra database
Aruman Cassandra databaseUmesh Dande
 

Similar a Highly available, scalable and secure data with Cassandra and DataStax Enterprise (20)

Cassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction GuideCassandra - A Basic Introduction Guide
Cassandra - A Basic Introduction Guide
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Apache Cassandra and The Multi-Cloud by Amanda Moran
Apache Cassandra and The Multi-Cloud by Amanda MoranApache Cassandra and The Multi-Cloud by Amanda Moran
Apache Cassandra and The Multi-Cloud by Amanda Moran
 
Using Apache Cassandra and Apache Kafka to Scale Next Gen Applications
Using Apache Cassandra and Apache Kafka to Scale Next Gen ApplicationsUsing Apache Cassandra and Apache Kafka to Scale Next Gen Applications
Using Apache Cassandra and Apache Kafka to Scale Next Gen Applications
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
 
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
Spark Streaming& Kafka-The Future of Stream Processing by Hari Shreedharan of...
 
Spark Streaming & Kafka-The Future of Stream Processing
Spark Streaming & Kafka-The Future of Stream ProcessingSpark Streaming & Kafka-The Future of Stream Processing
Spark Streaming & Kafka-The Future of Stream Processing
 
The MySQL High Availability Landscape and where Galera Cluster fits in
The MySQL High Availability Landscape and where Galera Cluster fits inThe MySQL High Availability Landscape and where Galera Cluster fits in
The MySQL High Availability Landscape and where Galera Cluster fits in
 
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
C* Summit 2013: Netflix Open Source Tools and Benchmarks for Cassandra by Adr...
 
Hacking apache cloud stack
Hacking apache cloud stackHacking apache cloud stack
Hacking apache cloud stack
 
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDiscoSD Big Data Monthly Meetup #4 - Session 2 - WANDisco
SD Big Data Monthly Meetup #4 - Session 2 - WANDisco
 
Real Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark StreamingReal Time Data Processing Using Spark Streaming
Real Time Data Processing Using Spark Streaming
 
Devops kc
Devops kcDevops kc
Devops kc
 
Apache kafka
Apache kafkaApache kafka
Apache kafka
 
Streaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and AkkaStreaming Analytics with Spark, Kafka, Cassandra and Akka
Streaming Analytics with Spark, Kafka, Cassandra and Akka
 
Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1Cassandra 2.0 to 2.1
Cassandra 2.0 to 2.1
 
Introduction to Apache Geode (Cork, Ireland)
Introduction to Apache Geode (Cork, Ireland)Introduction to Apache Geode (Cork, Ireland)
Introduction to Apache Geode (Cork, Ireland)
 
Aruman Cassandra database
Aruman Cassandra databaseAruman Cassandra database
Aruman Cassandra database
 

Último

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 

Highly available, scalable and secure data with Cassandra and DataStax Enterprise

  • 1. Highly available, scalable and secure data with Cassandra and DataStax Enterprise GOTO Berlin 27th February 2014
  • 2. DataStax •  Founded in April 2010 •  We drive Apache Cassandra™ •  400+ customers (24 of the Fortune 100) •  220+ employees •  Contribute approximately 80% of the code to Cassandra •  Home to Apache Cassandra Chair & most committers •  Headquartered in San Francisco Bay area •  European headquarters established in London Our Goal To be the first and best database choice for online applications ©2014 DataStax Confidential. Do not distribute without consent. 2
  • 3. Training •  Checkout the DataStax academy for free online virtual training! •  http://www.datastax.com/virtual-training •  Public courses •  http://www.datastax.com/course-catalog •  On-site training http://www.datastax.com/training ©2014 DataStax Confidential. Do not distribute without consent. 3
  • 4. DataStax Enterprise for start-ups •  DataStax gives qualifying start-ups access to DataStax Enterprise for free! •  For more information: •  http://www.datastax.com/startup ©2014 DataStax Confidential. Do not distribute without consent. 4
  • 5. DataStax •  DataStax supports both the open source community and enterprises. ©2014 DataStax Confidential. Do not distribute without consent. 5 Open Source/Community Enterprise Software •  Apache Cassandra (employ Cassandra chair and 80+% of the committers) •  DataStax Community Edition •  DataStax OpsCenter •  DataStax DevCenter •  DataStax Drivers/Connectors •  Online Documentation •  Online Training •  Mailing lists and forums •  DataStax Enterprise Edition •  Certified Cassandra •  Built-in Analytics •  Built-in Enterprise Search •  Enterprise Security •  DataStax OpsCenter •  Expert Support •  Consultative Help •  Professional Training
  • 6. DataStax Enterprise •  DataStax Enterprise: LOB* applications, with analytics and search for online/real-time application data. ©2014 DataStax Confidential. Do not distribute without consent. 6 LOB App NoSQL LOB App NoSQL LOB App NoSQL C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* C* Transactions: •  LOB Style •  Tunable consistency Analytics: •  MapReduce •  Hive •  Pig •  Mahout Search •  Solr Data Warehouse Hadoop Transactions: •  None Analytics: •  MapReduce •  Hive •  Pig •  Mahout Search •  Solr *Line of business •  Hadoop: data warehouse applications with analytics and search for data warehouse.
  • 7. Availability and Speed Matters for online apps! •  UK retailers lost 8.5 billion last year to slow web sites, which is 1 million for every 10 million in online sales •  Over half of all web users expect a response time of 2 seconds or less •  A 1 second delay causes a nearly 10% reduction in customer interactions •  A 1 second decrease in Amazon page load time costs the company $1.6 billion in sales ©2014 DataStax Confidential. Do not distribute without consent. 7
  • 8. Apache Cassandra™ •  Apache Cassandra™ is a massively scalable, open source, NoSQL, distributed database built for modern, mission-critical online applications. •  Written in Java and is a hybrid of Amazon Dynamo and Google BigTable •  Masterless with no single point of failure •  Distributed and data centre aware •  100% uptime •  Predictable scaling ©2014 DataStax Confidential. Do not distribute without consent. 8 Dynamo BigTable BigTable: http://research.google.com/archive/bigtable-osdi06.pdf Dynamo: http://www.allthingsdistributed.com/files/amazon-dynamo-sosp2007.pdf CASSANDRA"
  • 9. Cassandra – Core Values •  Ease of use •  Massive scalability •  High performance •  Always Available ©2014 DataStax Confidential. Do not distribute without consent. 9
  • 10. Cassandra – Performance and Scale ©2014 DataStax Confidential. Do not distribute without consent. 10 “In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.” Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August 2012. Benchmark paper presented at the Very Large Database Conference, 2012. http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2012.pdf End Point Independent NoSQL Benchmark Lowest in latency… http://techblog.netflix.com/2011/11/benchmarking- cassandra-scalability-on.html http://www.datastax.com/wp-content/uploads/2013/02/WP- Benchmarking-Top-NoSQL-Databases.pdf Netflix Cloud Benchmark… Highest in throughput…
  • 11. Cassandra - Performance and Scale •  Cassandra works for small to huge deployments. ©2014 DataStax Confidential. Do not distribute without consent. 11 •  Cassandra Footprint @ Netflix •  80+ Clusters •  2500+ nodes •  4 Data Centres (Amazon Regions) •  > 1 Trillion transactions per day http://planetcassandra.org/functional-use-cases/
  • 12. Cassandra –Overview •  Cassandra was designed with the understanding that system/hardware failures can and do occur •  Peer-to-peer, distributed system •  All nodes the same •  Data partitioned among all nodes in the cluster •  Custom data replication to ensure fault tolerance •  Read/Write-anywhere and across data centres ©2014 DataStax Confidential. Do not distribute without consent. 12 Node 1 Node 2 Node 3Node 4 Node 5
  • 13. Cassandra – More Than One Server •  All nodes participate in a cluster •  Add or remove as needed •  All nodes the same – masterless with no single point of failure •  Each node communicates with each other through the Gossip protocol, which exchanges information across the cluster every second •  Data partitioned among all nodes in the cluster •  A commit log is used on each node to capture write activity. Data durability is assured •  Data also written to an in-memory structure (memtable) and then to disk once the memory structure is full (an SStable) •  More capacity? Add a server! •  More throughput? Add a server! ©2014 DataStax Confidential. Do not distribute without consent. 13 Node 1 Node 2 Node 3Node 4 Node 5
  • 14. •  Replication factor (RF): How many copies of your data? •  RF = 3 in this example •  Each node is storing 60% of the clusters total data i.e. 3/5 Handy Calculator: http://www.ecyrd.com/cassandracalculator/ Cassandra – Locally Distributed ©2014 DataStax Confidential. Do not distribute without consent. 14 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy •  Client reads or writes to any node •  Node coordinates with others •  Data read or replicated in parallel
  • 15. Cassandra – Rack Aware •  Cassandra is aware of which rack (or availability zone) each node resides in. •  It will attempt to place each data copy in a different rack. •  RF = 3 in this example ©2014 DataStax Confidential. Do not distribute without consent. 15 Node 1 1st copy Node 4 Node 2 Node 3 2nd copy Rack 1 Rack 2Rack 2 Rack 3 Rack 1 Node 5 3rd copy
  • 16. Cassandra – Data Centre Aware ©2014 DataStax Confidential. Do not distribute without consent. 16 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy DC: EUROPEDC: USA•  Active Everywhere – reads/writes in multiple data centres •  Client writes local •  Data syncs across WAN •  Replication Factor per DC •  Different number of nodes per data centre
  • 17. Cassandra – Tunable Consistency •  Consistency Level (CL) •  Client specifies per read or write •  Handles multi-data center operations •  ALL = All replicas ack •  QUORUM = > 51% of replicas ack •  LOCAL_QUORUM = > 51% in local DC ack •  ONE = Only one replica acks •  Plus more…. (see docs) •  Blog: Eventual Consistency != Hopeful Consistency http://planetcassandra.org/blog/post/a-netflix-experiment-eventual-consistency- hopeful-consistency-by-christos-kalantzis/ ©2014 DataStax Confidential. Do not distribute without consent. 17 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Parallel Write Write CL=QUORUM 5 μs ack 12 μs ack 500 μs ack 12 μs ack
  • 18. Node Failure •  A single node failure shouldn’t bring failure. •  Replication Factor + Consistency Level = Success •  This example: •  RF = 3 •  CL = QUORUM ©2014 DataStax Confidential. Do not distribute without consent. 18 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Parallel Write Write CL=QUORUM 5 μs ack 12 μs ack 12 μs ack >51% ack – so request is a success
  • 19. Node Recovery •  When a write is performed and a replica node for the row is unavailable the coordinator will store a hint locally. •  When the node recovers, the coordinator replays the missed writes. •  Note: a hinted write does not count towards the consistency level •  Note: you should still run repairs across your cluster http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/ dml_about_hh_c.html ©2014 DataStax Confidential. Do not distribute without consent. 19 Node 1 1st copy Node 4 Node 5 Node 2 2nd copy Node 3 3rd copy Stores Hints while Node 3 is offline
  • 20. Rack Failure •  Cassandra will place the data in as many different racks or availability zones as it can. •  This example: •  RF = 3 •  CL = QUORUM •  Rack 2 fails •  Data copies still available in Node 1 and Node 5 •  Quorum can be honored i.e. > 51% ack ©2014 DataStax Confidential. Do not distribute without consent. 20 Node 1 1st copy Node 4 Node 2 Node 3 2nd copy Rack 1 Rack 2Rack 2 Rack 3 Rack 1 Node 5 3rd copy request is a success
  • 21. Don’t be afraid of Weak Consistency •  More tolerant to failure •  Consistency Level of 1 is the most popular (I think) •  If you want stronger consistency go for LOCAL_QUORUM i.e. quorum is honored in the local data centre. •  If you go stronger than LOCAL_QUORUM – understand what this means and why you are doing it. •  Remember – you can have different consistency levels for reads and writes e.g. write with CL:1, read with CL:LOCAL_QUORUM ©2014 DataStax Confidential. Do not distribute without consent. 21
  • 22. Example Application ©2014 DataStax Confidential. Do not distribute without consent. 22 Cassandra Replication Cassandra Replication •  Active-Active-Active •  Service based DNS routing DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 23. Example Application - Uptime ©2014 DataStax Confidential. Do not distribute without consent. 23 Cassandra Replication Cassandra Replication •  Normal service maintenance •  Application is unaware DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 24. Example Application – DC Failure ©2014 DataStax Confidential. Do not distribute without consent. 24 Cassandra Replication Cassandra Replication •  Data is safe. •  Route Traffic DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 25. Tier Failure ©2014 DataStax Confidential. Do not distribute without consent. 25 Cassandra Replication •  App Tier is aware of the other DC •  Switches to access remote DC automatically DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier
  • 26. WAN Failure ©2014 DataStax Confidential. Do not distribute without consent. 26 Cassandra Replication Cassandra Replication DC: USA DC: Europe DC: Asia Web Tier Web Tier Web Tier Web Tier App TierApp TierApp Tier Consistency level?
  • 27. Cassandra Clients - Native Driver •  Clients that use the native driver also have access to various policies that enable the client to intelligently route requests as required. •  This includes: •  Load Balancing •  Data Centre Aware •  Latency Aware •  Token Aware •  Reconnection policies •  Retry policies •  Downgrading Consistency •  Plus others.. •  http://www.datastax.com/download/clientdrivers ©2014 DataStax Confidential. Do not distribute without consent. 27
  • 28. Quotes •  “Cassandra, our distributed cloud persistence store which is distributed across all zones and regions, dealt with the loss of one third of its regional nodes without any loss of data or availability”. http://techblog.netflix.com/2012/07/lessons-netflix-learned-from-aws-storm.html •  “During Hurricane Sandy, we lost an entire data center. Completely. Lost. It. Our application fail-over resulted in us losing just a few moments of serving requests for a particular region of the country, but our data in Cassandra never went offline.” http://planetcassandra.org/blog/post/outbrain-touches-over-80-of-all-us-online-users-with-help-from-cassandra/ ©2014 DataStax Confidential. Do not distribute without consent. 28
  • 29. Ring-fenced resources •  If you need to isolate resources for different uses, Cassandra is a great fit. •  You can create separate virtual data centres optimised as required – different workloads, hardware, availability etc.. •  Cassandra will replicate the data for you – no ETL is necessary ©2014 DataStax Confidential. Do not distribute without consent. 29 Cassandra Replication Customer Facing Analytics
  • 30. Hybrid Cloud •  DataStax Enterprise and Cassandra are multi-data centre and cloud capable •  Data written to any node is automatically and transparently written to all other nodes in multiple data centres i.e. no etl ©2014 DataStax Confidential. Do not distribute without consent. 30 Data Centre 1 Data Centre2 Public Cloud
  • 31. Security in Cassandra FEATURESBENEFITS Internal Authentication Manages login IDs and passwords inside the database +  Ensures only authorized users can access a database system using internal validation +  Simple to implement and easy to understand +  No learning curve from the relational world Object Permission Management controls who has access to what and who can do what in the database +  Provides granular based control over who can add/change/ delete/read data +  Uses familiar GRANT/REVOKE from relational systems +  No learning curve Client to Node Encryption protects data in flight to and from a database cluster +  Ensures data cannot be captured/stolen in route to a server +  Data is safe both in flight from/ to a database and on the database; complete coverage is ensured
  • 32. Advanced Security in DataStax Enterprise FEATURESBENEFITS External Authentication uses external security software packages to control security +  Only authorized users have access to a database system using external validation +  Uses most trusted external security packages (Kerberos), mainstays in government and finance +  Single sign on to all data domains Transparent Data Encryption encrypts data at rest +  Protects sensitive data at rest from theft and from being read at the file system level +  No changes needed at application level Data Auditing provides trail of who did and looked at what/when +  Supplies admins with an audit trail of all accesses and changes +  Granular control to audit only what’s needed +  Uses log4j interface to ensure performance and efficient audit operations
  • 33. Data Replication Security in Cassandra •  A popular feature from a data security perspective is the ability to control at a keyspace/schema level which data centres data should be replicated to. •  What this means is that in a multi-data centre (both physical and virtual) cluster you can ensure that data is not shipped anywhere it shouldn’t be and access to that data can be controlled. •  This is very simple to set-up and is extremely useful when you need to share some of your data, but not all of you data or if you have requirements around where your data is permitted to reside. DC 1 DC 2 Shared Data
  • 34. DataStax Enterprise 4.0, OpsCentre 4.1 •  DataStax Enterprise 4.0 •  New in-memory option. Brings all of the goodness of Cassandra to an in-memory database •  Production-certified version of Apache Cassandra (2.0) •  Enterprise search enhancements •  OpsCenter 4.1 •  Capacity planning updates •  Better insight into node performance •  More information: http://www.datastax.com/wp-content/uploads/2014/02/WP-WhatsNewDSE40.pdf http://www.datastax.com/download ©2014 DataStax Confidential. Do not distribute without consent. 34
  • 35. How does in-memory work? •  Developers can create new tables to be in-memory or alter existing tables to be in-memory •  Writes are durable •  10-100x improvement ©2014 DataStax Confidential. Do not distribute without consent. 35
  • 36. Find Out More •  DataStax: http://www.datastax.com •  Getting Started: http://www.datastax.com/documentation/gettingstarted/index.html •  Training: http://www.datatstax.com/training •  Downloads: http://www.datastax.com/download •  Documentation: http://www.datastax.com/docs •  Developer Blog: http://www.datastax.com/dev/blog •  Community Site: http://planetcassandra.org •  Webinars: http://planetcassandra.org/Learn/CassandraCommunityWebinars •  Summit Talks: http://planetcassandra.org/Learn/CassandraSummit ©2014 DataStax Confidential. Do not distribute without consent. 36
  • 37. Thank You We power the big data apps that transform business. ©2014 DataStax Confidential. Do not distribute without consent. 37