SlideShare una empresa de Scribd logo
1 de 60
www.edureka.in/cassandra
Slide 1
www.edureka.in/cassandra
Slide 2
Course Structure
 Module 1:
Getting Started With Cassandra
 Module 2:
Understanding Cassandra Data Model
 Module 3:
Understanding Cassandra Architecture
 Module 4:
Creating Sample Application
 Module 5:
Configuring, Monitoring, Maintenance and
Tuning Cassandra
 Module 6:
Integrating Cassandra With Hadoop
 Module 7:
CRUD operations in Cassandra
 Module 8:
Live Project
www.edureka.in/cassandra
Slide 3
How it Works?
 Live Classes
 Class Recordings
 Module wise Quizzes, Coding Assignments
 24x7 on-demand Technical Support
 Sample Application and Live Project
 Online Certification Exam
 Lifetime access to the Learning Management System
www.edureka.in/cassandra
Slide 4
Module 1
Getting Started With Cassandra
 New Problems which can’t be handled by traditional RDBMS
 Tradeoff between Consistency, Availability, Partition Tolerance (CAP theorem)
 What are the different solutions available?
 What is Cassandra?
 Use-Cases for Cassandra
 Cassandra Features – Tunable Consistency, P2P Architecture, Elastic Scalability, Col Orientation
 Demo Application using Cassandra
 Questions?
www.edureka.in/cassandra
Slide 5
Module 2
Understanding Cassandra Data Model
 Understand what database model is.
 Understand the analogy between the RDBMS and Cassandra Data Model.
 Understand the following Cassandra database elements:
 Cluster
 Keyspaces
 Column Families
 Columns
 Super Columns
 Rows
 Indexes in Cassandra
 Primary and Composite Keys and their limitations
 Design Differences between RDBMS and Cassandra
 Materialized Views
 Valueless Columns
 Aggregate Keys
www.edureka.in/cassandra
Slide 6
Module 3
Understanding Cassandra Architecture
 Learn about the System Keyspaces
 Learn about internode communication such as Peer to Peer structure as well as Gossip Protocols
 Learn how Cassandra detects the failures in the nodes and repairs it
 Learn about Anti Entropy and Read Repair
 Learn about the Memtables, Sstables, and Commit logs
 Hinted Handoffs
 Compaction
 Bloom Filters
 Tombstones
 SEDA
 Manager and Services
www.edureka.in/cassandra
Slide 7
Module 4
Creating Sample Application
 Identify challenges faced by RDBMS
 Identify various possible available solutions
 Identify the rational behind choosing Cassandra
 Understand how data modelling differs in Cassandra from traditional relational databases
 Understand how queries are used to design Cassandra data model
 Apply Cassandra data modelling to various use cases
 Create the application which would involve creating various data elements you learned about in
Module 2
 Perform batch updates and search column families
 Overview of the whole project specifying how Cassandra solved the problem which was laid out
in the beginning
www.edureka.in/cassandra
Slide 8
Module 5
Configuring, Monitoring, Maintenance and Tuning Cassandra
Learn about various options of configuring Keyspaces and Column Families
 Learn about various Cassandra Replacement Strategies
 Learn about Replication
 Learn about Partitioners
 Learn about Snitches
 Learn about configuring Cluster
 Learn about Security
 Learn about Monitoring Cassandra Cluster
 Learn about Cassandra Maintenance
 Getting Ring information
 Basic Maintenance
 Snapshots
 Load Balancing
 Decommissioning and Updating nodes
 Learn about Performance Tuning
 Data storage, Reply timeouts
 Commit Logs, MemTables, Caching and Buffer sizes
www.edureka.in/cassandra
Slide 9
Integrating Cassandra with Hadoop
 Learn what Hadoop is
 Learn Hadoop Disribution File System
 Learn how to work with Map Reduce
 Learn Tools like PIG and HIVE
 Learn PIG and HIVE interaction with Cassandra
Module 6
www.edureka.in/cassandra
Slide 10
CRUD Operations in Cassandra
 Learn about Reading and writing data in Cassandra
 Learn about Cassandra API (Thrift)
 Learn about Slice Predicates
 Learn Data Definition Language (DDL) in Cassandra
 Learn Data Manipulation Language (DML) statements within Cassandra
 Learn to execute CQL scripts from with in CQL and from Command prompt
 Learn to Create and Modify Users
 Learn about Batch Mutates and Batch Deletes
 Learn various Security configurations in Cassandra
 Learn to Capture CQL outputs to a file
 Learn to Import and Export data with CQL
Module 7
www.edureka.in/cassandra
Slide 11
Live Project!
Module 8
www.edureka.in/cassandra
Slide 12
What are we going to learn today?
 New Problems which can’t be handled by traditional RDBMS
 Tradeoff between Consistency, Availability, Partition Tolerance (CAP theorem)
 What are the different solutions available?
 What is Cassandra?
 Use-Cases for Cassandra
 Cassandra Features – Tunable Consistency, P2P Architecture, Elastic Scalability, Column Orientation
 Demo Application using Cassandra
 Questions
www.edureka.in/cassandra
Slide 13
Twitter – Massive Scale, High Availability
www.edureka.in/cassandra
Slide 14
Travel Booking – Scale and Availability
www.edureka.in/cassandra
Slide 15
Movie Booking – Consistency and Scale
www.edureka.in/cassandra
Slide 16
Facebook Graph Search – Fast, Complex Querying
www.edureka.in/cassandra
Slide 17
Facebook Messenger – Consistency and Scale
www.edureka.in/cassandra
Slide 18
So, What Is Common?
 Huge Data
 Fast Random access
 Variable Schema
 Need of Compression
 High Availability
 Need for Consistency
 Need of Distribution (Sharding)
www.edureka.in/cassandra
Slide 19
NoSQL Database
 Non Relational
 Distributed
 Open Source
 Horizontally Scalable
 Features of NoSQL Database
www.edureka.in/cassandra
Slide 20
NoSQL Database types
www.edureka.in/cassandra
Slide 21
NoSQL Database types
CouchDB, MongoDB
Collection of key value
Connections
Incomplete Data
Tolerant
Query Performance, No
Standard Query Syntax
Hbase, Cassandra
Column Families
Fast Look-ups
Very Low Level API
Amazon Simple DB,
Redis
Collection of Key
Value pairs
Fast Look-ups
Stored Data
has no Schema
InfoGrid, Infinite Graph
“Property Graph” - Nodes
Graph Algorithms – Shortest
Path, Connected ness, Etc
Not easy to Cluster, traverse
whole graph to get answer
Data Model
Example
Weakness
Strength
Data Model
Example
Weakness
Strength
Data Model
Example
Weakness
Strength
Data Model
Example
Weakness
Strength
Document Data
Store Databases
Key Value
Databases
Columnar NoSQL
Databases
Graph NoSQL
Databases
No SQL
Database Types
www.edureka.in/cassandra
Slide 22
Welcome To Cassandra!
www.edureka.in/cassandra
Slide 23
Cassandra Name’s Story
Troy Destruction
King Priam Hecuba
Cassandra Greek God Apollo
www.edureka.in/cassandra
Slide 24
Why Use Cassandra?
Why Use Cassandra…?
RDBMS
When there is RDBMS!
www.edureka.in/cassandra
Slide 25
Drawbacks of RDBMS
 Scalability
 Joins Slow Down
 Non-Availability of Data
 Queuing
www.edureka.in/cassandra
Slide 26
Solutions…
Vertical Scaling
 More Memory
 Faster Processor
 Upgrading Disks
www.edureka.in/cassandra
Slide 27
Further Steps…
What can go wrong??
Replication
Or even add boxes in database cluster…
Leading to new problems…
Consistency
Failover
Scenario
DATA
DATA
DATA
www.edureka.in/cassandra
Slide 28
More Steps…
Database Configuration
Caching Layer
Consistency problem between the updates in the Cache and
updates in the databases - Problem gets complex over clusters
Might mean manipulating the Write - Turning write logs off—
Not a desirable situation
www.edureka.in/cassandra
Slide 29
Current Data Challenges
 Massive Data Growth and Scalability
 100% Availability
 Quick Real Time Analytics
 No Failures
!
www.edureka.in/cassandra
Slide 30
Why to use Cassandra?
Why to Use Cassandra…?
For High Velocity Data
Writing Data Anywhere,
Everywhere
Scaling Writes and Reads
No Downtime
Scaling Out Strategy
Scaling for both READS
and WRITES
Voluminous Data
Data Originating from
Multiple Locations
Retaining Data for Long
Storing all types of Data
Delivering Fast Response
Time
Keeping Business Online and
Serving Customers
www.edureka.in/cassandra
Slide 31
Cassandra Characteristics…
For More Details, visit our Blog post…http://www.edureka.in/blog/cassandra-advantages/
www.edureka.in/cassandra
Slide 32
Column Oriented
Emp_no Dept_id Hire_date Emp_In Emp_fn
1 2 2010-08-05 Teresa Annie
2 4 2012-03-10 Ronald Susane
3 3 2012-11-06 Brown Donald
4 3 2011-07-03 Ruth David
5 1 2010-09-12 Stancy Elizabeth
6 2 2012-10-03 Catherine Amelia
1 2 2010-08-05 Teresa Annie
2 4 2012-03-10 Ronald Susane
3 3 2012-11-06 Brown Donald
1 2 3 4 5
2010-
08-05
2012-
03-10
2012-
11-06
2011-
07-03
2010-
09-12
2 4 3 3 1
Row-Oriented Database
Column-Oriented Database
www.edureka.in/cassandra
Slide 33
Schema Free
Primary Key First Name Last Name E-mail ID
1 Avril D’Souza NULL
2 David Gomes davidgomes1@yahoo.com
3 Susane NULL NULL
First Name Last Name
Avril D’Souza
First Name Last Name E-mail ID
David Gomes davidgomes1@yahoo.com
First Name
Susane
Schema Based Table
Schema Free
www.edureka.in/cassandra
Slide 34
Brewer’s CAP Theorem
http://www.w3resource.com/mongodb/nosql.php
Consistency
Partition
Tolerance
Availability
CA CP
AP
RDBMS MongoDB
HBase
Redis
CouchDB Cassandra DynamoDB Riak
www.edureka.in/cassandra
Slide 35
NoSQL Landscape
Scalability
&
Speed
Query and Navigational Complexity
Performance
Key-Value
Stores
Dynamo (Amazon),
Voldemort
(LinkedIn), Citrusleaf,
Membase, Riak,
Tokyo Cabinet
Big Table
Clones
BigTable
(Google),
Cassandra,
HBase,
Hypertable Document
Database
CouchOne,
MongoDB,
Terrastore,
OrientDB
Graph
Databases
FlockDB (Twitter),
AllegroGraph,
DEX, InfoGrid,
Neo4J, Sones
www.edureka.in/cassandra
Slide 36
Cassandra Usecase – Deep Drive
5000 TPS
Caching Layer
300 ~ 500 SQL
Transaction
100 ~ 200 SQL
Transaction
1000 TPS
WEB APPLICATION
RDBMS1
Applications Changing Data
RDBMS1
Elastic Scale
www.edureka.in/cassandra
Slide 37
Using Cassandra
1000 TPS
Elastic Scale WEB APPLICATION
Applications Changing Data
Elastic Scale
CASSANDRA
300 ~ 500 SQL
Transaction
100 ~ 200 SQL
Transaction
5000 TPS
www.edureka.in/cassandra
Slide 38
 E-Commerce (Travel Portal)
 Both B2B & B2C Consumers
 High volume of shopping transactions
(> 500 Million Visits / Day)
 High volume supply changes
(Manual & System) generated.
 Huge Inventory Database
(Millions of hotels)
 High Read/Write
(Thousands Reads & Writes/Second)
 Application has to 99.99% Available
 Fault Tolerant & Reliable.
 Fast & Quick Shopping Experience.
 Elastic Scale
 Innovative Recommendations & Algorithms.
 Should be fast for new changes
 Should be cost effective for maintenance.
 Development Approaches
 Legacy Way (Pure RDBMS)
 Augmented (RDBMS + Caching, Heavy
Database Hardware)
 Using Cassandra
Cassandra Usecase - Summary
www.edureka.in/cassandra
Slide 39
Apache Cassandra is an open source, distributed, decentralized, elastically scalable, highly available,
fault-tolerant, Tuneably consistent, column-oriented database.
What is Apache Cassandra?
Cassandra Features
Open
Source
Distributed
Decentralized
Elastically
Scalable
Highly
Scalable
Fault
Tolerant
Tuneably
Consistent
Column
Oriented
www.edureka.in/cassandra
Slide 40
Distributed and Decentralized
Post Office
Decentralised
Post Office
Centralised
CCY
Exchange stationary Letter/Couriers
Ccy Courier Stationary
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
Ccy Courier Stationary
www.edureka.in/cassandra
Slide 41
 Every Node Is Identical.
 Peer to Peer Protocol and uses Gossip Protocol to
maintain and keep the List of nodes in Sync.
 No Single Point of Failure.
 No Special Host to Coordinate Activities.
 Easier to Operate and Maintain because all nodes
are same.
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
Ccy Courier Stationary
Distributed and Decentralized
www.edureka.in/cassandra
Slide 42
Types of Scalability
 Vertical Scalability
 Horizontal Scalability
What is Elastic Scalability?
 This is special property of Horizontal Scalability.
 The cluster can seamlessly scale up and scale back down without major disruption.
Elastic Scalability
www.edureka.in/cassandra
Slide 43
 Cluster must accept new nodes without major disruption or
reconfiguration.
ADD A NODE AND MOVE ON!!
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
Ccy Courier Stationary
CCY, Stationary,
Letter/Couriers
 Process should not be restarted
 Do not have to change application charges
 Don’t have to rebalance data
Elastic Scalability
www.edureka.in/cassandra
Slide 44
 Highly Available
 No Downtime
High Availability and Fault Tolerance
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
CCY, Stationary,
Letter/Couriers
Ccy Courier Stationary
www.edureka.in/cassandra
Slide 45
Tunable Consistency
Strong Consistency
Eventual
Consistency
 Cassandra enables us to tune the Consistency based on the Application Requirement
www.edureka.in/cassandra
Slide 46
 Cassandra was designed specifically from the ground up to take full advantage
of multiprocessor/ multicore machines, and to run across many dozens of
these machines housed in multiple data centres.
 It scales consistently and seamlessly to hundreds of terabytes.
 Shows exceptional performance under heavy loads.
 Consistently shows very fast throughput for writes per second on a basic
commodity workstation.
High Performance
www.edureka.in/cassandra
Slide 47
Use if your application has:
 Big Data (Billions Of Records Rows & Columns)
 Very High Velocity Random Reads & Writes
 Flexible Sparse / Wide Column Requirements
 No Multiple Secondary Index Needs
 Low Latency
Use Cases:
 eCommerce Inventory Cache Use Cases
 Time Series / Events Use Cases
 Feed Based Activities / Use Cases
Where to Use Cassandra?
www.edureka.in/cassandra
Slide 48
Where NOT to Use Cassandra?
Don’t Use if you application has:
 Secondary Indexes.
 Relational Data.
 Transactional (Rollback, Commit)
 Primary & Financial Records.
 Stringent Security & Authorization Needs On Data
 Dynamic Queries on Columns.
 Searching Column Data
 Low Latency
www.edureka.in/cassandra
Slide 49
 Cassandra Installation & Configuration
 Conf/cassandra.yaml
 Tools
 Key Space Setup
 Column Family / Data Model Setup
 Key
 Columns & Data Types
 Indexes (Primary & Secondary)
 Programmatic Consistency
 Thrift Hector API
 CQL3 API
Application Demo
www.edureka.in/cassandra
Slide 50
Application Demo
www.edureka.in/cassandra
Slide 51
Application Demo
www.edureka.in/cassandra
Slide 52
Application Demo
www.edureka.in/cassandra
Slide 53
Application Demo
www.edureka.in/cassandra
Slide 54
Application Demo
www.edureka.in/cassandra
Slide 55
Application Demo
www.edureka.in/cassandra
Slide 56
Application Demo
www.edureka.in/cassandra
Slide 57
Module 2
Understanding Cassandra Data Model
 Understand what database model is.
 Understand the analogy between the RDBMS and Cassandra Data Model.
 Understand the following Cassandra database elements:
 Cluster
 Keyspaces
 Column Families
 Columns
 Super Columns
 Rows
 Indexes in Cassandra
 Primary and Composite Keys and their limitations
 Design Differences between RDBMS and Cassandra
 Materialized Views
 Valueless Columns
 Aggregate Keys
www.edureka.in/cassandra
Slide 58
Hands On
www.edureka.in/cassandra
Slide 59
Questions?
Thank You
See You in Class Next Module

Más contenido relacionado

La actualidad más candente

Cassandraのバックアップと運用を考える
Cassandraのバックアップと運用を考えるCassandraのバックアップと運用を考える
Cassandraのバックアップと運用を考える
Kazutaka Tomita
 
Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...
Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...
Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...
Databricks
 

La actualidad más candente (20)

Spark + Cassandra = Real Time Analytics on Operational Data
Spark + Cassandra = Real Time Analytics on Operational DataSpark + Cassandra = Real Time Analytics on Operational Data
Spark + Cassandra = Real Time Analytics on Operational Data
 
Cassandra 101
Cassandra 101Cassandra 101
Cassandra 101
 
Apache Spark overview
Apache Spark overviewApache Spark overview
Apache Spark overview
 
Databases on AWS Workshop.pdf
Databases on AWS Workshop.pdfDatabases on AWS Workshop.pdf
Databases on AWS Workshop.pdf
 
2023年はTiDBの時代!
2023年はTiDBの時代!2023年はTiDBの時代!
2023年はTiDBの時代!
 
Breakthrough OLAP performance with Cassandra and Spark
Breakthrough OLAP performance with Cassandra and SparkBreakthrough OLAP performance with Cassandra and Spark
Breakthrough OLAP performance with Cassandra and Spark
 
Cassandra an overview
Cassandra an overviewCassandra an overview
Cassandra an overview
 
An Overview of Apache Cassandra
An Overview of Apache CassandraAn Overview of Apache Cassandra
An Overview of Apache Cassandra
 
インメモリーデータグリッドの選択肢
インメモリーデータグリッドの選択肢インメモリーデータグリッドの選択肢
インメモリーデータグリッドの選択肢
 
Myths of Big Partitions (Robert Stupp, DataStax) | Cassandra Summit 2016
Myths of Big Partitions (Robert Stupp, DataStax) | Cassandra Summit 2016Myths of Big Partitions (Robert Stupp, DataStax) | Cassandra Summit 2016
Myths of Big Partitions (Robert Stupp, DataStax) | Cassandra Summit 2016
 
PySpark in practice slides
PySpark in practice slidesPySpark in practice slides
PySpark in practice slides
 
Modelos NoSQL e a Persistência Poliglota
Modelos NoSQL e a Persistência PoliglotaModelos NoSQL e a Persistência Poliglota
Modelos NoSQL e a Persistência Poliglota
 
SQL Server運用実践 - 3年間80台の運用経験から20の教訓
SQL Server運用実践 - 3年間80台の運用経験から20の教訓SQL Server運用実践 - 3年間80台の運用経験から20の教訓
SQL Server運用実践 - 3年間80台の運用経験から20の教訓
 
[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud[pgday.Seoul 2022] PostgreSQL with Google Cloud
[pgday.Seoul 2022] PostgreSQL with Google Cloud
 
MySQL 상태 메시지 분석 및 활용
MySQL 상태 메시지 분석 및 활용MySQL 상태 메시지 분석 및 활용
MySQL 상태 메시지 분석 및 활용
 
Cassandraのバックアップと運用を考える
Cassandraのバックアップと運用を考えるCassandraのバックアップと運用を考える
Cassandraのバックアップと運用を考える
 
Large Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured StreamingLarge Scale Lakehouse Implementation Using Structured Streaming
Large Scale Lakehouse Implementation Using Structured Streaming
 
Elasticsearch V/s Relational Database
Elasticsearch V/s Relational DatabaseElasticsearch V/s Relational Database
Elasticsearch V/s Relational Database
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & InternalsApache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & Internals
 
Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...
Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...
Redis + Structured Streaming—A Perfect Combination to Scale-Out Your Continuo...
 

Destacado

Cassandra
CassandraCassandra
Cassandra
exsuns
 
Erlang Message Passing Concurrency, For The Win
Erlang  Message  Passing  Concurrency,  For  The  WinErlang  Message  Passing  Concurrency,  For  The  Win
Erlang Message Passing Concurrency, For The Win
l xf
 
Cassandra Explained
Cassandra ExplainedCassandra Explained
Cassandra Explained
Eric Evans
 

Destacado (10)

NDC London 2014: Thinking Like an Erlanger
NDC London 2014: Thinking Like an ErlangerNDC London 2014: Thinking Like an Erlanger
NDC London 2014: Thinking Like an Erlanger
 
Cassandra
CassandraCassandra
Cassandra
 
Apache Cassandra at Macys
Apache Cassandra at MacysApache Cassandra at Macys
Apache Cassandra at Macys
 
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
 
Erlang Message Passing Concurrency, For The Win
Erlang  Message  Passing  Concurrency,  For  The  WinErlang  Message  Passing  Concurrency,  For  The  Win
Erlang Message Passing Concurrency, For The Win
 
Understanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache CassandraUnderstanding Data Partitioning and Replication in Apache Cassandra
Understanding Data Partitioning and Replication in Apache Cassandra
 
Learning Cassandra
Learning CassandraLearning Cassandra
Learning Cassandra
 
Cassandra Explained
Cassandra ExplainedCassandra Explained
Cassandra Explained
 
Cassandra Introduction & Features
Cassandra Introduction & FeaturesCassandra Introduction & Features
Cassandra Introduction & Features
 
Introduction to memcached
Introduction to memcachedIntroduction to memcached
Introduction to memcached
 

Similar a Cassandra

Similar a Cassandra (20)

Cassandra Tutorial
Cassandra TutorialCassandra Tutorial
Cassandra Tutorial
 
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL databaseHBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
HBase Vs Cassandra Vs MongoDB - Choosing the right NoSQL database
 
Mongo DB
Mongo DBMongo DB
Mongo DB
 
FULLTEXT02
FULLTEXT02FULLTEXT02
FULLTEXT02
 
The myth of Cassandra
The myth of CassandraThe myth of Cassandra
The myth of Cassandra
 
No sql
No sqlNo sql
No sql
 
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRAA NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
 
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRAA NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
A NOVEL APPROACH FOR HOTEL MANAGEMENT SYSTEM USING CASSANDRA
 
Architecture et modèle de données Cassandra
Architecture et modèle de données CassandraArchitecture et modèle de données Cassandra
Architecture et modèle de données Cassandra
 
Introduction to Cassandra and datastax DSE
Introduction to Cassandra and datastax DSEIntroduction to Cassandra and datastax DSE
Introduction to Cassandra and datastax DSE
 
Business Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital StrategyBusiness Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital Strategy
 
Apache Cassandra Training,Apache Cassandra Training in Bangalore india
Apache Cassandra Training,Apache Cassandra Training in Bangalore indiaApache Cassandra Training,Apache Cassandra Training in Bangalore india
Apache Cassandra Training,Apache Cassandra Training in Bangalore india
 
CASSANDRA A DISTRIBUTED NOSQL DATABASE FOR HOTEL MANAGEMENT SYSTEM
CASSANDRA A DISTRIBUTED NOSQL DATABASE FOR HOTEL MANAGEMENT SYSTEMCASSANDRA A DISTRIBUTED NOSQL DATABASE FOR HOTEL MANAGEMENT SYSTEM
CASSANDRA A DISTRIBUTED NOSQL DATABASE FOR HOTEL MANAGEMENT SYSTEM
 
NoSQL Introduction, Theory, Implementations
NoSQL Introduction, Theory, ImplementationsNoSQL Introduction, Theory, Implementations
NoSQL Introduction, Theory, Implementations
 
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra RockstarWebinar: DataStax Training - Everything you need to become a Cassandra Rockstar
Webinar: DataStax Training - Everything you need to become a Cassandra Rockstar
 
Challenges for running Hadoop on AWS - AdvancedAWS Meetup
Challenges for running Hadoop on AWS - AdvancedAWS MeetupChallenges for running Hadoop on AWS - AdvancedAWS Meetup
Challenges for running Hadoop on AWS - AdvancedAWS Meetup
 
Cassandra tw presentation
Cassandra tw presentationCassandra tw presentation
Cassandra tw presentation
 
Stratio big data spain
Stratio   big data spainStratio   big data spain
Stratio big data spain
 
Scaling Your Database In The Cloud
Scaling Your Database In The CloudScaling Your Database In The Cloud
Scaling Your Database In The Cloud
 
Escalabilidad horizontal y arquitecturas elásticas en Microsoft azure
Escalabilidad horizontal y arquitecturas elásticas en Microsoft azureEscalabilidad horizontal y arquitecturas elásticas en Microsoft azure
Escalabilidad horizontal y arquitecturas elásticas en Microsoft azure
 

Más de Edureka!

Más de Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Último

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Último (20)

General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 

Cassandra

  • 2. www.edureka.in/cassandra Slide 2 Course Structure  Module 1: Getting Started With Cassandra  Module 2: Understanding Cassandra Data Model  Module 3: Understanding Cassandra Architecture  Module 4: Creating Sample Application  Module 5: Configuring, Monitoring, Maintenance and Tuning Cassandra  Module 6: Integrating Cassandra With Hadoop  Module 7: CRUD operations in Cassandra  Module 8: Live Project
  • 3. www.edureka.in/cassandra Slide 3 How it Works?  Live Classes  Class Recordings  Module wise Quizzes, Coding Assignments  24x7 on-demand Technical Support  Sample Application and Live Project  Online Certification Exam  Lifetime access to the Learning Management System
  • 4. www.edureka.in/cassandra Slide 4 Module 1 Getting Started With Cassandra  New Problems which can’t be handled by traditional RDBMS  Tradeoff between Consistency, Availability, Partition Tolerance (CAP theorem)  What are the different solutions available?  What is Cassandra?  Use-Cases for Cassandra  Cassandra Features – Tunable Consistency, P2P Architecture, Elastic Scalability, Col Orientation  Demo Application using Cassandra  Questions?
  • 5. www.edureka.in/cassandra Slide 5 Module 2 Understanding Cassandra Data Model  Understand what database model is.  Understand the analogy between the RDBMS and Cassandra Data Model.  Understand the following Cassandra database elements:  Cluster  Keyspaces  Column Families  Columns  Super Columns  Rows  Indexes in Cassandra  Primary and Composite Keys and their limitations  Design Differences between RDBMS and Cassandra  Materialized Views  Valueless Columns  Aggregate Keys
  • 6. www.edureka.in/cassandra Slide 6 Module 3 Understanding Cassandra Architecture  Learn about the System Keyspaces  Learn about internode communication such as Peer to Peer structure as well as Gossip Protocols  Learn how Cassandra detects the failures in the nodes and repairs it  Learn about Anti Entropy and Read Repair  Learn about the Memtables, Sstables, and Commit logs  Hinted Handoffs  Compaction  Bloom Filters  Tombstones  SEDA  Manager and Services
  • 7. www.edureka.in/cassandra Slide 7 Module 4 Creating Sample Application  Identify challenges faced by RDBMS  Identify various possible available solutions  Identify the rational behind choosing Cassandra  Understand how data modelling differs in Cassandra from traditional relational databases  Understand how queries are used to design Cassandra data model  Apply Cassandra data modelling to various use cases  Create the application which would involve creating various data elements you learned about in Module 2  Perform batch updates and search column families  Overview of the whole project specifying how Cassandra solved the problem which was laid out in the beginning
  • 8. www.edureka.in/cassandra Slide 8 Module 5 Configuring, Monitoring, Maintenance and Tuning Cassandra Learn about various options of configuring Keyspaces and Column Families  Learn about various Cassandra Replacement Strategies  Learn about Replication  Learn about Partitioners  Learn about Snitches  Learn about configuring Cluster  Learn about Security  Learn about Monitoring Cassandra Cluster  Learn about Cassandra Maintenance  Getting Ring information  Basic Maintenance  Snapshots  Load Balancing  Decommissioning and Updating nodes  Learn about Performance Tuning  Data storage, Reply timeouts  Commit Logs, MemTables, Caching and Buffer sizes
  • 9. www.edureka.in/cassandra Slide 9 Integrating Cassandra with Hadoop  Learn what Hadoop is  Learn Hadoop Disribution File System  Learn how to work with Map Reduce  Learn Tools like PIG and HIVE  Learn PIG and HIVE interaction with Cassandra Module 6
  • 10. www.edureka.in/cassandra Slide 10 CRUD Operations in Cassandra  Learn about Reading and writing data in Cassandra  Learn about Cassandra API (Thrift)  Learn about Slice Predicates  Learn Data Definition Language (DDL) in Cassandra  Learn Data Manipulation Language (DML) statements within Cassandra  Learn to execute CQL scripts from with in CQL and from Command prompt  Learn to Create and Modify Users  Learn about Batch Mutates and Batch Deletes  Learn various Security configurations in Cassandra  Learn to Capture CQL outputs to a file  Learn to Import and Export data with CQL Module 7
  • 12. www.edureka.in/cassandra Slide 12 What are we going to learn today?  New Problems which can’t be handled by traditional RDBMS  Tradeoff between Consistency, Availability, Partition Tolerance (CAP theorem)  What are the different solutions available?  What is Cassandra?  Use-Cases for Cassandra  Cassandra Features – Tunable Consistency, P2P Architecture, Elastic Scalability, Column Orientation  Demo Application using Cassandra  Questions
  • 13. www.edureka.in/cassandra Slide 13 Twitter – Massive Scale, High Availability
  • 16. www.edureka.in/cassandra Slide 16 Facebook Graph Search – Fast, Complex Querying
  • 18. www.edureka.in/cassandra Slide 18 So, What Is Common?  Huge Data  Fast Random access  Variable Schema  Need of Compression  High Availability  Need for Consistency  Need of Distribution (Sharding)
  • 19. www.edureka.in/cassandra Slide 19 NoSQL Database  Non Relational  Distributed  Open Source  Horizontally Scalable  Features of NoSQL Database
  • 21. www.edureka.in/cassandra Slide 21 NoSQL Database types CouchDB, MongoDB Collection of key value Connections Incomplete Data Tolerant Query Performance, No Standard Query Syntax Hbase, Cassandra Column Families Fast Look-ups Very Low Level API Amazon Simple DB, Redis Collection of Key Value pairs Fast Look-ups Stored Data has no Schema InfoGrid, Infinite Graph “Property Graph” - Nodes Graph Algorithms – Shortest Path, Connected ness, Etc Not easy to Cluster, traverse whole graph to get answer Data Model Example Weakness Strength Data Model Example Weakness Strength Data Model Example Weakness Strength Data Model Example Weakness Strength Document Data Store Databases Key Value Databases Columnar NoSQL Databases Graph NoSQL Databases No SQL Database Types
  • 23. www.edureka.in/cassandra Slide 23 Cassandra Name’s Story Troy Destruction King Priam Hecuba Cassandra Greek God Apollo
  • 24. www.edureka.in/cassandra Slide 24 Why Use Cassandra? Why Use Cassandra…? RDBMS When there is RDBMS!
  • 25. www.edureka.in/cassandra Slide 25 Drawbacks of RDBMS  Scalability  Joins Slow Down  Non-Availability of Data  Queuing
  • 26. www.edureka.in/cassandra Slide 26 Solutions… Vertical Scaling  More Memory  Faster Processor  Upgrading Disks
  • 27. www.edureka.in/cassandra Slide 27 Further Steps… What can go wrong?? Replication Or even add boxes in database cluster… Leading to new problems… Consistency Failover Scenario DATA DATA DATA
  • 28. www.edureka.in/cassandra Slide 28 More Steps… Database Configuration Caching Layer Consistency problem between the updates in the Cache and updates in the databases - Problem gets complex over clusters Might mean manipulating the Write - Turning write logs off— Not a desirable situation
  • 29. www.edureka.in/cassandra Slide 29 Current Data Challenges  Massive Data Growth and Scalability  100% Availability  Quick Real Time Analytics  No Failures !
  • 30. www.edureka.in/cassandra Slide 30 Why to use Cassandra? Why to Use Cassandra…? For High Velocity Data Writing Data Anywhere, Everywhere Scaling Writes and Reads No Downtime Scaling Out Strategy Scaling for both READS and WRITES Voluminous Data Data Originating from Multiple Locations Retaining Data for Long Storing all types of Data Delivering Fast Response Time Keeping Business Online and Serving Customers
  • 31. www.edureka.in/cassandra Slide 31 Cassandra Characteristics… For More Details, visit our Blog post…http://www.edureka.in/blog/cassandra-advantages/
  • 32. www.edureka.in/cassandra Slide 32 Column Oriented Emp_no Dept_id Hire_date Emp_In Emp_fn 1 2 2010-08-05 Teresa Annie 2 4 2012-03-10 Ronald Susane 3 3 2012-11-06 Brown Donald 4 3 2011-07-03 Ruth David 5 1 2010-09-12 Stancy Elizabeth 6 2 2012-10-03 Catherine Amelia 1 2 2010-08-05 Teresa Annie 2 4 2012-03-10 Ronald Susane 3 3 2012-11-06 Brown Donald 1 2 3 4 5 2010- 08-05 2012- 03-10 2012- 11-06 2011- 07-03 2010- 09-12 2 4 3 3 1 Row-Oriented Database Column-Oriented Database
  • 33. www.edureka.in/cassandra Slide 33 Schema Free Primary Key First Name Last Name E-mail ID 1 Avril D’Souza NULL 2 David Gomes davidgomes1@yahoo.com 3 Susane NULL NULL First Name Last Name Avril D’Souza First Name Last Name E-mail ID David Gomes davidgomes1@yahoo.com First Name Susane Schema Based Table Schema Free
  • 34. www.edureka.in/cassandra Slide 34 Brewer’s CAP Theorem http://www.w3resource.com/mongodb/nosql.php Consistency Partition Tolerance Availability CA CP AP RDBMS MongoDB HBase Redis CouchDB Cassandra DynamoDB Riak
  • 35. www.edureka.in/cassandra Slide 35 NoSQL Landscape Scalability & Speed Query and Navigational Complexity Performance Key-Value Stores Dynamo (Amazon), Voldemort (LinkedIn), Citrusleaf, Membase, Riak, Tokyo Cabinet Big Table Clones BigTable (Google), Cassandra, HBase, Hypertable Document Database CouchOne, MongoDB, Terrastore, OrientDB Graph Databases FlockDB (Twitter), AllegroGraph, DEX, InfoGrid, Neo4J, Sones
  • 36. www.edureka.in/cassandra Slide 36 Cassandra Usecase – Deep Drive 5000 TPS Caching Layer 300 ~ 500 SQL Transaction 100 ~ 200 SQL Transaction 1000 TPS WEB APPLICATION RDBMS1 Applications Changing Data RDBMS1 Elastic Scale
  • 37. www.edureka.in/cassandra Slide 37 Using Cassandra 1000 TPS Elastic Scale WEB APPLICATION Applications Changing Data Elastic Scale CASSANDRA 300 ~ 500 SQL Transaction 100 ~ 200 SQL Transaction 5000 TPS
  • 38. www.edureka.in/cassandra Slide 38  E-Commerce (Travel Portal)  Both B2B & B2C Consumers  High volume of shopping transactions (> 500 Million Visits / Day)  High volume supply changes (Manual & System) generated.  Huge Inventory Database (Millions of hotels)  High Read/Write (Thousands Reads & Writes/Second)  Application has to 99.99% Available  Fault Tolerant & Reliable.  Fast & Quick Shopping Experience.  Elastic Scale  Innovative Recommendations & Algorithms.  Should be fast for new changes  Should be cost effective for maintenance.  Development Approaches  Legacy Way (Pure RDBMS)  Augmented (RDBMS + Caching, Heavy Database Hardware)  Using Cassandra Cassandra Usecase - Summary
  • 39. www.edureka.in/cassandra Slide 39 Apache Cassandra is an open source, distributed, decentralized, elastically scalable, highly available, fault-tolerant, Tuneably consistent, column-oriented database. What is Apache Cassandra? Cassandra Features Open Source Distributed Decentralized Elastically Scalable Highly Scalable Fault Tolerant Tuneably Consistent Column Oriented
  • 40. www.edureka.in/cassandra Slide 40 Distributed and Decentralized Post Office Decentralised Post Office Centralised CCY Exchange stationary Letter/Couriers Ccy Courier Stationary CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers Ccy Courier Stationary
  • 41. www.edureka.in/cassandra Slide 41  Every Node Is Identical.  Peer to Peer Protocol and uses Gossip Protocol to maintain and keep the List of nodes in Sync.  No Single Point of Failure.  No Special Host to Coordinate Activities.  Easier to Operate and Maintain because all nodes are same. CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers Ccy Courier Stationary Distributed and Decentralized
  • 42. www.edureka.in/cassandra Slide 42 Types of Scalability  Vertical Scalability  Horizontal Scalability What is Elastic Scalability?  This is special property of Horizontal Scalability.  The cluster can seamlessly scale up and scale back down without major disruption. Elastic Scalability
  • 43. www.edureka.in/cassandra Slide 43  Cluster must accept new nodes without major disruption or reconfiguration. ADD A NODE AND MOVE ON!! CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers Ccy Courier Stationary CCY, Stationary, Letter/Couriers  Process should not be restarted  Do not have to change application charges  Don’t have to rebalance data Elastic Scalability
  • 44. www.edureka.in/cassandra Slide 44  Highly Available  No Downtime High Availability and Fault Tolerance CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers CCY, Stationary, Letter/Couriers Ccy Courier Stationary
  • 45. www.edureka.in/cassandra Slide 45 Tunable Consistency Strong Consistency Eventual Consistency  Cassandra enables us to tune the Consistency based on the Application Requirement
  • 46. www.edureka.in/cassandra Slide 46  Cassandra was designed specifically from the ground up to take full advantage of multiprocessor/ multicore machines, and to run across many dozens of these machines housed in multiple data centres.  It scales consistently and seamlessly to hundreds of terabytes.  Shows exceptional performance under heavy loads.  Consistently shows very fast throughput for writes per second on a basic commodity workstation. High Performance
  • 47. www.edureka.in/cassandra Slide 47 Use if your application has:  Big Data (Billions Of Records Rows & Columns)  Very High Velocity Random Reads & Writes  Flexible Sparse / Wide Column Requirements  No Multiple Secondary Index Needs  Low Latency Use Cases:  eCommerce Inventory Cache Use Cases  Time Series / Events Use Cases  Feed Based Activities / Use Cases Where to Use Cassandra?
  • 48. www.edureka.in/cassandra Slide 48 Where NOT to Use Cassandra? Don’t Use if you application has:  Secondary Indexes.  Relational Data.  Transactional (Rollback, Commit)  Primary & Financial Records.  Stringent Security & Authorization Needs On Data  Dynamic Queries on Columns.  Searching Column Data  Low Latency
  • 49. www.edureka.in/cassandra Slide 49  Cassandra Installation & Configuration  Conf/cassandra.yaml  Tools  Key Space Setup  Column Family / Data Model Setup  Key  Columns & Data Types  Indexes (Primary & Secondary)  Programmatic Consistency  Thrift Hector API  CQL3 API Application Demo
  • 57. www.edureka.in/cassandra Slide 57 Module 2 Understanding Cassandra Data Model  Understand what database model is.  Understand the analogy between the RDBMS and Cassandra Data Model.  Understand the following Cassandra database elements:  Cluster  Keyspaces  Column Families  Columns  Super Columns  Rows  Indexes in Cassandra  Primary and Composite Keys and their limitations  Design Differences between RDBMS and Cassandra  Materialized Views  Valueless Columns  Aggregate Keys
  • 60. Thank You See You in Class Next Module