SlideShare una empresa de Scribd logo
1 de 28
Evaluating Apache
   Cassandra as a
  Cloud Database
Overview of DataStax
 Founded  in April 2010
 Commercial leader in Apache Cassandra™,
  the popular open-source “big data” database
 100+ customers
 30+ employees
 Home to Apache Cassandra Chair & most
  committers
 Headquartered in San Francisco Bay area
 Secured $11M in Series B funding in Sep 2011
Why DataStax?
DataStax delivers database products and services
based on Apache Cassandra from experts who
are at the forefront of today's data revolution.


Database Software & Tools       Support & Services

    DataStax Enterprise       Production Support
    DataStax Community        Consultative Help
    DataStax OpsCenter        Professional Training
    Drivers & Connectors      Online Documentation
The Company We Keep
What Constitutes a
 Cloud Database?
What a Cloud Database is not
A Cloud database is not simply taking a traditional RDBMS
and running it in a Cloud provider’s environment.
Key Attributes of a Cloud Database
   Transparent elasticity – can add and subtract nodes online with load
    balancing
   Transparent scalability – addition of nodes increases both (1)
    performance throughput; (2)ability to handle Big Data and maintain
    high performance
   High availability – always up; no single point of failure
   Multi-geography/zone aware – able to span multiple geographies, data
    centers, and cloud provider zones. Can read/write to any node
   Data redundancy – data is protected via multiple copies held at
    different physical locations
   Dynamic schema – able to manage structured, semi-structured, and
    unstructured data
   Simple manageability – easy to administer a logical database across
    many nodes
   Software support – supports popular public and private Cloud providers
   Low cost – won’t break the bank
How does Apache
  Cassandra stack
              up?
What is Cassandra?
Apache Cassandra™ is a free
   Distributed…
   High performance…
   Extremely scalable…
   Fault tolerant (i.e. no single point of failure)…

post-relational database solution. Cassandra can serve
as both real-time datastore for online/transactional
applications, and as a read-intensive database for
business intelligence systems.
The History of Cassandra
     Bigtable              Dynamo
Cassandra Technical Advantages
Key technical attributes of Cassandra
include:
   Big Data scalability
   Fast /Linear scale performance
   No single point of failure
   Enterprise / multi-data center / Cloud data distribution
   Read/Write Anywhere capable
   Flexible schema
   Tunable data consistency
   Data compression
   Familiar SQL-Like language – CQL
   Easy setup
   No special hardware needed
   No special caching layer needed
Cassandra Architecture Overview
   Cassandra was designed with the understanding that
    system/hardware failures can and do occur
   Peer-to-peer, distributed system
   All nodes the same
   Data partitioned among all nodes in the cluster
   Custom data replication to ensure fault tolerance
   Read/Write-anywhere design
Cassandra Architecture Overview
   Each node communicates with each other through the
    Gossip protocol, which exchanges information across the
    cluster every second
   A commit log is used on each node to capture write
    activity. Data durability is assured
   Data also written to an in-memory structure (memtable)
    and then to disk once the memory structure is full (an
    SStable)
Cassandra Architecture Overview
   The schema used in Cassandra is mirrored after Google
    Bigtable. It is a row-oriented, column structure that can
    store structured, semi-structured, and unstructured data
   A keyspace is akin to a database in the RDBMS world
   A column family is similar to an RDBMS table but is more
    flexible/dynamic
   A row in a column family is indexed by its key. Other
    columns may be indexed as well

                               Portfolio Keyspace
                                   Customer Column Family

                                    ID    Name      SSN   DOB
Transparent Elasticity
Nodes can be added and removed from Cassandra
online, with no downtime being experienced.


                                            1
                                       12           2



       1
                                  11                        3

  6            2




                         10
                                                                4




  5            3

                                                            5
                              9
           4



                                                        6
                                       8

                                                7
Transparent Scalability
Addition of Cassandra nodes increases performance
linearly and ability to manage TB’s-PB’s of data.


                                                 1
                                         12              2



           1
                                    11                            3

  6                    2



      Performance                        Performance
      throughput = N                     throughput = N x 2
                           10
                                                                      4




  5                    3

                                                                  5
                                9
               4



                                                              6
                                         8

                                                     7
Transparent Scalability


    Over 1
    million
   writes/se
       c!




http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html
High Availability
Cassandra, with its peer-to-peer architecture has no
single point of failure.
Multi-Geography/Zone Aware
Cassandra allows a single logical database to span 1-N
datacenters that are geographically dispersed. Also
supports a hybrid on-premise/Cloud implementation.
Data Redundancy
Cassandra allows for customizable data redundancy so
that data is completely protected. Also supports rack
awareness (data can be replicated between different
racks to guard against machine/rack failures).
Dynamic Schema
Cassandra’s data model – based on Google’s Bigtable –
allows a user to store structured, semi-structured, and
unstructured data with ease.




                         Portfolio Keyspace
                             Customer Column Family

                              ID    Name      SSN   DOB
Simple Manageability
AMI installers install and configure an entire multi-node
Cloud implementation in minutes. All can be managed
and monitored via Web-based console.
Cloud Provider/Software Support
Cassandra is supported on popular Cloud provider
platforms and operating systems.
Low Cost
Cassandra is open source software and is freely
available. Commercial/advanced versions of Cassandra
are available from DataStax along with support and
other services.
How Does Cassandra Stack Up?
Cloud Database Attribute          Meet?                    Info
Transparent elasticity                    Nodes can be added/removed
                                          online with auto load balancing
Transparent scalability                   Performance increases linearly with
                                          node additions. Big Data capable
High availability                         No single point of failure. Offers high
                                          degree of availability
Multi-geo/zone                            Supports multi data centers, geos,
                                          Cloud zones, read-write anywhere
Data redundancy                           Customizable data replication /
                                          redundancy
Dynamic Schema                            Able to manage all key types of
                                          data
Simple manageability                      Easy install, setup and managed via
                                          Web console
Cloud provider/software support           Support for all key providers and
                                          operating systems
Low cost                                  Free if use community; very low cost
                                          if using DataStax for advanced
                                          functionality and/or support
Next Steps
Download Cassandra and try it in your own
environment or on your Cloud provider’s platform.

 Go to
  www.datastax.com/do
  wnload
 Downloads available for
  both Cassandra installs
  that are on premise and
  AMI for Amazon EC2
For More Information
Questions?

Más contenido relacionado

La actualidad más candente

Cassandra background-and-architecture
Cassandra background-and-architectureCassandra background-and-architecture
Cassandra background-and-architecture
Markus Klems
 

La actualidad más candente (20)

NOSQL Database: Apache Cassandra
NOSQL Database: Apache CassandraNOSQL Database: Apache Cassandra
NOSQL Database: Apache Cassandra
 
An Overview of Apache Cassandra
An Overview of Apache CassandraAn Overview of Apache Cassandra
An Overview of Apache Cassandra
 
Apache Cassandra @Geneva JUG 2013.02.26
Apache Cassandra @Geneva JUG 2013.02.26Apache Cassandra @Geneva JUG 2013.02.26
Apache Cassandra @Geneva JUG 2013.02.26
 
Cassandra training
Cassandra trainingCassandra training
Cassandra training
 
Apache Cassandra at the Geek2Geek Berlin
Apache Cassandra at the Geek2Geek BerlinApache Cassandra at the Geek2Geek Berlin
Apache Cassandra at the Geek2Geek Berlin
 
The Cassandra Distributed Database
The Cassandra Distributed DatabaseThe Cassandra Distributed Database
The Cassandra Distributed Database
 
Cassandra Database
Cassandra DatabaseCassandra Database
Cassandra Database
 
Presentation of Apache Cassandra
Presentation of Apache Cassandra Presentation of Apache Cassandra
Presentation of Apache Cassandra
 
Cassandra tutorial
Cassandra tutorialCassandra tutorial
Cassandra tutorial
 
Cassandra internals
Cassandra internalsCassandra internals
Cassandra internals
 
Intro to cassandra
Intro to cassandraIntro to cassandra
Intro to cassandra
 
Cassandra ppt 2
Cassandra ppt 2Cassandra ppt 2
Cassandra ppt 2
 
Cassandra background-and-architecture
Cassandra background-and-architectureCassandra background-and-architecture
Cassandra background-and-architecture
 
Cassandra
CassandraCassandra
Cassandra
 
Apache Cassandra
Apache CassandraApache Cassandra
Apache Cassandra
 
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
Apache Cassandra and DataStax Enterprise Explained with Peter Halliday at Wil...
 
Introduction to Cassandra
Introduction to CassandraIntroduction to Cassandra
Introduction to Cassandra
 
Cassandra concepts, patterns and anti-patterns
Cassandra concepts, patterns and anti-patternsCassandra concepts, patterns and anti-patterns
Cassandra concepts, patterns and anti-patterns
 
Apache Cassandra in the Real World
Apache Cassandra in the Real WorldApache Cassandra in the Real World
Apache Cassandra in the Real World
 
Cassandra Day Atlanta 2015: Introduction to Apache Cassandra & DataStax Enter...
Cassandra Day Atlanta 2015: Introduction to Apache Cassandra & DataStax Enter...Cassandra Day Atlanta 2015: Introduction to Apache Cassandra & DataStax Enter...
Cassandra Day Atlanta 2015: Introduction to Apache Cassandra & DataStax Enter...
 

Destacado

Diarrhea:Myths and facts, Precaution
Diarrhea:Myths and facts, Precaution Diarrhea:Myths and facts, Precaution
Diarrhea:Myths and facts, Precaution
Wuzna Haroon
 
Alta White Paper D2C eCommerce Case Study 2016
Alta White Paper D2C eCommerce Case Study 2016Alta White Paper D2C eCommerce Case Study 2016
Alta White Paper D2C eCommerce Case Study 2016
Patrick Nicholson
 
Energy Strategy Group_Report 2012 efficienza energetica
Energy Strategy Group_Report 2012 efficienza energeticaEnergy Strategy Group_Report 2012 efficienza energetica
Energy Strategy Group_Report 2012 efficienza energetica
Eugenio Bacile di Castiglione
 

Destacado (13)

How Big Data is Changing SEO - by AnalyticsSEO at BrightonSEO
How Big Data is Changing SEO - by AnalyticsSEO at BrightonSEOHow Big Data is Changing SEO - by AnalyticsSEO at BrightonSEO
How Big Data is Changing SEO - by AnalyticsSEO at BrightonSEO
 
Big Data Analytics with Hadoop
Big Data Analytics with HadoopBig Data Analytics with Hadoop
Big Data Analytics with Hadoop
 
Diarrhea:Myths and facts, Precaution
Diarrhea:Myths and facts, Precaution Diarrhea:Myths and facts, Precaution
Diarrhea:Myths and facts, Precaution
 
Alta White Paper D2C eCommerce Case Study 2016
Alta White Paper D2C eCommerce Case Study 2016Alta White Paper D2C eCommerce Case Study 2016
Alta White Paper D2C eCommerce Case Study 2016
 
Energy Strategy Group_Report 2012 efficienza energetica
Energy Strategy Group_Report 2012 efficienza energeticaEnergy Strategy Group_Report 2012 efficienza energetica
Energy Strategy Group_Report 2012 efficienza energetica
 
Context Based Authentication
Context Based AuthenticationContext Based Authentication
Context Based Authentication
 
Nt1310 project
Nt1310 projectNt1310 project
Nt1310 project
 
Information från Läkemedelsverket #5 2013
Information från Läkemedelsverket #5 2013Information från Läkemedelsverket #5 2013
Information från Läkemedelsverket #5 2013
 
Secure PIN Management How to Issue and Change PINs Securely over the Web
Secure PIN Management How to Issue and Change PINs Securely over the WebSecure PIN Management How to Issue and Change PINs Securely over the Web
Secure PIN Management How to Issue and Change PINs Securely over the Web
 
mpx Replay, Expedite Your Catch-Up and C3 Workflow 2 of 2
mpx Replay, Expedite Your Catch-Up and C3 Workflow 2 of 2mpx Replay, Expedite Your Catch-Up and C3 Workflow 2 of 2
mpx Replay, Expedite Your Catch-Up and C3 Workflow 2 of 2
 
cathy resume
cathy resumecathy resume
cathy resume
 
"15 Business Story Ideas to Jump on Now"
"15 Business Story Ideas to Jump on Now""15 Business Story Ideas to Jump on Now"
"15 Business Story Ideas to Jump on Now"
 
Credit cards
Credit cardsCredit cards
Credit cards
 

Similar a Evaluating Apache Cassandra as a Cloud Database

DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQLDataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax
 
Cloud Strategies for a modern hybrid datacenter - Dec 2015
Cloud Strategies for a modern hybrid datacenter - Dec 2015Cloud Strategies for a modern hybrid datacenter - Dec 2015
Cloud Strategies for a modern hybrid datacenter - Dec 2015
Miguel Pérez Colino
 

Similar a Evaluating Apache Cassandra as a Cloud Database (20)

Cassandra
CassandraCassandra
Cassandra
 
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQLDataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
 
Business Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital StrategyBusiness Growth Is Fueled By Your Event-Centric Digital Strategy
Business Growth Is Fueled By Your Event-Centric Digital Strategy
 
Why Cassandra?
Why Cassandra?Why Cassandra?
Why Cassandra?
 
Azure and cloud design patterns
Azure and cloud design patternsAzure and cloud design patterns
Azure and cloud design patterns
 
Cloud Strategies for a modern hybrid datacenter - Dec 2015
Cloud Strategies for a modern hybrid datacenter - Dec 2015Cloud Strategies for a modern hybrid datacenter - Dec 2015
Cloud Strategies for a modern hybrid datacenter - Dec 2015
 
Ai tour 2019 Mejores Practicas en Entornos de Produccion Big Data Open Source...
Ai tour 2019 Mejores Practicas en Entornos de Produccion Big Data Open Source...Ai tour 2019 Mejores Practicas en Entornos de Produccion Big Data Open Source...
Ai tour 2019 Mejores Practicas en Entornos de Produccion Big Data Open Source...
 
SQL and NoSQL in SQL Server
SQL and NoSQL in SQL ServerSQL and NoSQL in SQL Server
SQL and NoSQL in SQL Server
 
Elastically Scaling Kafka Using Confluent
Elastically Scaling Kafka Using ConfluentElastically Scaling Kafka Using Confluent
Elastically Scaling Kafka Using Confluent
 
Cloudian_Cassandra Summit 2012
Cloudian_Cassandra Summit 2012Cloudian_Cassandra Summit 2012
Cloudian_Cassandra Summit 2012
 
Dragonflow Austin Summit Talk
Dragonflow Austin Summit Talk Dragonflow Austin Summit Talk
Dragonflow Austin Summit Talk
 
Webinar Slides: Multi-Master MySQL
Webinar Slides: Multi-Master MySQLWebinar Slides: Multi-Master MySQL
Webinar Slides: Multi-Master MySQL
 
NoSQL Database
NoSQL DatabaseNoSQL Database
NoSQL Database
 
Migrating Oracle database to Cassandra
Migrating Oracle database to CassandraMigrating Oracle database to Cassandra
Migrating Oracle database to Cassandra
 
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
A Microservices approach with Cassandra and Quarkus | DevNation Tech TalkA Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
 
Webinar Slides: Geo-Scale MySQL in AWS
Webinar Slides: Geo-Scale MySQL in AWSWebinar Slides: Geo-Scale MySQL in AWS
Webinar Slides: Geo-Scale MySQL in AWS
 
Cassandra tw presentation
Cassandra tw presentationCassandra tw presentation
Cassandra tw presentation
 
01 milan roadshow keynote
01 milan roadshow keynote01 milan roadshow keynote
01 milan roadshow keynote
 
Cassandra implementation for collecting data and presenting data
Cassandra implementation for collecting data and presenting dataCassandra implementation for collecting data and presenting data
Cassandra implementation for collecting data and presenting data
 
Tour de France Azure PaaS 3/7 Stocker des informations
Tour de France Azure PaaS 3/7 Stocker des informationsTour de France Azure PaaS 3/7 Stocker des informations
Tour de France Azure PaaS 3/7 Stocker des informations
 

Más de DataStax

Más de DataStax (20)

Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?Is Your Enterprise Ready to Shine This Holiday Season?
Is Your Enterprise Ready to Shine This Holiday Season?
 
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
Designing Fault-Tolerant Applications with DataStax Enterprise and Apache Cas...
 
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
Running DataStax Enterprise in VMware Cloud and Hybrid EnvironmentsRunning DataStax Enterprise in VMware Cloud and Hybrid Environments
Running DataStax Enterprise in VMware Cloud and Hybrid Environments
 
Best Practices for Getting to Production with DataStax Enterprise Graph
Best Practices for Getting to Production with DataStax Enterprise GraphBest Practices for Getting to Production with DataStax Enterprise Graph
Best Practices for Getting to Production with DataStax Enterprise Graph
 
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step JourneyWebinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
Webinar | Data Management for Hybrid and Multi-Cloud: A Four-Step Journey
 
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...Webinar  |  How to Understand Apache Cassandra™ Performance Through Read/Writ...
Webinar | How to Understand Apache Cassandra™ Performance Through Read/Writ...
 
Webinar | Better Together: Apache Cassandra and Apache Kafka
Webinar  |  Better Together: Apache Cassandra and Apache KafkaWebinar  |  Better Together: Apache Cassandra and Apache Kafka
Webinar | Better Together: Apache Cassandra and Apache Kafka
 
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax EnterpriseTop 10 Best Practices for Apache Cassandra and DataStax Enterprise
Top 10 Best Practices for Apache Cassandra and DataStax Enterprise
 
Introduction to Apache Cassandra™ + What’s New in 4.0
Introduction to Apache Cassandra™ + What’s New in 4.0Introduction to Apache Cassandra™ + What’s New in 4.0
Introduction to Apache Cassandra™ + What’s New in 4.0
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud RealitiesWebinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid CloudHow to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
 
How to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerceHow to Evaluate Cloud Databases for eCommerce
How to Evaluate Cloud Databases for eCommerce
 
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
Webinar: DataStax Enterprise 6: 10 Ways to Multiply the Power of Apache Cassa...
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
 
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
Webinar - Real-Time Customer Experience for the Right-Now Enterprise featurin...
 
Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)Datastax - The Architect's guide to customer experience (CX)
Datastax - The Architect's guide to customer experience (CX)
 
An Operational Data Layer is Critical for Transformative Banking Applications
An Operational Data Layer is Critical for Transformative Banking ApplicationsAn Operational Data Layer is Critical for Transformative Banking Applications
An Operational Data Layer is Critical for Transformative Banking Applications
 
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design ThinkingBecoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
Becoming a Customer-Centric Enterprise Via Real-Time Data and Design Thinking
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Evaluating Apache Cassandra as a Cloud Database

  • 1. Evaluating Apache Cassandra as a Cloud Database
  • 2. Overview of DataStax  Founded in April 2010  Commercial leader in Apache Cassandra™, the popular open-source “big data” database  100+ customers  30+ employees  Home to Apache Cassandra Chair & most committers  Headquartered in San Francisco Bay area  Secured $11M in Series B funding in Sep 2011
  • 3. Why DataStax? DataStax delivers database products and services based on Apache Cassandra from experts who are at the forefront of today's data revolution. Database Software & Tools Support & Services  DataStax Enterprise  Production Support  DataStax Community  Consultative Help  DataStax OpsCenter  Professional Training  Drivers & Connectors  Online Documentation
  • 5. What Constitutes a Cloud Database?
  • 6. What a Cloud Database is not A Cloud database is not simply taking a traditional RDBMS and running it in a Cloud provider’s environment.
  • 7. Key Attributes of a Cloud Database  Transparent elasticity – can add and subtract nodes online with load balancing  Transparent scalability – addition of nodes increases both (1) performance throughput; (2)ability to handle Big Data and maintain high performance  High availability – always up; no single point of failure  Multi-geography/zone aware – able to span multiple geographies, data centers, and cloud provider zones. Can read/write to any node  Data redundancy – data is protected via multiple copies held at different physical locations  Dynamic schema – able to manage structured, semi-structured, and unstructured data  Simple manageability – easy to administer a logical database across many nodes  Software support – supports popular public and private Cloud providers  Low cost – won’t break the bank
  • 8. How does Apache Cassandra stack up?
  • 9. What is Cassandra? Apache Cassandra™ is a free  Distributed…  High performance…  Extremely scalable…  Fault tolerant (i.e. no single point of failure)… post-relational database solution. Cassandra can serve as both real-time datastore for online/transactional applications, and as a read-intensive database for business intelligence systems.
  • 10. The History of Cassandra Bigtable Dynamo
  • 11. Cassandra Technical Advantages Key technical attributes of Cassandra include:  Big Data scalability  Fast /Linear scale performance  No single point of failure  Enterprise / multi-data center / Cloud data distribution  Read/Write Anywhere capable  Flexible schema  Tunable data consistency  Data compression  Familiar SQL-Like language – CQL  Easy setup  No special hardware needed  No special caching layer needed
  • 12. Cassandra Architecture Overview  Cassandra was designed with the understanding that system/hardware failures can and do occur  Peer-to-peer, distributed system  All nodes the same  Data partitioned among all nodes in the cluster  Custom data replication to ensure fault tolerance  Read/Write-anywhere design
  • 13. Cassandra Architecture Overview  Each node communicates with each other through the Gossip protocol, which exchanges information across the cluster every second  A commit log is used on each node to capture write activity. Data durability is assured  Data also written to an in-memory structure (memtable) and then to disk once the memory structure is full (an SStable)
  • 14. Cassandra Architecture Overview  The schema used in Cassandra is mirrored after Google Bigtable. It is a row-oriented, column structure that can store structured, semi-structured, and unstructured data  A keyspace is akin to a database in the RDBMS world  A column family is similar to an RDBMS table but is more flexible/dynamic  A row in a column family is indexed by its key. Other columns may be indexed as well Portfolio Keyspace Customer Column Family ID Name SSN DOB
  • 15. Transparent Elasticity Nodes can be added and removed from Cassandra online, with no downtime being experienced. 1 12 2 1 11 3 6 2 10 4 5 3 5 9 4 6 8 7
  • 16. Transparent Scalability Addition of Cassandra nodes increases performance linearly and ability to manage TB’s-PB’s of data. 1 12 2 1 11 3 6 2 Performance Performance throughput = N throughput = N x 2 10 4 5 3 5 9 4 6 8 7
  • 17. Transparent Scalability Over 1 million writes/se c! http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html
  • 18. High Availability Cassandra, with its peer-to-peer architecture has no single point of failure.
  • 19. Multi-Geography/Zone Aware Cassandra allows a single logical database to span 1-N datacenters that are geographically dispersed. Also supports a hybrid on-premise/Cloud implementation.
  • 20. Data Redundancy Cassandra allows for customizable data redundancy so that data is completely protected. Also supports rack awareness (data can be replicated between different racks to guard against machine/rack failures).
  • 21. Dynamic Schema Cassandra’s data model – based on Google’s Bigtable – allows a user to store structured, semi-structured, and unstructured data with ease. Portfolio Keyspace Customer Column Family ID Name SSN DOB
  • 22. Simple Manageability AMI installers install and configure an entire multi-node Cloud implementation in minutes. All can be managed and monitored via Web-based console.
  • 23. Cloud Provider/Software Support Cassandra is supported on popular Cloud provider platforms and operating systems.
  • 24. Low Cost Cassandra is open source software and is freely available. Commercial/advanced versions of Cassandra are available from DataStax along with support and other services.
  • 25. How Does Cassandra Stack Up? Cloud Database Attribute Meet? Info Transparent elasticity Nodes can be added/removed online with auto load balancing Transparent scalability Performance increases linearly with node additions. Big Data capable High availability No single point of failure. Offers high degree of availability Multi-geo/zone Supports multi data centers, geos, Cloud zones, read-write anywhere Data redundancy Customizable data replication / redundancy Dynamic Schema Able to manage all key types of data Simple manageability Easy install, setup and managed via Web console Cloud provider/software support Support for all key providers and operating systems Low cost Free if use community; very low cost if using DataStax for advanced functionality and/or support
  • 26. Next Steps Download Cassandra and try it in your own environment or on your Cloud provider’s platform.  Go to www.datastax.com/do wnload  Downloads available for both Cassandra installs that are on premise and AMI for Amazon EC2