SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
THE DATABASE
                         REVOLUTION




                        Robin Bloor, Ph D



Tuesday, August 2, 11
This Presentation

                          Intro: The RDBMS
                          Computer Hardware Trends
                          The NoSQL trend (Either No as
                          in none or NO as in Not Only)
                          What to do...



                        Main Take Away:

                        Database is no longer a commodity



Tuesday, August 2, 11
A Point Of Departure
            In the 1990s, Relational Database
            quickly became the dominant form
            of database.
            The SQL language became the
            dominant data access mechanism.
            The RDBMS conferred mathematical
            respectability on itself and even
            claimed an underlying “Relational
            Algebra.”
            The RDBMS dominated because it
            dealt effectively with transactional
            and BI apps.



Tuesday, August 2, 11
Relational Dogma
          Data and Process should be kept
          separate.
          The database embodies a data
          model within a schema
          Normalization to 3NF (or 5NF) is
          the correct way to design the
          schema
          The query language (SQL) is part
          DDL and part DML (Select,
          Project, Join)
          Ordering doesn’t matter

Tuesday, August 2, 11
The 1990s RDBMS
             The RDBMS of the 1990s was
             physically based on B-tree
             structures and an optimizer.
             This scaled up within reason but
             it scaled out poorly.
             It was fundamentally an index-
             based data store.
             It managed megabytes and
             gigabytes fine.
             But look what happened to
             data....

Tuesday, August 2, 11
Moore’s Law Cubed
                  Moore’s Law suggests that CPU power increases
                  10-fold every 6 years (and other technologies have
                  stayed in step to some degree)
                  Large database volumes have grown 1000-fold:
                    In ~1992 measured in megabytes
                    In ~1998 measured in gigabytes
                    In ~2004 measured in terabytes
                    in ~2010 measured in petabytes
                  Exabytes by ~2016?




Tuesday, August 2, 11
HARDWARE




Tuesday, August 2, 11
RDBMS




Tuesday, August 2, 11
Tuesday, August 2, 11
RDBMS




Tuesday, August 2, 11
A Database is a Cupboard

              Some are transactional (for
              operational systems)

              Some service large queries
              against large data heaps

              Some are content oriented for
              accessing complex objects
              (object based systems mainly)

              All databases need to deliver
              performance


Tuesday, August 2, 11
A Database is a Cupboard

                    RDBMS ✔
              Some are transactional (for
              operational systems)

              Some service large queries
              against large data heaps

              Some are content oriented for
              accessing complex objects
              (object based systems mainly)

              All databases need to deliver
              performance


Tuesday, August 2, 11
A Database is a Cupboard

                    RDBMS ✔
              Some are transactional (for
              operational systems)


                   RDBMS ??
              Some service large queries
              against large data heaps

              Some are content oriented for
              accessing complex objects
              (object based systems mainly)

              All databases need to deliver
              performance


Tuesday, August 2, 11
A Database is a Cupboard

                    RDBMS ✔
              Some are transactional (for
              operational systems)


                   RDBMS ??
              Some service large queries
              against large data heaps


                   RDBMS ??
              Some are content oriented for
              accessing complex objects
              (object based systems mainly)

              All databases need to deliver
              performance


Tuesday, August 2, 11
Hardware Data Points
          Moore’s Law now proceeds by adding
          cores rather than by increasing clock
          speed. Vector registers now standard on
          Intel chips
          Parallelism is now on the rise and will
          eventually become the normal mode of
          processing
          Memory is about 1 million times faster
          than disk and random reads have become
          very expensive in respect of latency
          The Intel processor is now being
          challenged by the ARM processor (it’s
          about heat)


Tuesday, August 2, 11
Memory v Disk




Tuesday, August 2, 11
Memory v Disk
          The decline in memory
          costs is (on current
          trends) likely to have
          memory cheaper than
          disk around 2016
          This means that non-
          volatile SSDs will
          prevail relatively soon.
          SSDs are between
          1000 and 100,000
          times faster than
          spinning disk



Tuesday, August 2, 11
Massive Scale-Out
          CPUS are now
          doubling cores every
          18 months or so.
          This trend, combined
          with memory cost
          trends, suggests that
          massive scale out will
          eventually become a
          much rarer
          requirement.
          But we cannot know
          that for sure.



Tuesday, August 2, 11
Consequences
          SSD will replace disk - but slowly...
          Many DBMS tasks can now be
          handled in memory - but better
          physical architectures are possible
          for this.
          Physical indexes are becoming
          irrelevant
          Scale out and parallelism are now
          the driving force for large data
          volume applications.
          The physical architecture of the
          traditional RDBMS is now an
          anachronism


Tuesday, August 2, 11
NoSQL




Tuesday, August 2, 11
A Plethora of Databases
                  4th Dimension, Adabas D, AllegroGraph, Alpha Five, Altibase, Apache Derby, Aster
                   Data, Azure Table Storage, BaseX, Berkeley DB, Bigdata, BlackRay, CA-Datacom,
                  Cassandra, Chordless, Citrusleaf , Clarion, Cloudata, Cloudera, Clustrix, CouchDB,
                      Network                            OLAP                                 OR
                    CSQL, CUBRID, Daffodil database, Data Management Center (DMC), Database
                       DBMS
                                       RDBMS
                                                         DBMS
                                                                          ODBMS
                                                                                             DBMS
                 Management Library, DataEase, Dataphor, DB-Fast, db4o, Derby aka Java DB, DEX,
                  Dynomite, EffiProz, ElevateDB, Empress Embedded Database, EnterpriseDB, eXist,
                    eXtremeDB, Faircom C-Tree, fastDB, FileDB, FileMaker Pro, Firebird, FlockDB,
                   FrontBase, GenieDB, GigaSpaces, Gladius DB, Greenplum, GroveSite, GT.M, H2,
                  Hadoop / HBase, HamsterDB, Hazelcast, Helix database, Hibari, HPCC, HSQLDB,
                                                                           Open                In
                HyperGraphDB, Hypertable, IBM DB2, IBM DB2 Express-C, IBM Lotus Approach, IBM
                        Text           Content            XML
                                                                          Source            Memory
                       DBMS             DBMS             DBMS
                     Lotus/Domino, Infinite Graph, Infobright, InfoGrid, Informix, Ingres, InterBase,
                                                                           DBMS              DBMS
               Intersystems Cache, InterSystems Caché, ISIS Family, KAI, Kognitio, LightCloud, Linter,
                   Magma, MariaDB, Mark Logic Server, MaxDB, Mckoi SQL Database, MEMBASE,
                  MemcacheDB, Microsoft Access, Microsoft Jet Database Engine (part of Microsoft
               Access), Microsoft SQL Server, Microsoft SQL Server Express, Microsoft Visual FoxPro,
                 Mimer SQL, Mnesia , Analytic
                      Column           MonetDB, MongoDB, Morantex, mSQL, MySQL, Neo4J, NEO,
                                                        Streams          Temporal
                                                                                            Hadoop
                       Store NonStop SQL, Objectivity, Openbase, OpenInsight, OpenLink HBASE
                   Netezza,                                                                & Virtuoso,
                                       DBMS              DBMS              DBMS
                OpenLink Virtuoso, OpenLink Virtuoso Universal Server, OpenQM, Oracle,(MPP) Rdb
                       DBMS
                                                                                              Oracle
                    for OpenVMS, OrientDB, Panorama, Perst, PervasiveSQL, PicoLisp, Pincaster,
                 PostgreSQL, Prevayler, Progress Software, Qizx, Queplix, RaptorDB, RavenDB, RDM
                 Embedded, RDM Server, Recutils, Redis, Riak, SAND CDBMS, Sav Zigzag, Scalaris,
                  Scalien, SciDB, ScimoreDB, Sedna, SisoDB, SmallSQL, solidDB, Sones, SQLBase,
                                       Hyper-
                       Graph                           Algebraic           Cloud             Triple
                   SQLDB, SQLite, Starcounter, Sterling, Stratosphere, STSdb, Sybase, Sybase IQ,
                                        media
                       DBMS                              DBMS              DBMS              Stores
                tdbengine, Teradata, Terrastore, The SAS system, ThruDB, TimesTen, Tokutek , Trinity,
                                        DBMS
                txtSQL, U2, UniData, UniVerse, Valentina, Versant, VertexDB , Vertica, VistaDB, VMDS,
                              Voldemort, WCE SL Plus, XSPRADA, Yserial, ZODB, Zoduna




Tuesday, August 2, 11
RDBMS & SQL As Anachronisms
          For big BI, RDBMS has been
          superseded by column store dbms
          primarily because it didn’t scale out
          and indexes have become far less
          important.
          The use of snowflake schemas and
          star schemas had already
          demonstrated that 3NF was a limited
          modeling technique and nothing
          more.
          And then came Hadoop & MapReduce
          for massive scale-out - which cares
          nothing for SQL or RDBMS


Tuesday, August 2, 11
A Fundamental Error
          Actions: Add, Modify, Delete,
          Archive
          From day 1 there was a fundamental
          error in the simple mechanics of
          database and file systems.
          When you update data you destroy
          the old value. No audit trail.
          A correct theory of data was
          invented by (perhaps) Luca Pacioli.
          It is the basis of accounting.
          A few databases (Firebird is one)
          were built so that data was only ever
          added or archived.


Tuesday, August 2, 11
The Ordering Of Data
          “A data set is an unordered
          collection of unique, non-duplicated
          items.”
          This is an absurd constraint to place
          upon data, as data is naturally
          ordered by time if by nothing else.
               Events are ordered by time.
               Changes to entities are ordered
               by time
          There are lots of applications.
          requiring time series capability.
          This has led to TSDB products like
          Streambase, Vhayu, Open TSDB,
          etc.


Tuesday, August 2, 11
The Separation of Data and Process
          The assumption was that this
          separation could be enforced
          But when you try to enforce it, you      Process
          forever encounter data and process
          locked together in a guilty embrace.
          It is a wrong separation of concerns.
                                                    SQL      SCHEMA
          In truth it cannot be enforced without
          there being a true algebra of data
          So many databases (object
          databases and other NoSQL
          databases) do not enforce it.            DBMS

          However their interfaces to data are
          not perfect either.




Tuesday, August 2, 11
Relational Algebra Isn’t An Algebra
          Set aside that fact that RDBMS
          focus so strongly on Table structures
          that they cannot naturally represent
          other important data structures
          (such as BOMP and MOLAP).
          And that RDBMS rail against the
          ordering of data (“No order”)
          Ignore the stored procedures (which
          violate the separation of data and
          process).
          Even so Relational Algebra is not
          even an algebra. (NULLs?)
          There is at least one algebraic
          (NoSQL) database



Tuesday, August 2, 11
The SQL Barrier
          SQL has:
            DDL (for data definition)                                     SQL
                                                                        Barrier
               DML (for Select, Project and Join)
                                                         Results                  Or results
               But it has no MML or TML                processing
                                                    must be done here
                                                                                  processing
                                                                               must be done here

          Usually result sets are brought to the
          client for further manipulation, but
          using them for further data access
                                                                         SQL
          becomes problematic.
          Conclusions:                                                            Analytic
                                                                                   DBMS

               This separation of data from
               process is arbitrary and unhelpful
               Any database to which this
               doesn’t apply is NoSQL



Tuesday, August 2, 11
Other NDBMS Directions
          Some NDBMS do not attempt to provide all ACID
          properties. (Atomicity, Consistency, Isolation, Durability)
          Some NDBMS deploy a distributed scale-out
          architecture with data redundancy.
          XML DBMS using XQuery are NDBMS.
          Some documents stores are NDBMS (OrientDB,
          Terrastore, etc.)
          Object databases are NDBMS (Gemstone, Objectivity,
          ObjectStore, etc.)
          Key value stores = schema-less stores (Cassandra,
          MongoDB, Berkeley DB, etc.)
          Graph DBMS (DEX, OrientDB, etc.) are NDMBS
          Large data pools (BigTable, Hbase, Mnesia, etc.) are
          NDBMS


Tuesday, August 2, 11
What To Do...




Tuesday, August 2, 11
What Is The Problem You Are
                   Trying To Solve?
                The primary message of this presentation is that
                database is no longer a commodity (if it ever
                was).
                Despite faults and weaknesses the General
                Purpose Relations Database works fine for many
                areas of application and:
                  It is well understood
                  Skills (for any popular product) are abundant
                  It can be inexpensive (by license or Open
                  Source)
                Beyond such products, it is “horses for courses”
                and “caveat emptor.”


Tuesday, August 2, 11
Other Selection Criteria
                Don’t fall for fashion.
                Proven performance?
                Skills, both for design and for administration.
                Interfaces & middleware
                The hardware bill.
                Product roadmap.
                External support/internal support.
                Calculate a TCO (note that even for expensive
                DBMS the licenses fees are rarely more than
                15% of the TCO)




Tuesday, August 2, 11
Take Aways
                        Hardware trends have brought change,
                        will bring more change
                        There are many RDBMS weaknesses
                        There are a huge number of “new”
                        database products both
                         No SQL Whatsoever, and
                         Not Only SQL
                        Select database products with caution
                        Main Take Away:

                        Database is no longer a commodity


Tuesday, August 2, 11
Tuesday, August 2, 11
Thank You
                        For Your
                        Attention



Tuesday, August 2, 11

Más contenido relacionado

La actualidad más candente

NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and UsesSuvradeep Rudra
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sqlRam kumar
 
Introduction to NoSQL Databases
Introduction to NoSQL DatabasesIntroduction to NoSQL Databases
Introduction to NoSQL DatabasesDerek Stainer
 
NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introductionPooyan Mehrparvar
 
NoSQL Databases: Why, what and when
NoSQL Databases: Why, what and whenNoSQL Databases: Why, what and when
NoSQL Databases: Why, what and whenLorenzo Alberton
 
Non Relational Databases
Non Relational DatabasesNon Relational Databases
Non Relational DatabasesChris Baglieri
 
NoSQL Data Architecture Patterns
NoSQL Data ArchitecturePatternsNoSQL Data ArchitecturePatterns
NoSQL Data Architecture PatternsMaynooth University
 
NoSQL-Database-Concepts
NoSQL-Database-ConceptsNoSQL-Database-Concepts
NoSQL-Database-ConceptsBhaskar Gunda
 
An introduction to Nosql
An introduction to NosqlAn introduction to Nosql
An introduction to Nosqlgreprep
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational DatabasesUdi Bauman
 
Intro to NoSQL and MongoDB
Intro to NoSQL and MongoDBIntro to NoSQL and MongoDB
Intro to NoSQL and MongoDBDATAVERSITY
 
Backbone using Extensible Database APIs over HTTP
Backbone using Extensible Database APIs over HTTPBackbone using Extensible Database APIs over HTTP
Backbone using Extensible Database APIs over HTTPMax Neunhöffer
 

La actualidad más candente (20)

Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
NOSQL Databases types and Uses
NOSQL Databases types and UsesNOSQL Databases types and Uses
NOSQL Databases types and Uses
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
Nosql seminar
Nosql seminarNosql seminar
Nosql seminar
 
Non relational databases-no sql
Non relational databases-no sqlNon relational databases-no sql
Non relational databases-no sql
 
Selecting best NoSQL
Selecting best NoSQL Selecting best NoSQL
Selecting best NoSQL
 
Introduction to NoSQL Databases
Introduction to NoSQL DatabasesIntroduction to NoSQL Databases
Introduction to NoSQL Databases
 
NoSQL databases - An introduction
NoSQL databases - An introductionNoSQL databases - An introduction
NoSQL databases - An introduction
 
NoSQL Databases: Why, what and when
NoSQL Databases: Why, what and whenNoSQL Databases: Why, what and when
NoSQL Databases: Why, what and when
 
Non Relational Databases
Non Relational DatabasesNon Relational Databases
Non Relational Databases
 
Introduction to NoSQL
Introduction to NoSQLIntroduction to NoSQL
Introduction to NoSQL
 
NoSQL Data Architecture Patterns
NoSQL Data ArchitecturePatternsNoSQL Data ArchitecturePatterns
NoSQL Data Architecture Patterns
 
NoSQL-Database-Concepts
NoSQL-Database-ConceptsNoSQL-Database-Concepts
NoSQL-Database-Concepts
 
NOSQL vs SQL
NOSQL vs SQLNOSQL vs SQL
NOSQL vs SQL
 
An introduction to Nosql
An introduction to NosqlAn introduction to Nosql
An introduction to Nosql
 
Nonrelational Databases
Nonrelational DatabasesNonrelational Databases
Nonrelational Databases
 
NoSQL databases
NoSQL databasesNoSQL databases
NoSQL databases
 
Intro to NoSQL and MongoDB
Intro to NoSQL and MongoDBIntro to NoSQL and MongoDB
Intro to NoSQL and MongoDB
 
RDBMS vs NoSQL
RDBMS vs NoSQLRDBMS vs NoSQL
RDBMS vs NoSQL
 
Backbone using Extensible Database APIs over HTTP
Backbone using Extensible Database APIs over HTTPBackbone using Extensible Database APIs over HTTP
Backbone using Extensible Database APIs over HTTP
 

Similar a The Coming Database Revolution

Database management-system
Database management-systemDatabase management-system
Database management-systemkalasalingam
 
Prepare Your Data For The Cloud
Prepare Your Data For The CloudPrepare Your Data For The Cloud
Prepare Your Data For The CloudIndicThreads
 
Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12mark madsen
 
Database Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastDatabase Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastInside Analysis
 
05 No SQL Sudarshan.ppt
05 No SQL Sudarshan.ppt05 No SQL Sudarshan.ppt
05 No SQL Sudarshan.pptAnandKonj1
 
No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'
No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'
No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'sankarapu posibabu
 
No SQL Databases.ppt
No SQL Databases.pptNo SQL Databases.ppt
No SQL Databases.pptssuser8c8fc1
 
Persistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the AnswerPersistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the AnswerNeil Raden
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfajajkhan16
 
Where Does Big Data Meet Big Database - QCon 2012
Where Does Big Data Meet Big Database - QCon 2012Where Does Big Data Meet Big Database - QCon 2012
Where Does Big Data Meet Big Database - QCon 2012Ben Stopford
 
NoSQL powerpoint presentation difference with rdbms
NoSQL powerpoint presentation difference with rdbmsNoSQL powerpoint presentation difference with rdbms
NoSQL powerpoint presentation difference with rdbmsAtulKabbur
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesshnkr_rmchndrn
 
Implementation of nosql for robotics
Implementation of nosql for roboticsImplementation of nosql for robotics
Implementation of nosql for roboticsJoão Gabriel Lima
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...InfiniteGraph
 

Similar a The Coming Database Revolution (20)

Preparing yourdataforcloud
Preparing yourdataforcloudPreparing yourdataforcloud
Preparing yourdataforcloud
 
No sql
No sqlNo sql
No sql
 
Database management-system
Database management-systemDatabase management-system
Database management-system
 
Prepare Your Data For The Cloud
Prepare Your Data For The CloudPrepare Your Data For The Cloud
Prepare Your Data For The Cloud
 
Preparing your data for the cloud
Preparing your data for the cloudPreparing your data for the cloud
Preparing your data for the cloud
 
Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12Database revolution opening webcast 01 18-12
Database revolution opening webcast 01 18-12
 
Database Revolution - Exploratory Webcast
Database Revolution - Exploratory WebcastDatabase Revolution - Exploratory Webcast
Database Revolution - Exploratory Webcast
 
Report 2.0.docx
Report 2.0.docxReport 2.0.docx
Report 2.0.docx
 
05 No SQL Sudarshan.ppt
05 No SQL Sudarshan.ppt05 No SQL Sudarshan.ppt
05 No SQL Sudarshan.ppt
 
No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'
No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'
No SQL Databases sdfghjkl;sdfghjkl;sdfghjkl;'
 
No SQL Databases.ppt
No SQL Databases.pptNo SQL Databases.ppt
No SQL Databases.ppt
 
Persistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the AnswerPersistence of memory: In-memory Is Not Often the Answer
Persistence of memory: In-memory Is Not Often the Answer
 
NOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdfNOSQL in big data is the not only structure langua.pdf
NOSQL in big data is the not only structure langua.pdf
 
Where Does Big Data Meet Big Database - QCon 2012
Where Does Big Data Meet Big Database - QCon 2012Where Does Big Data Meet Big Database - QCon 2012
Where Does Big Data Meet Big Database - QCon 2012
 
Report 1.0.docx
Report 1.0.docxReport 1.0.docx
Report 1.0.docx
 
NoSQL powerpoint presentation difference with rdbms
NoSQL powerpoint presentation difference with rdbmsNoSQL powerpoint presentation difference with rdbms
NoSQL powerpoint presentation difference with rdbms
 
Navigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skiesNavigating NoSQL in cloudy skies
Navigating NoSQL in cloudy skies
 
Implementation of nosql for robotics
Implementation of nosql for roboticsImplementation of nosql for robotics
Implementation of nosql for robotics
 
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
New Data Technologies, Graph Computing and Relationship Discovery in the Ente...
 
Nosql and newsql
Nosql and newsqlNosql and newsql
Nosql and newsql
 

Más de DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

Más de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Último

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 

Último (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 

The Coming Database Revolution

  • 1. THE DATABASE REVOLUTION Robin Bloor, Ph D Tuesday, August 2, 11
  • 2. This Presentation Intro: The RDBMS Computer Hardware Trends The NoSQL trend (Either No as in none or NO as in Not Only) What to do... Main Take Away: Database is no longer a commodity Tuesday, August 2, 11
  • 3. A Point Of Departure In the 1990s, Relational Database quickly became the dominant form of database. The SQL language became the dominant data access mechanism. The RDBMS conferred mathematical respectability on itself and even claimed an underlying “Relational Algebra.” The RDBMS dominated because it dealt effectively with transactional and BI apps. Tuesday, August 2, 11
  • 4. Relational Dogma Data and Process should be kept separate. The database embodies a data model within a schema Normalization to 3NF (or 5NF) is the correct way to design the schema The query language (SQL) is part DDL and part DML (Select, Project, Join) Ordering doesn’t matter Tuesday, August 2, 11
  • 5. The 1990s RDBMS The RDBMS of the 1990s was physically based on B-tree structures and an optimizer. This scaled up within reason but it scaled out poorly. It was fundamentally an index- based data store. It managed megabytes and gigabytes fine. But look what happened to data.... Tuesday, August 2, 11
  • 6. Moore’s Law Cubed Moore’s Law suggests that CPU power increases 10-fold every 6 years (and other technologies have stayed in step to some degree) Large database volumes have grown 1000-fold: In ~1992 measured in megabytes In ~1998 measured in gigabytes In ~2004 measured in terabytes in ~2010 measured in petabytes Exabytes by ~2016? Tuesday, August 2, 11
  • 11. A Database is a Cupboard Some are transactional (for operational systems) Some service large queries against large data heaps Some are content oriented for accessing complex objects (object based systems mainly) All databases need to deliver performance Tuesday, August 2, 11
  • 12. A Database is a Cupboard RDBMS ✔ Some are transactional (for operational systems) Some service large queries against large data heaps Some are content oriented for accessing complex objects (object based systems mainly) All databases need to deliver performance Tuesday, August 2, 11
  • 13. A Database is a Cupboard RDBMS ✔ Some are transactional (for operational systems) RDBMS ?? Some service large queries against large data heaps Some are content oriented for accessing complex objects (object based systems mainly) All databases need to deliver performance Tuesday, August 2, 11
  • 14. A Database is a Cupboard RDBMS ✔ Some are transactional (for operational systems) RDBMS ?? Some service large queries against large data heaps RDBMS ?? Some are content oriented for accessing complex objects (object based systems mainly) All databases need to deliver performance Tuesday, August 2, 11
  • 15. Hardware Data Points Moore’s Law now proceeds by adding cores rather than by increasing clock speed. Vector registers now standard on Intel chips Parallelism is now on the rise and will eventually become the normal mode of processing Memory is about 1 million times faster than disk and random reads have become very expensive in respect of latency The Intel processor is now being challenged by the ARM processor (it’s about heat) Tuesday, August 2, 11
  • 16. Memory v Disk Tuesday, August 2, 11
  • 17. Memory v Disk The decline in memory costs is (on current trends) likely to have memory cheaper than disk around 2016 This means that non- volatile SSDs will prevail relatively soon. SSDs are between 1000 and 100,000 times faster than spinning disk Tuesday, August 2, 11
  • 18. Massive Scale-Out CPUS are now doubling cores every 18 months or so. This trend, combined with memory cost trends, suggests that massive scale out will eventually become a much rarer requirement. But we cannot know that for sure. Tuesday, August 2, 11
  • 19. Consequences SSD will replace disk - but slowly... Many DBMS tasks can now be handled in memory - but better physical architectures are possible for this. Physical indexes are becoming irrelevant Scale out and parallelism are now the driving force for large data volume applications. The physical architecture of the traditional RDBMS is now an anachronism Tuesday, August 2, 11
  • 21. A Plethora of Databases 4th Dimension, Adabas D, AllegroGraph, Alpha Five, Altibase, Apache Derby, Aster Data, Azure Table Storage, BaseX, Berkeley DB, Bigdata, BlackRay, CA-Datacom, Cassandra, Chordless, Citrusleaf , Clarion, Cloudata, Cloudera, Clustrix, CouchDB, Network OLAP OR CSQL, CUBRID, Daffodil database, Data Management Center (DMC), Database DBMS RDBMS DBMS ODBMS DBMS Management Library, DataEase, Dataphor, DB-Fast, db4o, Derby aka Java DB, DEX, Dynomite, EffiProz, ElevateDB, Empress Embedded Database, EnterpriseDB, eXist, eXtremeDB, Faircom C-Tree, fastDB, FileDB, FileMaker Pro, Firebird, FlockDB, FrontBase, GenieDB, GigaSpaces, Gladius DB, Greenplum, GroveSite, GT.M, H2, Hadoop / HBase, HamsterDB, Hazelcast, Helix database, Hibari, HPCC, HSQLDB, Open In HyperGraphDB, Hypertable, IBM DB2, IBM DB2 Express-C, IBM Lotus Approach, IBM Text Content XML Source Memory DBMS DBMS DBMS Lotus/Domino, Infinite Graph, Infobright, InfoGrid, Informix, Ingres, InterBase, DBMS DBMS Intersystems Cache, InterSystems Caché, ISIS Family, KAI, Kognitio, LightCloud, Linter, Magma, MariaDB, Mark Logic Server, MaxDB, Mckoi SQL Database, MEMBASE, MemcacheDB, Microsoft Access, Microsoft Jet Database Engine (part of Microsoft Access), Microsoft SQL Server, Microsoft SQL Server Express, Microsoft Visual FoxPro, Mimer SQL, Mnesia , Analytic Column MonetDB, MongoDB, Morantex, mSQL, MySQL, Neo4J, NEO, Streams Temporal Hadoop Store NonStop SQL, Objectivity, Openbase, OpenInsight, OpenLink HBASE Netezza, & Virtuoso, DBMS DBMS DBMS OpenLink Virtuoso, OpenLink Virtuoso Universal Server, OpenQM, Oracle,(MPP) Rdb DBMS Oracle for OpenVMS, OrientDB, Panorama, Perst, PervasiveSQL, PicoLisp, Pincaster, PostgreSQL, Prevayler, Progress Software, Qizx, Queplix, RaptorDB, RavenDB, RDM Embedded, RDM Server, Recutils, Redis, Riak, SAND CDBMS, Sav Zigzag, Scalaris, Scalien, SciDB, ScimoreDB, Sedna, SisoDB, SmallSQL, solidDB, Sones, SQLBase, Hyper- Graph Algebraic Cloud Triple SQLDB, SQLite, Starcounter, Sterling, Stratosphere, STSdb, Sybase, Sybase IQ, media DBMS DBMS DBMS Stores tdbengine, Teradata, Terrastore, The SAS system, ThruDB, TimesTen, Tokutek , Trinity, DBMS txtSQL, U2, UniData, UniVerse, Valentina, Versant, VertexDB , Vertica, VistaDB, VMDS, Voldemort, WCE SL Plus, XSPRADA, Yserial, ZODB, Zoduna Tuesday, August 2, 11
  • 22. RDBMS & SQL As Anachronisms For big BI, RDBMS has been superseded by column store dbms primarily because it didn’t scale out and indexes have become far less important. The use of snowflake schemas and star schemas had already demonstrated that 3NF was a limited modeling technique and nothing more. And then came Hadoop & MapReduce for massive scale-out - which cares nothing for SQL or RDBMS Tuesday, August 2, 11
  • 23. A Fundamental Error Actions: Add, Modify, Delete, Archive From day 1 there was a fundamental error in the simple mechanics of database and file systems. When you update data you destroy the old value. No audit trail. A correct theory of data was invented by (perhaps) Luca Pacioli. It is the basis of accounting. A few databases (Firebird is one) were built so that data was only ever added or archived. Tuesday, August 2, 11
  • 24. The Ordering Of Data “A data set is an unordered collection of unique, non-duplicated items.” This is an absurd constraint to place upon data, as data is naturally ordered by time if by nothing else. Events are ordered by time. Changes to entities are ordered by time There are lots of applications. requiring time series capability. This has led to TSDB products like Streambase, Vhayu, Open TSDB, etc. Tuesday, August 2, 11
  • 25. The Separation of Data and Process The assumption was that this separation could be enforced But when you try to enforce it, you Process forever encounter data and process locked together in a guilty embrace. It is a wrong separation of concerns. SQL SCHEMA In truth it cannot be enforced without there being a true algebra of data So many databases (object databases and other NoSQL databases) do not enforce it. DBMS However their interfaces to data are not perfect either. Tuesday, August 2, 11
  • 26. Relational Algebra Isn’t An Algebra Set aside that fact that RDBMS focus so strongly on Table structures that they cannot naturally represent other important data structures (such as BOMP and MOLAP). And that RDBMS rail against the ordering of data (“No order”) Ignore the stored procedures (which violate the separation of data and process). Even so Relational Algebra is not even an algebra. (NULLs?) There is at least one algebraic (NoSQL) database Tuesday, August 2, 11
  • 27. The SQL Barrier SQL has: DDL (for data definition) SQL Barrier DML (for Select, Project and Join) Results Or results But it has no MML or TML processing must be done here processing must be done here Usually result sets are brought to the client for further manipulation, but using them for further data access SQL becomes problematic. Conclusions: Analytic DBMS This separation of data from process is arbitrary and unhelpful Any database to which this doesn’t apply is NoSQL Tuesday, August 2, 11
  • 28. Other NDBMS Directions Some NDBMS do not attempt to provide all ACID properties. (Atomicity, Consistency, Isolation, Durability) Some NDBMS deploy a distributed scale-out architecture with data redundancy. XML DBMS using XQuery are NDBMS. Some documents stores are NDBMS (OrientDB, Terrastore, etc.) Object databases are NDBMS (Gemstone, Objectivity, ObjectStore, etc.) Key value stores = schema-less stores (Cassandra, MongoDB, Berkeley DB, etc.) Graph DBMS (DEX, OrientDB, etc.) are NDMBS Large data pools (BigTable, Hbase, Mnesia, etc.) are NDBMS Tuesday, August 2, 11
  • 29. What To Do... Tuesday, August 2, 11
  • 30. What Is The Problem You Are Trying To Solve? The primary message of this presentation is that database is no longer a commodity (if it ever was). Despite faults and weaknesses the General Purpose Relations Database works fine for many areas of application and: It is well understood Skills (for any popular product) are abundant It can be inexpensive (by license or Open Source) Beyond such products, it is “horses for courses” and “caveat emptor.” Tuesday, August 2, 11
  • 31. Other Selection Criteria Don’t fall for fashion. Proven performance? Skills, both for design and for administration. Interfaces & middleware The hardware bill. Product roadmap. External support/internal support. Calculate a TCO (note that even for expensive DBMS the licenses fees are rarely more than 15% of the TCO) Tuesday, August 2, 11
  • 32. Take Aways Hardware trends have brought change, will bring more change There are many RDBMS weaknesses There are a huge number of “new” database products both No SQL Whatsoever, and Not Only SQL Select database products with caution Main Take Away: Database is no longer a commodity Tuesday, August 2, 11
  • 34. Thank You For Your Attention Tuesday, August 2, 11