Introduction to NoSQL Databases

MongoDB, Neo4j & Co.
Tobias Trelle
@tobiastrelle

Senior IT Consultant @codecentric
Software Architect
Java/JEE/RDBMS background
Committer Spring Data
ttrelle

What’s My Name Again?
NoSQL
It’s not about using no SQL any more but about thinking beyond SQL for certain problems!

Why NoSQL?
NoSQL databases help to solve the three problems of Big Data (Gartner, 2009):
• Volume (amount of data)
• Variety (different structures)
• Velocity (speed)

The Trouble with Scaling
Vertical vs. Horizontal Scalability

Vertical Scalability
[Figure: a single node (RAM, CPU, Storage) shown in three steps; labels: Size, Cost]
"Big" databases scale vertically, but at what price?

Horizontal Scalability
[Figure: nodes, each with its own RAM, CPU and Storage, shown in three steps: 1, 5 and 15 nodes]

Vertical vs. Horizontal Scalability
NoSQL databases scale horizontally.
Need an example?
Messaging system of [logo missing] (powered by [logo missing]):
- 100-node cluster
- 87 billion read/write operations per day
- 9+ billion messages per day
- 4 PB compressed data per day
- 45% growth per month
Source: Nicolas Spiegelberg, Berlin Buzzwords, June 2012

CAP Theorem a.k.a. Brewer’s Theorem (2000)
For a distributed computer system it is impossible to simultaneously provide all three of these guarantees:
• Consistency: all nodes see the same data at the same time
• Availability: a guarantee that every request receives a response
• Partition Tolerance: failure of single nodes doesn’t affect the overall system

Visual Guide to (No)SQL systems
[Figure: (No)SQL systems placed between Consistency, Availability and Partition Tolerance]

ACID vs. BASE
ACID (1983, RDBMS):
• Atomicity
• Consistency
• Isolation
• Durability

ACID vs. BASE
ACID is a good idea, but it is not an unbreakable law!

ACID vs. BASE
BASE (2008, NoSQL):
• Basically Available
• Soft State
• Eventually Consistent

Overview of NoSQL Categories
• Key-Value Stores
• Column Stores
• Document Stores
• Graph Databases
[Figure: schematic data model for each category]

Current trends in the (No)SQL World
http://www.db-engines.com

MongoDB
from "humongous" (English: gigantic, huge)

MongoDB > Facts
Version: 3.0.5
Category: Document-oriented
Queries: • Query by example + rich operators
• Aggregations / MapReduce
• Geodata, full-text search
Indexes: Primary index + freely selectable secondary indexes, including compound ones
Replication: Master/slave with automatic failover
Sharding: Auto-sharding based on a selectable key
TX: Atomicity at the level of a single document
API: • Binary TCP/IP-based protocol
• Drivers for many languages

MongoDB > Facts
License: AGPL (DB), Apache V2 (drivers)
Vendor: MongoDB Inc. (formerly 10gen)
Web: http://www.mongodb.org
GitHub: https://github.com/mongodb

MongoDB > Document
{
  "_id" : ObjectId("53e3663ccb3bd259f9252f67"),
  "typ" : ["gastro", "kultur"],
  "name" : "Unperfekthaus",
  "tags" : "uph unperfekt perfekt haus essen",
  "desc" : "Im Unperfekthaus bekommen Künstler & Gründer ... ",
  "adresse" : {
    "str" : "Friedrich-Ebert-Straße 18",
    "plz" : 45127,
    "ort" : "Essen"
  },
  "location" : {
    "type" : "Point",
    "coordinates" : [ 7.0075, 51.45902 ]
  }
}
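The MongoDB facts list query by example as the basic query style. As a rough illustration of that idea, the following pure-Python sketch checks the document above against an example. It is not MongoDB's matching engine and does not use a driver: the helpers `lookup` and `matches`, and the way dotted paths and array fields are handled, are assumptions made for this example. The `_id` and `desc` fields are left out for brevity.

```python
# Sketch of query-by-example matching (hypothetical helpers, not MongoDB code).

doc = {
    "typ": ["gastro", "kultur"],
    "name": "Unperfekthaus",
    "tags": "uph unperfekt perfekt haus essen",
    "adresse": {"str": "Friedrich-Ebert-Straße 18", "plz": 45127, "ort": "Essen"},
    "location": {"type": "Point", "coordinates": [7.0075, 51.45902]},
}


def lookup(document, path):
    """Follow a dotted path such as 'adresse.ort'; return None if it is missing."""
    current = document
    for part in path.split("."):
        if not isinstance(current, dict) or part not in current:
            return None
        current = current[part]
    return current


def matches(document, example):
    """Return True if every field of the example agrees with the document.

    A list-valued field matches a scalar if the list contains that scalar;
    otherwise the values must be equal.
    """
    for path, expected in example.items():
        actual = lookup(document, path)
        if isinstance(actual, list) and not isinstance(expected, list):
            if expected not in actual:
                return False
        elif actual != expected:
            return False
    return True


print(matches(doc, {"name": "Unperfekthaus"}))
print(matches(doc, {"typ": "kultur", "adresse.ort": "Essen"}))
print(matches(doc, {"adresse.plz": 12345}))
```

The first two examples match the document and the third does not, because the postal code differs.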

MongoDB > Tools > Mongo Shell

MongoDB > Tools > Robomongo

Neo4j > Facts
Version: 2.2.5
Category: Graph database
Queries: Query languages Cypher and Gremlin:
• Graph traversal
• Aggregation, projection, …
Indexes: Primary index + freely selectable secondary indexes, including compound ones
Replication: Master/slave with automatic failover
Sharding: No disjoint distribution, but so-called cache sharding
TX: ACID; with JDBC also JTA

Neo4j > Facts
API: • Java API (embedded)
• REST / HTTP
• JDBC
License: GPL v3 (Community Edition)
Vendor: Neo Technology Inc.
Web: http://www.neo4j.org/
GitHub: https://github.com/neo4j

Neo4j > Graph
Graph := (V, E) with
V := set of nodes (vertices)
E := subset of (V x V) (edges)
Neo4j graph :=
directed graph in which nodes and relationships can have properties: a so-called property graph

Neo4j > Graph
[Figure: example property graph]
Nodes:
• :Person (name: "Joe", age: 35)
• :Database :Graph (name: "Neo4j")
• :Database :Document (name: "MongoDB")
Relationships:
• :KNOWS (skill: 10, since: 2007)
• :KNOWS (skill: 5)
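To make the property-graph idea concrete, here is a small Python sketch of the example graph: nodes carry labels and properties, and directed relationships carry a type and properties. The classes `Node` and `Relationship` and the helper `known_by` are hypothetical names for this illustration and are not the Neo4j API. The slide text does not say which database each :KNOWS relationship points to, so the assignment below is an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    labels: frozenset
    properties: dict = field(default_factory=dict)


@dataclass
class Relationship:
    start: Node          # relationships are directed: start -> end
    rel_type: str
    end: Node
    properties: dict = field(default_factory=dict)


joe = Node(frozenset({"Person"}), {"name": "Joe", "age": 35})
neo4j = Node(frozenset({"Database", "Graph"}), {"name": "Neo4j"})
mongodb = Node(frozenset({"Database", "Document"}), {"name": "MongoDB"})

# Assumption: both relationships start at Joe, and the one with
# skill 10 / since 2007 points to Neo4j.
relationships = [
    Relationship(joe, "KNOWS", neo4j, {"skill": 10, "since": 2007}),
    Relationship(joe, "KNOWS", mongodb, {"skill": 5}),
]


def known_by(person, rels, min_skill=0):
    """Names of the nodes that person KNOWS with at least min_skill."""
    return [
        r.end.properties["name"]
        for r in rels
        if r.start is person
        and r.rel_type == "KNOWS"
        and r.properties.get("skill", 0) >= min_skill
    ]


print(known_by(joe, relationships))
print(known_by(joe, relationships, min_skill=8))
```

Filtering on a relationship property (here `skill`) is the kind of question a property graph answers directly, because the property lives on the edge rather than on either node.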

Neo4j > Tools > Web-Interface

Redis > Facts
Version: 3.0
Category: Key-value database
Queries: Key-only, pub/sub
Indexes: Access is by key only
Replication: Master/slave with automatic failover
Sharding: Hash-based, disjoint distribution of the data
TX: ACID with optimistic locking on a set of keys
API: • Protocol: ASCII over TCP/IP
• Drivers for many languages

Redis > Facts
License: Three-clause BSD license
Vendor: Salvatore Sanfilippo (Pivotal) and Pieter Noordhuis
Web: http://www.redis.io/
GitHub: https://github.com/antirez/redis
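The Redis facts describe sharding as a hash-based, disjoint distribution of the data. The general idea can be sketched in a few lines of Python: a stable hash of the key decides which single shard owns it. This is only an illustration under simplifying assumptions (MD5 modulo a fixed shard count, hypothetical helpers `shard_for` and `distribute`); Redis Cluster itself assigns keys to hash slots with a different scheme.

```python
import hashlib


def shard_for(key: str, shard_count: int) -> int:
    """Map a key to exactly one shard index using a stable hash."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % shard_count


def distribute(keys, shard_count):
    """Place every key on the one shard its hash selects."""
    shards = [[] for _ in range(shard_count)]
    for key in keys:
        shards[shard_for(key, shard_count)].append(key)
    return shards


keys = ["a:1", "l:12", "s:1", "h:1", "z:1"]
shards = distribute(keys, 3)
for index, members in enumerate(shards):
    print(index, members)
```

Because the hash is deterministic, every client computes the same shard for a given key, and no key is stored on more than one shard. A plain modulo scheme like this one moves most keys when the shard count changes, which is one reason real systems use more elaborate mappings.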

Redis > Example
Type    Key    Value
Simple  a:1    "hello Redis"
List    l:12   "hello", "Redis", "hello"
Set     s:1    "hello", "Redis"
Hash    h:1    name "Redis", typ "key-value", version 2.8
ZSet    z:1    42 "/", 23 "/inbox", 2 "/inbox/joe"
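The five value types in the example can be mimicked with Python built-ins to show how they differ. This is a plain-Python stand-in, not a Redis client; the helper `zrange_by_score` is a hypothetical name, and the list contents follow a reading of the slide layout that may not match the original figure exactly.

```python
# Plain-Python stand-ins for the five value types (no Redis server involved).
store = {
    "a:1": "hello Redis",                                   # simple string
    "l:12": ["hello", "Redis", "hello"],                    # list: ordered, duplicates allowed
    "s:1": {"hello", "Redis"},                              # set: unordered, no duplicates
    "h:1": {"name": "Redis", "typ": "key-value", "version": "2.8"},  # hash: field -> value
    "z:1": {"/": 42, "/inbox": 23, "/inbox/joe": 2},        # sorted set: member -> score
}


def zrange_by_score(zset, low, high):
    """Members with a score in [low, high], ordered by ascending score."""
    hits = [(score, member) for member, score in zset.items() if low <= score <= high]
    return [member for score, member in sorted(hits)]


print(zrange_by_score(store["z:1"], 0, 30))
print(len(store["l:12"]), len(store["s:1"]))
```

The list keeps its duplicate "hello" while the set does not, and the sorted set can be read back ordered by score, which is what distinguishes it from a plain set.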

Redis > Tools > Redis Desktop Manager
