SlideShare a Scribd company logo
1 of 36
Download to read offline
The	
  Cassandra	
  Experience	
  at	
  Orange
	
  
Project	
  PnS	
  3.0
	
  
Jean	
  Armel	
  Luce	
  
Orange	
  France/DSIF/DF/SDF	
  
V1.0	
  
Summary	
  

§ 
§ 

Our	
  migraGon	
  strategy	
  

§ 

AMer	
  the	
  migraGon	
  …	
  

§ 

AnalyGcs	
  with	
  Hadoop/Pig/Hive	
  over	
  Cassandra	
  

§ 

2

Short	
  descripGon	
  of	
  PnS.	
  Why	
  did	
  we	
  choose	
  C*	
  ?	
  

ContribuGons	
  and	
  open	
  sourced	
  modules	
  from	
  Orange	
  &	
  conclusions	
  

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Short description of PnS3
PnS – Short description

§ 

PnS means Profiles and Syndication : PnS is a highly available
service for collecting and serving live data about Orange customers

§ 

End users of PnS are :
–  Orange customers (logged to Portal www.orange.fr)
–  Sellers in Orange shops
–  Some services in Orange (advertisements, …)

4

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
PnS – The Big Picture
Millions of HTTP requests
(Rest or Soap)
Fast and highly available

WebService to get or set
data stored by pns :
- postProcessing(data1)
- postProcessing(data2)
- postProcessing(data3)
- postProcessing(datax)
- …

End users
DB Queries
R/W operations
Thousands of files
(Csv or Xml)
Scheduled data injection

PNS

Data providers
5

Jean Armel Luce - Orange-DOP-PnS 3.0

Database

Cassandra Summit Europe – October 17 2013
PnS2 – Architecture
2 DCs architecture for high availability
§ 

Until 2012, data were stored in 2
differents backends :

ü 

MySQL cluster (for volatile data)

ü 

PostGres « cluster » (sharding and
replication)
Bagnol
et

and

§ 

web services

(read and writes)

Sophia
Antipolis

for batch updates

§ 

6

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Timeline	
  –	
  Key	
  dates	
  of	
  PnS	
  3.0	
  
PNS 2

•  Study	
  phase	
  
2010
to 2012

We	
  did	
  a	
  large	
  study	
  about	
  a	
  few	
  NoSQL	
  databases	
  (Cassandra,	
  MongoDB,	
  Riak,	
  Hbase,	
  Hypertable,	
  …)	
  
è	
  We	
  chose	
  Cassandra	
  as	
  the	
  single	
  backend	
  for	
  PnS

•  Design	
  phase	
  
06/2012

09/2012

We	
  started	
  the	
  design	
  phase	
  of	
  PnS3.0	
  

•  Proof	
  Of	
  Concept	
  

We	
  started	
  a	
  1st	
  (small)	
  Cassandra	
  cluster	
  in	
  producGon	
  for	
  a	
  non	
  criGcal	
  applicaGon	
  :	
  1	
  table,	
  key	
  value	
  access	
  

•  Produc7on	
  phase	
  
04/2013

MigraGon	
  of	
  the	
  1st	
  subset	
  of	
  data	
  of	
  PnS	
  from	
  mysql	
  cluster	
  to	
  Cassandra	
  in	
  produc7on	
  

•  Complete	
  migra7on	
  
05/2013
to
12/2013

7

MigraGon	
  of	
  all	
  other	
  subsets	
  of	
  data	
  from	
  Mysql	
  cluster	
  and	
  Postgres	
  to	
  Cassandra	
  	
  
Add	
  new	
  nodes	
  in	
  the	
  cluster	
  (From	
  8	
  nodes	
  in	
  each	
  DC	
  to	
  16	
  nodes	
  in	
  each	
  DC)	
  
Add	
  a	
  3rd	
  datacenter	
  for	
  AnalyGcs	
  

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
PnS – Why did we choose Cassandra ?

§ 

Cassandra fits our requirements :
–  Very high availability

PnS2 = 99,95% availability
we want to improve it !!!

–  Low latency

20 ms < RT PnS2 web service < 150 ms
we want to improve it !!!
Higher load, higher volume next
years ? unpredictable; better scalability
brings new businesses

–  Scalability

§ 

And also :
–  Ease of use : Cassandra is easy to administrate and operate
–  Some features that I like (rack aware, CL per request, …)
–  Cassandra is very efficient for simple requests :

«	
  SELECT	
  mycol1,	
  mycol2,	
  …,	
  mycolx	
  FROM	
  mytable	
  WHERE	
  myprimarykey	
  =	
  ‘mycustomerid’	
  »	
  
8

	
  

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Our migration strategy
The	
  migraGon	
  -­‐	
  Input	
  

§ 

During the migration, we need to :
§ 
§ 

maintain (or lower) the latency during the migration

§ 

§ 

maintain a very high availability
guarantee no functional regression

Question :
§ 

10

How can we migrate the data to Cassandra without any interruption of
service ?

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The	
  migraGon	
  :	
  Step	
  by	
  step	
  processing	
  

§ 

Subdivision of data into many subsets according to many criteria :
§ 
§ 

§ 

Same source of data
Relationships between data

And then, migrate each subset 1 by 1
S ubdivision	
  
into subsets

§ 

Definition of a generic process for all the subsets
S witch	
  q ueries
to	
  C assandra	
  
for	
  the	
  s ubset

Check /validation	
  
of	
  the	
  m ig ration
11

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013

goto
1st subset

goto next
subset

Mig ration	
  
data	
  of	
  the	
  
subset
The	
  migraGon	
  :	
  Tools	
  and	
  Gps	
  
§ 

The strategy of migration is based on 2 main facilities :

mod_dup

PNS 2

An Apache module developped (and open sourced) by Orange
teams.
HTTP Req
Mod_dup can duplicate web requests, filter them on some criteria,
substitute characters (regexp), and send the duplicated requests to
another pool of web servers.
Used in order to fill legacy (relational) database and Cassandra
database simultaneously during the migration of the subset

mod_du
p

PNS 3

the timestamp management by Cassandra
Each data stored in C* is timestamped.
It is possible to set this timestamp when inserting/updating/deleting a data in Cassandra.
When Cassandra retrieves a data item, it returns the value having the most recent
timestamp.
We use this feature to distinguish the values stored before the migration started and the
values inserted during or inserted after the migration

12

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The migration : initial state

Step 0

HTTP Rest/Soap Read

HTTP Rest/Soap Write
WebServer

End users
SQL Read/
Write

PNS 2
DB

Files transfer via FTP or CFT

BatchInjector

Data providers
13

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The migration : double feed

Step 1

Duplicate HTTP update
streams from end users
mod_dup

WebServer

CQL
update

Data providers

Duplicate streams (files)
from data providers
14

Jean Armel Luce - Orange-DOP-PnS 3.0

BatchInjector

Cassandra Summit Europe – October 17 2013

PNS 3
Cassandra
DB
The migration : copy data form PnS2 to PnS3

Step 2

HTTP Write

mod_dup

WebServer

Batc

h In

jecti

on
TimeStamp	
  =	
  start	
  
	
  date	
  of	
  extraction	
  

Data providers

BatchInjector

15

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013

PNS 3
Cassandra
DB
The migration : control

Step 3
HTTP Write

mod_dup

WebServer

SQL

Synchro
Control
Data providers

CQL

BatchInjector

16

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013

PNS 3
Cassandra
DB
The migration : switch reads

Step 4
100 % read now on Cassandra

HTTP Read
requests
HTTP Write

mod_dup
HTTP Rest/Soap
Read

HTTP Rest/Soap Write
WebServer

WebServer

End users

PNS 2
DB

Files transfer via FTP or CFT

Data providers

BatchInjector

BatchInjector

17

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013

PNS 3
Cassandra
DB
The migration : stop double feed

Step 5

HTTP Read/Write
HTTPrequests
Read
requests

100 % read on Cassandra
100% write on Cassandra
for HTTP request

HTTP Write

mod_dup

HTTP Rest/Soap Write
WebServer

WebServer

End users

PNS 2
DB

Files transfer via FTP or CFT

Data providers

BatchInjector

Files transfer

100% write on Cassandra
for Data injection
18

Jean Armel Luce - Orange-DOP-PnS 3.0

BatchInjector

Cassandra Summit Europe – October 17 2013

PNS 3
Cassandra
DB
The	
  migraGon	
  
§ 

Using this procedure :
§ 

§ 

During the control phase, we can take time (a few days, a few
weeks) to check that everything is OK before switching to
Cassandra

§ 

It is possible to easily rollback the migration of a subset if errors
are found during the control phase, without losing any update

§ 

19

It is possible to switch progressively to Cassandra rather than
doing a one shot switch

§ 

§ 

We can migrate to Cassandra without any interruption of
service

Doesn’t work if the queries are not idempotent.

After the migration, we can easily duplicate production requests
(entirely or partially) and send them to a bench platform thanks to
mod_dup

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
After the migration …
The	
  latency	
  
§ 

Comparison before/after migration to Cassandra
§ 

Some graphs about the latency of the web services are very
explicit :

Service push mail

Service push webxms

dates of
migration to C*
21

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The	
  latency	
  
§ 

Read and write latencies are now in microseconds in the datanodes :

Thanks to

and

This	
  latency	
  will	
  be	
  improved	
  by	
  (tests	
  in	
  progress)	
  :	
  
ALTER	
  TABLE	
  syndic	
  WITH	
  compacGon	
  =	
  {	
  'class'	
  :	
  'LeveledCompacGonStrategy',	
  'sstable_size_in_mb'	
  :	
  ??	
  };	
  

22

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The	
  availability	
  
• 

We got a few hardware failures and network outages

• 

No impact on QoS :
• 
• 

23

no error returned by the application
no real impact on latency

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The	
  scalability	
  
• 

PnS activity is always increasing (volume of data and requests/sec)

• 

How to measure the capacity of a cluster ?
Capacity of a C* cluster = capacity of a node * number of nodes
(true if all nodes are identical)

• 

there are 2 ways to deal with the expansion of activity :
Ø 

scale up (add more resources such as CPU, disks, RAM to each
node)

Ø  scale	
  out	
  (add	
  new	
  nodes	
  in	
  the	
  cluster)	
  
24

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
The	
  scalability	
  
• 

Thanks to vnodes (available since Cassandra 1.2), it is easy to scale
out

With NetworkTopologyStrategy, make sure to distribute evenly the nodes
in the racks
25

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Analytics with Hadoop/ Pig/
Hive over Cassandra
Basic	
  architecture	
  of	
  the	
  Cassandra	
  cluster	
  
§ 

Cluster without Hadoop : 2 datacenters, 16 nodes in each DC
§ 

RF (DC1, DC2) = (3, 3)

§ 

Requests from web servers in DC1 are sent to C* nodes in DC1

§ 

Requests from web servers in DC2 are sent to C* nodes in DC2

Pool
of
web
servers
DC1

27

Jean Armel Luce - Orange-DOP-PnS 3.0

DC1

DC2

Cassandra Summit Europe – October 17 2013

Pool
of
web
servers
DC2
Architecture	
  of	
  the	
  Cassandra	
  cluster	
  with	
  the	
  datacenter	
  for	
  
analyGcs	
  

§ 

Cluster with Hadoop : 3 datacenters, 16 nodes in DC1, 16 nodes in
DC2, 4 nodes in DC3
§ 

RF (DC1, DC2, DC3) = (3, 3, 1)

§ 

§ 

We favor cheaper disks (SATA) in DC3 rather than SSDs or
FusionIo cards

§ 

28

Because RF = 1 in DC3, we shall need less storage space in this
datacenter

Works better with HSHA Thrift server (tests in progress)

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Architecture	
  of	
  the	
  Cassandra	
  cluster	
  with	
  the	
  datacenter	
  for	
  
analyGcs	
  

Pool
of
web
servers
DC1

DC1

DC2

DC3

29

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013

Pool
of
web
servers
DC2
Contributions and open
sourced modules from
Orange & conclusions
ContribuGons	
  and	
  open	
  sourced	
  modules	
  
§ 

Open sources by Orange
§ 

PHP driver for Cassandra :
https://github.com/Orange-OpenSource/YACassandraPDO
Thanks to Sandro Lex & Mathieu Lornac

§ 

Mod_dup (Migration to Cassandra)
§ https://github.com/Orange-OpenSource/mod_dup

Thanks to Jonas Wustrack & Emmanuel Courreges

§ 

Other contributions
§ 

C driver (libdbi driver) :
http://libdbi-drivers.cvs.sourceforge.net/viewvc/libdbi-drivers/libdbidrivers/?pathrev=Branch-2012-07-02-cassandra
Thanks to Emmanuel Courreges

31

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Conclusions
§ 

With Cassandra, we have improved our QoS

§ 

We are able to open our service to new opportunities

§ 

There is an ecosystem around C* (Hadoop, Hive, Pig, Storm, Shark,
…), which offers more capabilities. However, we would love to have
some of the components (Hive) integrated in C* core (as Pig)

§ 

32

PnS3 works better and hopefully cheaper than
PnS2

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Thank	
  you	
  

33

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
Questions

34

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
A	
  few	
  answers	
  about	
  hardware/OS	
  version	
  /Java	
  version/
Cassandra	
  version	
  
§ 

Hardware :
§ 

16 nodes in each DC at the end of 2013 :
§ 
§ 

1.2.2 (with a few patches backported from 1.2.3)

Java version :
§ 

35

Ubuntu Precise (12.04 LTS)

Cassandra version :
§ 

§ 

FusionIO 320 GB MLC

OS :
§ 

§ 

24 GB RAM

§ 

§ 

6 CPU Intel® Xeon® 2.00 GHz

Java7u7 : not recommended, upgrade scheduled soon

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013
A	
  few	
  answers	
  about	
  data	
  and	
  requests	
  
§ 

Data types :
§ 
§ 

elementary types : boolean, integer, string, date

§ 

collection types

§ 

§ 

Volume : 6 TB at the end of 2013

complex types : json, xml (between 1 and 20 KB)

Requests :
§ 
§ 

80% get

§ 

§ 

10.000 requests/sec at the end of 2013
20% set

Consistency level used by PnS :
§ 
§ 

36

ONE (95% of the queries)
LOCAL_QUORUM (5% of the queries)

Jean Armel Luce - Orange-DOP-PnS 3.0

Cassandra Summit Europe – October 17 2013

More Related Content

What's hot

HBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBaseHBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBaseHBaseCon
 
Webinar- Tea for the Tillerman
Webinar- Tea for the TillermanWebinar- Tea for the Tillerman
Webinar- Tea for the TillermanCumulus Networks
 
Accelerating Networked Applications with Flexible Packet Processing
Accelerating Networked Applications with Flexible Packet ProcessingAccelerating Networked Applications with Flexible Packet Processing
Accelerating Networked Applications with Flexible Packet ProcessingOpen-NFP
 
HBaseCon2017 HBase at Xiaomi
HBaseCon2017 HBase at XiaomiHBaseCon2017 HBase at Xiaomi
HBaseCon2017 HBase at XiaomiHBaseCon
 
Network simulator 2 a simulation tool for linux
Network simulator 2 a simulation tool for linuxNetwork simulator 2 a simulation tool for linux
Network simulator 2 a simulation tool for linuxPratik Joshi
 
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBaseHBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBaseHBaseCon
 
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kuberneteshbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on KubernetesHBaseCon
 
LF_DPDK17_Power Aware Packet Processing
LF_DPDK17_Power Aware Packet ProcessingLF_DPDK17_Power Aware Packet Processing
LF_DPDK17_Power Aware Packet ProcessingLF_DPDK
 
OpenContrail, Real Speed: Offloading vRouter
OpenContrail, Real Speed: Offloading vRouterOpenContrail, Real Speed: Offloading vRouter
OpenContrail, Real Speed: Offloading vRouterOpen-NFP
 
ONS Summit 2017 SKT TINA
ONS Summit 2017 SKT TINAONS Summit 2017 SKT TINA
ONS Summit 2017 SKT TINAJunho Suh
 
Distributed Postgres
Distributed PostgresDistributed Postgres
Distributed PostgresStas Kelvich
 
Principles in Data Stream Processing | Matthias J Sax, Confluent
Principles in Data Stream Processing | Matthias J Sax, ConfluentPrinciples in Data Stream Processing | Matthias J Sax, Confluent
Principles in Data Stream Processing | Matthias J Sax, ConfluentHostedbyConfluent
 
Measuring a 25 and 40Gb/s Data Plane
Measuring a 25 and 40Gb/s Data PlaneMeasuring a 25 and 40Gb/s Data Plane
Measuring a 25 and 40Gb/s Data PlaneOpen-NFP
 
Ceph Day Beijing: Big Data Analytics on Ceph Object Store
Ceph Day Beijing: Big Data Analytics on Ceph Object Store Ceph Day Beijing: Big Data Analytics on Ceph Object Store
Ceph Day Beijing: Big Data Analytics on Ceph Object Store Ceph Community
 
Apache samza past, present and future
Apache samza  past, present and futureApache samza  past, present and future
Apache samza past, present and futureEd Yakabosky
 
Apache Samza Past, Present and Future
Apache Samza  Past, Present and FutureApache Samza  Past, Present and Future
Apache Samza Past, Present and FutureKartik Paramasivam
 
HBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Improving HBase availability in a multi tenant environmentHBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Improving HBase availability in a multi tenant environmentHBaseCon
 
Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...
Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...
Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...confluent
 

What's hot (20)

HBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBaseHBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBase
 
Webinar- Tea for the Tillerman
Webinar- Tea for the TillermanWebinar- Tea for the Tillerman
Webinar- Tea for the Tillerman
 
Accelerating Networked Applications with Flexible Packet Processing
Accelerating Networked Applications with Flexible Packet ProcessingAccelerating Networked Applications with Flexible Packet Processing
Accelerating Networked Applications with Flexible Packet Processing
 
HBaseCon2017 HBase at Xiaomi
HBaseCon2017 HBase at XiaomiHBaseCon2017 HBase at Xiaomi
HBaseCon2017 HBase at Xiaomi
 
Network simulator 2 a simulation tool for linux
Network simulator 2 a simulation tool for linuxNetwork simulator 2 a simulation tool for linux
Network simulator 2 a simulation tool for linux
 
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBaseHBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
 
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kuberneteshbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
 
LF_DPDK17_Power Aware Packet Processing
LF_DPDK17_Power Aware Packet ProcessingLF_DPDK17_Power Aware Packet Processing
LF_DPDK17_Power Aware Packet Processing
 
OpenContrail, Real Speed: Offloading vRouter
OpenContrail, Real Speed: Offloading vRouterOpenContrail, Real Speed: Offloading vRouter
OpenContrail, Real Speed: Offloading vRouter
 
ONS Summit 2017 SKT TINA
ONS Summit 2017 SKT TINAONS Summit 2017 SKT TINA
ONS Summit 2017 SKT TINA
 
Way to kafka connect
Way to kafka connectWay to kafka connect
Way to kafka connect
 
Distributed Postgres
Distributed PostgresDistributed Postgres
Distributed Postgres
 
Principles in Data Stream Processing | Matthias J Sax, Confluent
Principles in Data Stream Processing | Matthias J Sax, ConfluentPrinciples in Data Stream Processing | Matthias J Sax, Confluent
Principles in Data Stream Processing | Matthias J Sax, Confluent
 
Apache Kafka at LinkedIn
Apache Kafka at LinkedInApache Kafka at LinkedIn
Apache Kafka at LinkedIn
 
Measuring a 25 and 40Gb/s Data Plane
Measuring a 25 and 40Gb/s Data PlaneMeasuring a 25 and 40Gb/s Data Plane
Measuring a 25 and 40Gb/s Data Plane
 
Ceph Day Beijing: Big Data Analytics on Ceph Object Store
Ceph Day Beijing: Big Data Analytics on Ceph Object Store Ceph Day Beijing: Big Data Analytics on Ceph Object Store
Ceph Day Beijing: Big Data Analytics on Ceph Object Store
 
Apache samza past, present and future
Apache samza  past, present and futureApache samza  past, present and future
Apache samza past, present and future
 
Apache Samza Past, Present and Future
Apache Samza  Past, Present and FutureApache Samza  Past, Present and Future
Apache Samza Past, Present and Future
 
HBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Improving HBase availability in a multi tenant environmentHBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Improving HBase availability in a multi tenant environment
 
Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...
Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...
Running large scale Kafka upgrades at Yelp (Manpreet Singh,Yelp) Kafka Summit...
 

Similar to C* Summit EU 2013: The Cassandra Experience at Orange

The Apache Cassandra ecosystem
The Apache Cassandra ecosystemThe Apache Cassandra ecosystem
The Apache Cassandra ecosystemAlex Thompson
 
Trivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin Berger
Trivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin BergerTrivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin Berger
Trivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin BergerTrivadis
 
Open vSwitch Implementation Options
Open vSwitch Implementation Options Open vSwitch Implementation Options
Open vSwitch Implementation Options Netronome
 
The Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale NetworkThe Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale NetworkOpen Networking Summits
 
Moving Towards a Streaming Architecture
Moving Towards a Streaming ArchitectureMoving Towards a Streaming Architecture
Moving Towards a Streaming ArchitectureGabriele Modena
 
Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2
Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2
Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2DataStax Academy
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...HostedbyConfluent
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...HostedbyConfluent
 
Implementation and Performance Analysis of a UDP Binding for SOAP
Implementation and Performance Analysis of a UDP Binding for SOAPImplementation and Performance Analysis of a UDP Binding for SOAP
Implementation and Performance Analysis of a UDP Binding for SOAPDr. Fahad Aijaz
 
Openlab.2014 02-13.major.vi sion
Openlab.2014 02-13.major.vi sionOpenlab.2014 02-13.major.vi sion
Openlab.2014 02-13.major.vi sionCcie Light
 
Accelerating SDN Applications with Open Source Network Overlays
Accelerating SDN Applications with Open Source Network OverlaysAccelerating SDN Applications with Open Source Network Overlays
Accelerating SDN Applications with Open Source Network OverlaysCumulus Networks
 
Dataservices - Processing Big Data The Microservice Way
Dataservices - Processing Big Data The Microservice WayDataservices - Processing Big Data The Microservice Way
Dataservices - Processing Big Data The Microservice WayJosef Adersberger
 
Pack prediction based cloud bandwidth and cost reduction system
Pack prediction based cloud bandwidth and cost reduction systemPack prediction based cloud bandwidth and cost reduction system
Pack prediction based cloud bandwidth and cost reduction systemPapitha Velumani
 
Moolle fan-out control for scalable distributed data stores
Moolle  fan-out control for scalable distributed data storesMoolle  fan-out control for scalable distributed data stores
Moolle fan-out control for scalable distributed data storesSungJu Cho
 
Feedback on Big Compute & HPC on Windows Azure
Feedback on Big Compute & HPC on Windows AzureFeedback on Big Compute & HPC on Windows Azure
Feedback on Big Compute & HPC on Windows AzureANEO
 
Qos aware data replication for data-intensive applications in cloud computing...
Qos aware data replication for data-intensive applications in cloud computing...Qos aware data replication for data-intensive applications in cloud computing...
Qos aware data replication for data-intensive applications in cloud computing...Papitha Velumani
 
WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...
WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...
WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...WebCamp
 
Beyond Ingresses - Better Traffic Management in Kubernetes
Beyond Ingresses - Better Traffic Management in KubernetesBeyond Ingresses - Better Traffic Management in Kubernetes
Beyond Ingresses - Better Traffic Management in KubernetesMark McBride
 

Similar to C* Summit EU 2013: The Cassandra Experience at Orange (20)

The Apache Cassandra ecosystem
The Apache Cassandra ecosystemThe Apache Cassandra ecosystem
The Apache Cassandra ecosystem
 
Trivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin Berger
Trivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin BergerTrivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin Berger
Trivadis TechEvent 2017 Migrating to Cloud: Capacity Management Martin Berger
 
An Optics Life
An Optics LifeAn Optics Life
An Optics Life
 
Open vSwitch Implementation Options
Open vSwitch Implementation Options Open vSwitch Implementation Options
Open vSwitch Implementation Options
 
The Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale NetworkThe Challenges of SDN/OpenFlow in an Operational and Large-scale Network
The Challenges of SDN/OpenFlow in an Operational and Large-scale Network
 
Moving Towards a Streaming Architecture
Moving Towards a Streaming ArchitectureMoving Towards a Streaming Architecture
Moving Towards a Streaming Architecture
 
Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2
Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2
Cassandra Summit 2014: The Cassandra Experience at Orange — Season 2
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
 
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
SingleStore & Kafka: Better Together to Power Modern Real-Time Data Architect...
 
Implementation and Performance Analysis of a UDP Binding for SOAP
Implementation and Performance Analysis of a UDP Binding for SOAPImplementation and Performance Analysis of a UDP Binding for SOAP
Implementation and Performance Analysis of a UDP Binding for SOAP
 
Openlab.2014 02-13.major.vi sion
Openlab.2014 02-13.major.vi sionOpenlab.2014 02-13.major.vi sion
Openlab.2014 02-13.major.vi sion
 
Accelerating SDN Applications with Open Source Network Overlays
Accelerating SDN Applications with Open Source Network OverlaysAccelerating SDN Applications with Open Source Network Overlays
Accelerating SDN Applications with Open Source Network Overlays
 
Dataservices - Processing Big Data The Microservice Way
Dataservices - Processing Big Data The Microservice WayDataservices - Processing Big Data The Microservice Way
Dataservices - Processing Big Data The Microservice Way
 
Pack prediction based cloud bandwidth and cost reduction system
Pack prediction based cloud bandwidth and cost reduction systemPack prediction based cloud bandwidth and cost reduction system
Pack prediction based cloud bandwidth and cost reduction system
 
Moolle fan-out control for scalable distributed data stores
Moolle  fan-out control for scalable distributed data storesMoolle  fan-out control for scalable distributed data stores
Moolle fan-out control for scalable distributed data stores
 
Feedback on Big Compute & HPC on Windows Azure
Feedback on Big Compute & HPC on Windows AzureFeedback on Big Compute & HPC on Windows Azure
Feedback on Big Compute & HPC on Windows Azure
 
Qos aware data replication for data-intensive applications in cloud computing...
Qos aware data replication for data-intensive applications in cloud computing...Qos aware data replication for data-intensive applications in cloud computing...
Qos aware data replication for data-intensive applications in cloud computing...
 
WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...
WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...
WebCamp 2016: DevOps. Николай Дойков: Опыт создания клауда для потокового вид...
 
TransPAC3/ACE Measurement & PerfSONAR Update
TransPAC3/ACE Measurement & PerfSONAR UpdateTransPAC3/ACE Measurement & PerfSONAR Update
TransPAC3/ACE Measurement & PerfSONAR Update
 
Beyond Ingresses - Better Traffic Management in Kubernetes
Beyond Ingresses - Better Traffic Management in KubernetesBeyond Ingresses - Better Traffic Management in Kubernetes
Beyond Ingresses - Better Traffic Management in Kubernetes
 

More from DataStax Academy

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftDataStax Academy
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseDataStax Academy
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraDataStax Academy
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsDataStax Academy
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingDataStax Academy
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackDataStax Academy
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache CassandraDataStax Academy
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready CassandraDataStax Academy
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonDataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1DataStax Academy
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2DataStax Academy
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First ClusterDataStax Academy
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with DseDataStax Academy
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraDataStax Academy
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseDataStax Academy
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraDataStax Academy
 

More from DataStax Academy (20)

Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craftForrester CXNYC 2017 - Delivering great real-time cx is a true craft
Forrester CXNYC 2017 - Delivering great real-time cx is a true craft
 
Introduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph DatabaseIntroduction to DataStax Enterprise Graph Database
Introduction to DataStax Enterprise Graph Database
 
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache CassandraIntroduction to DataStax Enterprise Advanced Replication with Apache Cassandra
Introduction to DataStax Enterprise Advanced Replication with Apache Cassandra
 
Cassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart LabsCassandra on Docker @ Walmart Labs
Cassandra on Docker @ Walmart Labs
 
Cassandra 3.0 Data Modeling
Cassandra 3.0 Data ModelingCassandra 3.0 Data Modeling
Cassandra 3.0 Data Modeling
 
Cassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stackCassandra Adoption on Cisco UCS & Open stack
Cassandra Adoption on Cisco UCS & Open stack
 
Data Modeling for Apache Cassandra
Data Modeling for Apache CassandraData Modeling for Apache Cassandra
Data Modeling for Apache Cassandra
 
Coursera Cassandra Driver
Coursera Cassandra DriverCoursera Cassandra Driver
Coursera Cassandra Driver
 
Production Ready Cassandra
Production Ready CassandraProduction Ready Cassandra
Production Ready Cassandra
 
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & PythonCassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
Cassandra @ Netflix: Monitoring C* at Scale, Gossip and Tickler & Python
 
Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1Cassandra @ Sony: The good, the bad, and the ugly part 1
Cassandra @ Sony: The good, the bad, and the ugly part 1
 
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
 
Standing Up Your First Cluster
Standing Up Your First ClusterStanding Up Your First Cluster
Standing Up Your First Cluster
 
Real Time Analytics with Dse
Real Time Analytics with DseReal Time Analytics with Dse
Real Time Analytics with Dse
 
Introduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache CassandraIntroduction to Data Modeling with Apache Cassandra
Introduction to Data Modeling with Apache Cassandra
 
Cassandra Core Concepts
Cassandra Core ConceptsCassandra Core Concepts
Cassandra Core Concepts
 
Enabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax EnterpriseEnabling Search in your Cassandra Application with DataStax Enterprise
Enabling Search in your Cassandra Application with DataStax Enterprise
 
Bad Habits Die Hard
Bad Habits Die Hard Bad Habits Die Hard
Bad Habits Die Hard
 
Advanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache CassandraAdvanced Data Modeling with Apache Cassandra
Advanced Data Modeling with Apache Cassandra
 
Advanced Cassandra
Advanced CassandraAdvanced Cassandra
Advanced Cassandra
 

Recently uploaded

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 

C* Summit EU 2013: The Cassandra Experience at Orange

  • 1. The  Cassandra  Experience  at  Orange   Project  PnS  3.0   Jean  Armel  Luce   Orange  France/DSIF/DF/SDF   V1.0  
  • 2. Summary   §  §  Our  migraGon  strategy   §  AMer  the  migraGon  …   §  AnalyGcs  with  Hadoop/Pig/Hive  over  Cassandra   §  2 Short  descripGon  of  PnS.  Why  did  we  choose  C*  ?   ContribuGons  and  open  sourced  modules  from  Orange  &  conclusions   Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 4. PnS – Short description §  PnS means Profiles and Syndication : PnS is a highly available service for collecting and serving live data about Orange customers §  End users of PnS are : –  Orange customers (logged to Portal www.orange.fr) –  Sellers in Orange shops –  Some services in Orange (advertisements, …) 4 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 5. PnS – The Big Picture Millions of HTTP requests (Rest or Soap) Fast and highly available WebService to get or set data stored by pns : - postProcessing(data1) - postProcessing(data2) - postProcessing(data3) - postProcessing(datax) - … End users DB Queries R/W operations Thousands of files (Csv or Xml) Scheduled data injection PNS Data providers 5 Jean Armel Luce - Orange-DOP-PnS 3.0 Database Cassandra Summit Europe – October 17 2013
  • 6. PnS2 – Architecture 2 DCs architecture for high availability §  Until 2012, data were stored in 2 differents backends : ü  MySQL cluster (for volatile data) ü  PostGres « cluster » (sharding and replication) Bagnol et and §  web services (read and writes) Sophia Antipolis for batch updates §  6 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 7. Timeline  –  Key  dates  of  PnS  3.0   PNS 2 •  Study  phase   2010 to 2012 We  did  a  large  study  about  a  few  NoSQL  databases  (Cassandra,  MongoDB,  Riak,  Hbase,  Hypertable,  …)   è  We  chose  Cassandra  as  the  single  backend  for  PnS •  Design  phase   06/2012 09/2012 We  started  the  design  phase  of  PnS3.0   •  Proof  Of  Concept   We  started  a  1st  (small)  Cassandra  cluster  in  producGon  for  a  non  criGcal  applicaGon  :  1  table,  key  value  access   •  Produc7on  phase   04/2013 MigraGon  of  the  1st  subset  of  data  of  PnS  from  mysql  cluster  to  Cassandra  in  produc7on   •  Complete  migra7on   05/2013 to 12/2013 7 MigraGon  of  all  other  subsets  of  data  from  Mysql  cluster  and  Postgres  to  Cassandra     Add  new  nodes  in  the  cluster  (From  8  nodes  in  each  DC  to  16  nodes  in  each  DC)   Add  a  3rd  datacenter  for  AnalyGcs   Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 8. PnS – Why did we choose Cassandra ? §  Cassandra fits our requirements : –  Very high availability PnS2 = 99,95% availability we want to improve it !!! –  Low latency 20 ms < RT PnS2 web service < 150 ms we want to improve it !!! Higher load, higher volume next years ? unpredictable; better scalability brings new businesses –  Scalability §  And also : –  Ease of use : Cassandra is easy to administrate and operate –  Some features that I like (rack aware, CL per request, …) –  Cassandra is very efficient for simple requests : «  SELECT  mycol1,  mycol2,  …,  mycolx  FROM  mytable  WHERE  myprimarykey  =  ‘mycustomerid’  »   8   Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 10. The  migraGon  -­‐  Input   §  During the migration, we need to : §  §  maintain (or lower) the latency during the migration §  §  maintain a very high availability guarantee no functional regression Question : §  10 How can we migrate the data to Cassandra without any interruption of service ? Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 11. The  migraGon  :  Step  by  step  processing   §  Subdivision of data into many subsets according to many criteria : §  §  §  Same source of data Relationships between data And then, migrate each subset 1 by 1 S ubdivision   into subsets §  Definition of a generic process for all the subsets S witch  q ueries to  C assandra   for  the  s ubset Check /validation   of  the  m ig ration 11 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013 goto 1st subset goto next subset Mig ration   data  of  the   subset
  • 12. The  migraGon  :  Tools  and  Gps   §  The strategy of migration is based on 2 main facilities : mod_dup PNS 2 An Apache module developped (and open sourced) by Orange teams. HTTP Req Mod_dup can duplicate web requests, filter them on some criteria, substitute characters (regexp), and send the duplicated requests to another pool of web servers. Used in order to fill legacy (relational) database and Cassandra database simultaneously during the migration of the subset mod_du p PNS 3 the timestamp management by Cassandra Each data stored in C* is timestamped. It is possible to set this timestamp when inserting/updating/deleting a data in Cassandra. When Cassandra retrieves a data item, it returns the value having the most recent timestamp. We use this feature to distinguish the values stored before the migration started and the values inserted during or inserted after the migration 12 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 13. The migration : initial state Step 0 HTTP Rest/Soap Read HTTP Rest/Soap Write WebServer End users SQL Read/ Write PNS 2 DB Files transfer via FTP or CFT BatchInjector Data providers 13 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 14. The migration : double feed Step 1 Duplicate HTTP update streams from end users mod_dup WebServer CQL update Data providers Duplicate streams (files) from data providers 14 Jean Armel Luce - Orange-DOP-PnS 3.0 BatchInjector Cassandra Summit Europe – October 17 2013 PNS 3 Cassandra DB
  • 15. The migration : copy data form PnS2 to PnS3 Step 2 HTTP Write mod_dup WebServer Batc h In jecti on TimeStamp  =  start    date  of  extraction   Data providers BatchInjector 15 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013 PNS 3 Cassandra DB
  • 16. The migration : control Step 3 HTTP Write mod_dup WebServer SQL Synchro Control Data providers CQL BatchInjector 16 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013 PNS 3 Cassandra DB
  • 17. The migration : switch reads Step 4 100 % read now on Cassandra HTTP Read requests HTTP Write mod_dup HTTP Rest/Soap Read HTTP Rest/Soap Write WebServer WebServer End users PNS 2 DB Files transfer via FTP or CFT Data providers BatchInjector BatchInjector 17 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013 PNS 3 Cassandra DB
  • 18. The migration : stop double feed Step 5 HTTP Read/Write HTTPrequests Read requests 100 % read on Cassandra 100% write on Cassandra for HTTP request HTTP Write mod_dup HTTP Rest/Soap Write WebServer WebServer End users PNS 2 DB Files transfer via FTP or CFT Data providers BatchInjector Files transfer 100% write on Cassandra for Data injection 18 Jean Armel Luce - Orange-DOP-PnS 3.0 BatchInjector Cassandra Summit Europe – October 17 2013 PNS 3 Cassandra DB
  • 19. The  migraGon   §  Using this procedure : §  §  During the control phase, we can take time (a few days, a few weeks) to check that everything is OK before switching to Cassandra §  It is possible to easily rollback the migration of a subset if errors are found during the control phase, without losing any update §  19 It is possible to switch progressively to Cassandra rather than doing a one shot switch §  §  We can migrate to Cassandra without any interruption of service Doesn’t work if the queries are not idempotent. After the migration, we can easily duplicate production requests (entirely or partially) and send them to a bench platform thanks to mod_dup Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 21. The  latency   §  Comparison before/after migration to Cassandra §  Some graphs about the latency of the web services are very explicit : Service push mail Service push webxms dates of migration to C* 21 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 22. The  latency   §  Read and write latencies are now in microseconds in the datanodes : Thanks to and This  latency  will  be  improved  by  (tests  in  progress)  :   ALTER  TABLE  syndic  WITH  compacGon  =  {  'class'  :  'LeveledCompacGonStrategy',  'sstable_size_in_mb'  :  ??  };   22 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 23. The  availability   •  We got a few hardware failures and network outages •  No impact on QoS : •  •  23 no error returned by the application no real impact on latency Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 24. The  scalability   •  PnS activity is always increasing (volume of data and requests/sec) •  How to measure the capacity of a cluster ? Capacity of a C* cluster = capacity of a node * number of nodes (true if all nodes are identical) •  there are 2 ways to deal with the expansion of activity : Ø  scale up (add more resources such as CPU, disks, RAM to each node) Ø  scale  out  (add  new  nodes  in  the  cluster)   24 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 25. The  scalability   •  Thanks to vnodes (available since Cassandra 1.2), it is easy to scale out With NetworkTopologyStrategy, make sure to distribute evenly the nodes in the racks 25 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 26. Analytics with Hadoop/ Pig/ Hive over Cassandra
  • 27. Basic  architecture  of  the  Cassandra  cluster   §  Cluster without Hadoop : 2 datacenters, 16 nodes in each DC §  RF (DC1, DC2) = (3, 3) §  Requests from web servers in DC1 are sent to C* nodes in DC1 §  Requests from web servers in DC2 are sent to C* nodes in DC2 Pool of web servers DC1 27 Jean Armel Luce - Orange-DOP-PnS 3.0 DC1 DC2 Cassandra Summit Europe – October 17 2013 Pool of web servers DC2
  • 28. Architecture  of  the  Cassandra  cluster  with  the  datacenter  for   analyGcs   §  Cluster with Hadoop : 3 datacenters, 16 nodes in DC1, 16 nodes in DC2, 4 nodes in DC3 §  RF (DC1, DC2, DC3) = (3, 3, 1) §  §  We favor cheaper disks (SATA) in DC3 rather than SSDs or FusionIo cards §  28 Because RF = 1 in DC3, we shall need less storage space in this datacenter Works better with HSHA Thrift server (tests in progress) Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 29. Architecture  of  the  Cassandra  cluster  with  the  datacenter  for   analyGcs   Pool of web servers DC1 DC1 DC2 DC3 29 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013 Pool of web servers DC2
  • 30. Contributions and open sourced modules from Orange & conclusions
  • 31. ContribuGons  and  open  sourced  modules   §  Open sources by Orange §  PHP driver for Cassandra : https://github.com/Orange-OpenSource/YACassandraPDO Thanks to Sandro Lex & Mathieu Lornac §  Mod_dup (Migration to Cassandra) § https://github.com/Orange-OpenSource/mod_dup Thanks to Jonas Wustrack & Emmanuel Courreges §  Other contributions §  C driver (libdbi driver) : http://libdbi-drivers.cvs.sourceforge.net/viewvc/libdbi-drivers/libdbidrivers/?pathrev=Branch-2012-07-02-cassandra Thanks to Emmanuel Courreges 31 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 32. Conclusions §  With Cassandra, we have improved our QoS §  We are able to open our service to new opportunities §  There is an ecosystem around C* (Hadoop, Hive, Pig, Storm, Shark, …), which offers more capabilities. However, we would love to have some of the components (Hive) integrated in C* core (as Pig) §  32 PnS3 works better and hopefully cheaper than PnS2 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 33. Thank  you   33 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 34. Questions 34 Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 35. A  few  answers  about  hardware/OS  version  /Java  version/ Cassandra  version   §  Hardware : §  16 nodes in each DC at the end of 2013 : §  §  1.2.2 (with a few patches backported from 1.2.3) Java version : §  35 Ubuntu Precise (12.04 LTS) Cassandra version : §  §  FusionIO 320 GB MLC OS : §  §  24 GB RAM §  §  6 CPU Intel® Xeon® 2.00 GHz Java7u7 : not recommended, upgrade scheduled soon Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013
  • 36. A  few  answers  about  data  and  requests   §  Data types : §  §  elementary types : boolean, integer, string, date §  collection types §  §  Volume : 6 TB at the end of 2013 complex types : json, xml (between 1 and 20 KB) Requests : §  §  80% get §  §  10.000 requests/sec at the end of 2013 20% set Consistency level used by PnS : §  §  36 ONE (95% of the queries) LOCAL_QUORUM (5% of the queries) Jean Armel Luce - Orange-DOP-PnS 3.0 Cassandra Summit Europe – October 17 2013