Mongo DB Athens user group replication and high availability

Replication and High Availability

MongoDB Athens User Group
Athens, Greece 16/1/2013
Alex Giamas
alexandergiamas@yahoo.com

History

● Oracle shop
● Non existent OLAP
● Queries in live DB

(c) Alex Giamas, Persado Inc, All rights reserved

Initial investigation

● CouchDB
● Riak
● Hbase
● Cassandra
● MongoDB
● Voldemort


History

● Missed all the fun while being in the States
● Hadoop / HBase / llama / Pig
● Sharded MySQL
● Voldemort
● Huge RAC deployments
● Following MongoDB since 1.4 (Replica sets? Nah..
Sharding?..alpha)

Reporting and Analytics
● Settled on MongoDB
● Document oriented
● No clue about schema at the time
● No clue about what are we going to do with client data


Prototyping, version 0.5

● MT, MO, User collections
● Sync Map Reduce for reporting
● One DB to rule them all


Show stoppers

● Single server deployment
● Global write lock
● MR in real time


Results

● Demo Christmas Eve 2010


Results

● Demo Christmas Eve 2010

● Slow....


Reporting, version 1.0

● Spring Batch for async computations
● Quartz scheduler firing every 3 minutes
● Separate nodes for OLTP and OLAP Dbs
● Custom cloneCollection()


Real world kicks in

● Everything designed for online integration
● Huge client coming in offering offline integration
● Ride the cloud wagon!


Reporting Version 2 (the real world)
● Files coming in via FTP containing all sorts of time
inconsistencies
● No longer a linear timeline of events, more like a soup of
results


MongoDB on EC2
● 2 replica sets of 2 nodes+arb
● Arbiters crossed wrt replica sets
● Third node could be different availability zone


Replica Sets Configuration



“server seen down”



● Can afford 1 failure with fully functional cluster
● Can afford 2 failures with partially functional cluster**

** Terms and conditions may apply



● Rolling upgrades without DB downtime
● Schemaless, document oriented offers great flexibility in
application terms



● Unix level tweaks:
– raise ulimit
– raise tcp timeout
– Noatime nodiratime
– XFS, ext4
– LVM for snapshotting
● Mongo level tips:
– Use journaling. USE JOURNALING



EC2 specific tips:
– Can and will steal back time, plan for it
– Can get flaky at times..
– Design around EBS



● EC2 storage:
– Local storage. Ephemeral
– EBS storage. Lasts but not strong durability guarantees.
– S3 storage. Lasts more, slower



● Settled for EBS storage.
● Nightly backups, 30 day window


Reporting Version 3

● Aggregation Framework effort led by Chris Westin
– Simpler way to perform Map Reduce jobs without all the pain of JS
– Integrates cleanly with our business logic
● Initial design on sharding
– More on that next..


Reporting Version 3

● Aggregation framework for both storing and retrieving
aggregate data
– New collection for double checking results with MR.
● Faster, simpler, most of the times fits in our problem domain.

● Worked better in dev than production versions ;)


Reporting Version 3

● More fine grained write semantics.
– WriteConcern.SAFE for most write queries
– .REPLICAS_SAFE for non idempotent queries that are costly to
recompute
● Do you feel lucky punk?
– Reactive Mongo
● Asynchronous & Non-Blocking Scala Driver for MongoDB
– Brings the best of WriteConcern.SAFE and WriteConcern.NORMAL


Replication and High Availability Take aways

● Use delayed members
● Size your oplog
● Use writeconcern and readpreference to balance between
providing fresh data and overloading servers
● Failover happens automagically but not instantaneously
● Think your security model



● More important: Think who has access to your systems.
– No commit, no rollback
● Prepare people for change
– Educate non engineers
– Use morphia



● Audit – audit – audit
– Monitor closely your MongoDB servers for potential bottlenecks
● mms.10gen.com great tool to do so
– Github is your friend:
● https://github.com/mongolab/dex


Q&A

Ask me anything...
or drop me a line:
alexandergiamas@yahoo.com
alexandros.giamas@persado.com


Mongo DB Athens user group replication and high availability

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Destacado

Destacado (10)

Similar a Mongo DB Athens user group replication and high availability

Similar a Mongo DB Athens user group replication and high availability (20)

Último

Último (20)

Mongo DB Athens user group replication and high availability

Notas del editor