Here are the slides from Joseph Lynch's PuppetConf 2016 presentation called Automating Datastore Fleets with Puppet. Watch the videos at https://www.youtube.com/playlist?list=PLV86BgbREluVjwwt-9UL8u2Uy8xnzpIqa
7. Logic
Web Search Ads API Logging Svc 5 Platform
Data
MySQL Lucene Cass ... Kafka DB 5 Zookeeper
...
...
8. Logic
Web Search Ads API Logging Svc 5 Platform
Data
MySQL Lucene Cass ... Kafka DB 5 Zookeeper
...
...
It’s Complicated
9. MUST support
● Different datastores
● Different configurations
● Cloud and datacenter ready
● Ready for production
● Cope with Data Gravity
Yelp’s Datastore Problem
33. Using Puppet to Manage Datastores
● About 100 clusters
● Over 500 datastore nodes
● Puppet runs take ~5 minutes
● Run Puppet every hour, CI/CD
○ Automation runs via separate scheduler