SlideShare a Scribd company logo
1 of 25
Download to read offline
Laine Campbell, Owner/Principal, laine@palominodb.com
Charlie Killian, Director of Engineering, charlie@palominodb.com

Scaling and Performance for
Operational Excellence
Who we are

●   A boutique consultancy offering custom solutions.

●   An operations support team providing a combined
    100+ years of experience in distributed, performant
    and scalable solutions.

●   A team of architects, engineers and operators who
    have worked at some of the most trafficked sites,
    games and companies since 1999.



                                                          *
Operational Excellence


●   Configuration management and documentation.
●   Change management.
●   Availability management.
●   Incident and problem management
●   Backup, recovery and business continuity.
●   Monitoring and Trending.




                                                  *
Configuration Management

●   Consistent couchbase configurations.
     ○ Guis are great, but don't meet automation needs.



●   Self documenting environments.

●   Incorporating your infrastructure into your application
    to leverage couchbase ease of scale.

●   Chef, puppet, ansible or "roll your own" using the
    couchbase API.

                                                              *
Change and Release
              Management
●   Schemaless is great, but data governance is key.

●   Your code needs to build a data dictionary or
    confusion reigns.

●   DevOps style relationships build collaboration that
    can overcome the wild west mentality of schemaless
    environments.



                                                          *
Availability Management

●   Moxi provides availability during node failures,
    supporting reads and writes.

●   XDCR support in Couchbase 2.0 provides availability
    across datacenters and regions in an active/active
    topology.

●   Special consideration in cloud environments must
    take into account AZ and region failovers.



                                                          *
Incident and Problem
             Management
●   While not Couchbase specific, crucial to maintaining
    any highly available architecture.
●   Appropriate alerting, response and communication
    processes ensure that isolated issues don't cascade
    into massive failures.
●   Failing hardware, networks, design issues can all
    cause failures that can cascade into an entire cluster
    being down.
●   Tracking recurring problems help with a continuous
    improvement on meeting SLAs.



                                                             *
Backup and Recovery

●   Define your recovery SLAs.
●   Track how long backups take.
●   Test restores and track how long they take.
●   Recognize all failure scenarios:
     ○ Node failure
     ○ Physical data corruption
     ○ Logical data corruption
     ○ Audits and forensics




                                                  *
Backup and Recovery 1.8


●   In 1.8, per node backup is supported. Replica sets
    are also backed-up, which can cause long, or non-
    completing backups.
●   SQLite3 can be used as a logical dump to ease
    backups.
●   Cluster-wide consistency can not be guaranteed.
●   No incremental backups available.




                                                         *
Backup and Recovery 2.0

●   Cluster wide backups are now available, as well as
    incremental.

●   EBS snapshots (or LVM, hardware, etc...) work well
    due to log-style writes to disk.

●   With incremental, it is easier to meet SLAs without
    breaking the bank on storage.



                                                          *
Monitoring and Alerting

●   Use logs! Centralized syslogs, splunk, custom
    scripts to identify and track error types and rates.
●   Track your app! Latency of web pages, forms and
    api-calls are key indicators.
●   Define key alerts, make them actionable and tied to
    documentation.
●   Palomino builds plugins and templates to provide
    proper alerts that are useful and work!




                                                           *
Trending and Diagnostics

●   Alerts aren't enough, you must track usage and
    internal metrics to understand trends, workloads and
    bottlenecks.
●   Graph everything! All exposed metrics, trend health
    checks.
●   Interleave graphs from internal metrics to external
    factors: Code pushes, application metrics (logins,
    purchases, api calls)




                                                           *
Care and Feeding

●   Regular performance reviews.
●   Defragmentation.
●   Incorporate recovery tests into building test and dev
    environments.
●   Scale-up/Scale-down, preferably via automated
    processes.
●   Rolling upgrades.
●   Coffee, pie, beer.




                                                            *
Partnering with Couchbase

Providing remote Architecture, Engineering and DBA
                 services to clients.
 Vendor neutral operations and scaling expertise for
      Couchbase clients in need of operators.




                                                       *
Remote Architecture and
       Engineering Services

●   Architecture review and recommendations
●   Data modeling
●   Data model migration
●   Data migration
●   Cluster sizing
●   Tools development




                                              *
DBA and Operations Services

●   Infrastructure builds and management
●   Proactive operational support
●   24x7 operational support with 30 minutes SLA
●   System health checks
●   Backup and recovery
●   Tuning for performance and scale
●   Query reviews, indexing, benchmarking
●   Capacity reviews




                                                   *
How we can help



● Support your proof of concept
● Migrate you to Couchbase Server
● Support your Couchbase Server clusters




                                           *
Is Couchbase Server a good fit?



●   Architecture review
●   Data model review
●   Recommendation on moving to Couchbase Server
●   Data access best practices




                                                   *
Migrating from a RDBMS to Couchbase
               Server?



● Data model migration from relational to document
● Data migration from SQL Server to Couchbase
  Server
● Couchbase Server cluster sizing
● Infrastructure builds




                                                     *
Do you need operational experts?




●   24x7 operational support with 30 minutes SLA
●   Multiple Couchbase Server 1.8 clusters
●   Wanted Couchbase operational experts
●   Escalate to Couchbase for software support




                                                   *
Contact Info


Laine Campbell, laine@palominodb.com
Charlie Killian, charlie@palominodb.com

        www.palominodb.com
       @palominodb on Twitter




                                          *

More Related Content

What's hot

What is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios CoreWhat is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios CoreSanjay Willie
 
EnterpriseDB BackUp and Recovery Tool
EnterpriseDB BackUp and Recovery ToolEnterpriseDB BackUp and Recovery Tool
EnterpriseDB BackUp and Recovery ToolEDB
 
MyHeritage backend group - build to scale
MyHeritage backend group - build to scaleMyHeritage backend group - build to scale
MyHeritage backend group - build to scaleRan Levy
 
Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]
Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]
Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]Fanky Christian
 
Webinar Slides: Geo-Distributed MySQL Clustering Done Right!
Webinar Slides: Geo-Distributed MySQL Clustering Done Right!Webinar Slides: Geo-Distributed MySQL Clustering Done Right!
Webinar Slides: Geo-Distributed MySQL Clustering Done Right!Continuent
 
Share point disaster avoidance architecture for large scale enterprises
Share point disaster avoidance architecture for large scale enterprisesShare point disaster avoidance architecture for large scale enterprises
Share point disaster avoidance architecture for large scale enterprisesSentri
 
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...NetApp
 
Top 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentTop 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentEDB
 
(ATS4-PLAT06) Considerations for sizing and deployment
(ATS4-PLAT06) Considerations for sizing and deployment(ATS4-PLAT06) Considerations for sizing and deployment
(ATS4-PLAT06) Considerations for sizing and deploymentBIOVIA
 
SANJAY M JADHAV _May 28th Resume
SANJAY M JADHAV _May 28th ResumeSANJAY M JADHAV _May 28th Resume
SANJAY M JADHAV _May 28th Resumesanjay jadhav
 
Quick and dirty performance analysis
Quick and dirty performance analysisQuick and dirty performance analysis
Quick and dirty performance analysisChris Kernaghan
 
Best Practices for a Complete Postgres Enterprise Architecture Setup
Best Practices for a Complete Postgres Enterprise Architecture SetupBest Practices for a Complete Postgres Enterprise Architecture Setup
Best Practices for a Complete Postgres Enterprise Architecture SetupEDB
 
MinervaDB corporate profile
MinervaDB corporate profileMinervaDB corporate profile
MinervaDB corporate profileMinervaDB
 
2016 NCS ASE short
2016 NCS ASE short2016 NCS ASE short
2016 NCS ASE shortSteve Stuck
 
Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015Vinay Mistry
 

What's hot (20)

CV_Sudhindra Srinivasamurthy
CV_Sudhindra SrinivasamurthyCV_Sudhindra Srinivasamurthy
CV_Sudhindra Srinivasamurthy
 
What is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios CoreWhat is Nagios XI and how is it different from Nagios Core
What is Nagios XI and how is it different from Nagios Core
 
EnterpriseDB BackUp and Recovery Tool
EnterpriseDB BackUp and Recovery ToolEnterpriseDB BackUp and Recovery Tool
EnterpriseDB BackUp and Recovery Tool
 
MyHeritage backend group - build to scale
MyHeritage backend group - build to scaleMyHeritage backend group - build to scale
MyHeritage backend group - build to scale
 
Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]
Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]
Nagios core vs. nagios xi presentation power point.pptx [diperbaiki]
 
Vivek Resume
Vivek ResumeVivek Resume
Vivek Resume
 
Webinar Slides: Geo-Distributed MySQL Clustering Done Right!
Webinar Slides: Geo-Distributed MySQL Clustering Done Right!Webinar Slides: Geo-Distributed MySQL Clustering Done Right!
Webinar Slides: Geo-Distributed MySQL Clustering Done Right!
 
Share point disaster avoidance architecture for large scale enterprises
Share point disaster avoidance architecture for large scale enterprisesShare point disaster avoidance architecture for large scale enterprises
Share point disaster avoidance architecture for large scale enterprises
 
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
TechTarget Event - Storage Architectures for the Modern Data Center - Howard ...
 
Top 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres DeploymentTop 10 Tips for an Effective Postgres Deployment
Top 10 Tips for an Effective Postgres Deployment
 
Garry_Chatman
Garry_ChatmanGarry_Chatman
Garry_Chatman
 
June09-2016
June09-2016June09-2016
June09-2016
 
(ATS4-PLAT06) Considerations for sizing and deployment
(ATS4-PLAT06) Considerations for sizing and deployment(ATS4-PLAT06) Considerations for sizing and deployment
(ATS4-PLAT06) Considerations for sizing and deployment
 
SANJAY M JADHAV _May 28th Resume
SANJAY M JADHAV _May 28th ResumeSANJAY M JADHAV _May 28th Resume
SANJAY M JADHAV _May 28th Resume
 
Quick and dirty performance analysis
Quick and dirty performance analysisQuick and dirty performance analysis
Quick and dirty performance analysis
 
Best Practices for a Complete Postgres Enterprise Architecture Setup
Best Practices for a Complete Postgres Enterprise Architecture SetupBest Practices for a Complete Postgres Enterprise Architecture Setup
Best Practices for a Complete Postgres Enterprise Architecture Setup
 
MinervaDB corporate profile
MinervaDB corporate profileMinervaDB corporate profile
MinervaDB corporate profile
 
2016 NCS ASE short
2016 NCS ASE short2016 NCS ASE short
2016 NCS ASE short
 
kaushal resume1
kaushal resume1kaushal resume1
kaushal resume1
 
Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015
 

Similar to CouchConf SF 2012 Lightning Talk - Operational Excellence

Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...Continuent
 
Modern MySQL Monitoring and Dashboards.
Modern MySQL Monitoring and Dashboards.Modern MySQL Monitoring and Dashboards.
Modern MySQL Monitoring and Dashboards.Mydbops
 
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...DevOps.com
 
The Rocky Cloud Road
The Rocky Cloud RoadThe Rocky Cloud Road
The Rocky Cloud RoadGert Drapers
 
Amit Kumar_Resume
Amit Kumar_ResumeAmit Kumar_Resume
Amit Kumar_ResumeAmit Kumar
 
Designing for operability and managability
Designing for operability and managabilityDesigning for operability and managability
Designing for operability and managabilityGaurav Bahrani
 
Technology insights: Decision Science Platform
Technology insights: Decision Science PlatformTechnology insights: Decision Science Platform
Technology insights: Decision Science PlatformDecision Science Community
 
SaaS - Software as a Service - Charles University - Prague - March 2013
SaaS - Software as a Service - Charles University - Prague - March 2013SaaS - Software as a Service - Charles University - Prague - March 2013
SaaS - Software as a Service - Charles University - Prague - March 2013Jaroslav Gergic
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutionssolarisyougood
 
Vinoth_Perumal_Datawarehousing
Vinoth_Perumal_DatawarehousingVinoth_Perumal_Datawarehousing
Vinoth_Perumal_Datawarehousingvinoth perumal
 
Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j Neo4j
 
Getting to Walk with DevOps
Getting to Walk with DevOpsGetting to Walk with DevOps
Getting to Walk with DevOpsEklove Mohan
 
The architecture of SkySQL
The architecture of SkySQLThe architecture of SkySQL
The architecture of SkySQLMariaDB plc
 
Getting more into GCP.pdf
Getting more into GCP.pdfGetting more into GCP.pdf
Getting more into GCP.pdfKnoldus Inc.
 
DesignMind SQL Server 2008 Migration
DesignMind SQL Server 2008 MigrationDesignMind SQL Server 2008 Migration
DesignMind SQL Server 2008 MigrationMark Ginnebaugh
 

Similar to CouchConf SF 2012 Lightning Talk - Operational Excellence (20)

Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
Marketing Automation at Scale: How Marketo Solved Key Data Management Challen...
 
Modern MySQL Monitoring and Dashboards.
Modern MySQL Monitoring and Dashboards.Modern MySQL Monitoring and Dashboards.
Modern MySQL Monitoring and Dashboards.
 
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
Microservices at Scale: How to Reduce Overhead and Increase Developer Product...
 
The Rocky Cloud Road
The Rocky Cloud RoadThe Rocky Cloud Road
The Rocky Cloud Road
 
Amit Kumar_Resume
Amit Kumar_ResumeAmit Kumar_Resume
Amit Kumar_Resume
 
Designing for operability and managability
Designing for operability and managabilityDesigning for operability and managability
Designing for operability and managability
 
NVReddy
NVReddyNVReddy
NVReddy
 
Technology insights: Decision Science Platform
Technology insights: Decision Science PlatformTechnology insights: Decision Science Platform
Technology insights: Decision Science Platform
 
ChaitanyaPrati
ChaitanyaPratiChaitanyaPrati
ChaitanyaPrati
 
SaaS - Software as a Service - Charles University - Prague - March 2013
SaaS - Software as a Service - Charles University - Prague - March 2013SaaS - Software as a Service - Charles University - Prague - March 2013
SaaS - Software as a Service - Charles University - Prague - March 2013
 
Oracle big data appliance and solutions
Oracle big data appliance and solutionsOracle big data appliance and solutions
Oracle big data appliance and solutions
 
Vinoth_Perumal_Datawarehousing
Vinoth_Perumal_DatawarehousingVinoth_Perumal_Datawarehousing
Vinoth_Perumal_Datawarehousing
 
Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j Ultime Novità di Prodotto Neo4j
Ultime Novità di Prodotto Neo4j
 
SubbaReddy dba Resume
SubbaReddy dba ResumeSubbaReddy dba Resume
SubbaReddy dba Resume
 
Getting to Walk with DevOps
Getting to Walk with DevOpsGetting to Walk with DevOps
Getting to Walk with DevOps
 
The architecture of SkySQL
The architecture of SkySQLThe architecture of SkySQL
The architecture of SkySQL
 
rakesh_resume
rakesh_resumerakesh_resume
rakesh_resume
 
Getting more into GCP.pdf
Getting more into GCP.pdfGetting more into GCP.pdf
Getting more into GCP.pdf
 
I one Service Offerings
I one Service OfferingsI one Service Offerings
I one Service Offerings
 
DesignMind SQL Server 2008 Migration
DesignMind SQL Server 2008 MigrationDesignMind SQL Server 2008 Migration
DesignMind SQL Server 2008 Migration
 

More from Laine Campbell

Recruiting for diversity in tech
Recruiting for diversity in techRecruiting for diversity in tech
Recruiting for diversity in techLaine Campbell
 
Velocity pythian operational visibility
Velocity pythian operational visibilityVelocity pythian operational visibility
Velocity pythian operational visibilityLaine Campbell
 
Pythian operational visibility
Pythian operational visibilityPythian operational visibility
Pythian operational visibilityLaine Campbell
 
Scaling MySQL in Amazon Web Services
Scaling MySQL in Amazon Web ServicesScaling MySQL in Amazon Web Services
Scaling MySQL in Amazon Web ServicesLaine Campbell
 
RDS for MySQL, No BS Operations and Patterns
RDS for MySQL, No BS Operations and PatternsRDS for MySQL, No BS Operations and Patterns
RDS for MySQL, No BS Operations and PatternsLaine Campbell
 
Hybrid my sql_hadoop_datawarehouse
Hybrid my sql_hadoop_datawarehouseHybrid my sql_hadoop_datawarehouse
Hybrid my sql_hadoop_datawarehouseLaine Campbell
 
Methods of Sharding MySQL
Methods of Sharding MySQLMethods of Sharding MySQL
Methods of Sharding MySQLLaine Campbell
 
Understanding MySQL Performance through Benchmarking
Understanding MySQL Performance through BenchmarkingUnderstanding MySQL Performance through Benchmarking
Understanding MySQL Performance through BenchmarkingLaine Campbell
 

More from Laine Campbell (10)

Recruiting for diversity in tech
Recruiting for diversity in techRecruiting for diversity in tech
Recruiting for diversity in tech
 
Database engineering
Database engineeringDatabase engineering
Database engineering
 
Velocity pythian operational visibility
Velocity pythian operational visibilityVelocity pythian operational visibility
Velocity pythian operational visibility
 
Pythian operational visibility
Pythian operational visibilityPythian operational visibility
Pythian operational visibility
 
Scaling MySQL in Amazon Web Services
Scaling MySQL in Amazon Web ServicesScaling MySQL in Amazon Web Services
Scaling MySQL in Amazon Web Services
 
RDS for MySQL, No BS Operations and Patterns
RDS for MySQL, No BS Operations and PatternsRDS for MySQL, No BS Operations and Patterns
RDS for MySQL, No BS Operations and Patterns
 
Running MySQL in AWS
Running MySQL in AWSRunning MySQL in AWS
Running MySQL in AWS
 
Hybrid my sql_hadoop_datawarehouse
Hybrid my sql_hadoop_datawarehouseHybrid my sql_hadoop_datawarehouse
Hybrid my sql_hadoop_datawarehouse
 
Methods of Sharding MySQL
Methods of Sharding MySQLMethods of Sharding MySQL
Methods of Sharding MySQL
 
Understanding MySQL Performance through Benchmarking
Understanding MySQL Performance through BenchmarkingUnderstanding MySQL Performance through Benchmarking
Understanding MySQL Performance through Benchmarking
 

Recently uploaded

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

CouchConf SF 2012 Lightning Talk - Operational Excellence

  • 1. Laine Campbell, Owner/Principal, laine@palominodb.com Charlie Killian, Director of Engineering, charlie@palominodb.com Scaling and Performance for Operational Excellence
  • 2. Who we are ● A boutique consultancy offering custom solutions. ● An operations support team providing a combined 100+ years of experience in distributed, performant and scalable solutions. ● A team of architects, engineers and operators who have worked at some of the most trafficked sites, games and companies since 1999. *
  • 3.
  • 4. Operational Excellence ● Configuration management and documentation. ● Change management. ● Availability management. ● Incident and problem management ● Backup, recovery and business continuity. ● Monitoring and Trending. *
  • 5. Configuration Management ● Consistent couchbase configurations. ○ Guis are great, but don't meet automation needs. ● Self documenting environments. ● Incorporating your infrastructure into your application to leverage couchbase ease of scale. ● Chef, puppet, ansible or "roll your own" using the couchbase API. *
  • 6. Change and Release Management ● Schemaless is great, but data governance is key. ● Your code needs to build a data dictionary or confusion reigns. ● DevOps style relationships build collaboration that can overcome the wild west mentality of schemaless environments. *
  • 7. Availability Management ● Moxi provides availability during node failures, supporting reads and writes. ● XDCR support in Couchbase 2.0 provides availability across datacenters and regions in an active/active topology. ● Special consideration in cloud environments must take into account AZ and region failovers. *
  • 8. Incident and Problem Management ● While not Couchbase specific, crucial to maintaining any highly available architecture. ● Appropriate alerting, response and communication processes ensure that isolated issues don't cascade into massive failures. ● Failing hardware, networks, design issues can all cause failures that can cascade into an entire cluster being down. ● Tracking recurring problems help with a continuous improvement on meeting SLAs. *
  • 9.
  • 10. Backup and Recovery ● Define your recovery SLAs. ● Track how long backups take. ● Test restores and track how long they take. ● Recognize all failure scenarios: ○ Node failure ○ Physical data corruption ○ Logical data corruption ○ Audits and forensics *
  • 11. Backup and Recovery 1.8 ● In 1.8, per node backup is supported. Replica sets are also backed-up, which can cause long, or non- completing backups. ● SQLite3 can be used as a logical dump to ease backups. ● Cluster-wide consistency can not be guaranteed. ● No incremental backups available. *
  • 12. Backup and Recovery 2.0 ● Cluster wide backups are now available, as well as incremental. ● EBS snapshots (or LVM, hardware, etc...) work well due to log-style writes to disk. ● With incremental, it is easier to meet SLAs without breaking the bank on storage. *
  • 13. Monitoring and Alerting ● Use logs! Centralized syslogs, splunk, custom scripts to identify and track error types and rates. ● Track your app! Latency of web pages, forms and api-calls are key indicators. ● Define key alerts, make them actionable and tied to documentation. ● Palomino builds plugins and templates to provide proper alerts that are useful and work! *
  • 14. Trending and Diagnostics ● Alerts aren't enough, you must track usage and internal metrics to understand trends, workloads and bottlenecks. ● Graph everything! All exposed metrics, trend health checks. ● Interleave graphs from internal metrics to external factors: Code pushes, application metrics (logins, purchases, api calls) *
  • 15. Care and Feeding ● Regular performance reviews. ● Defragmentation. ● Incorporate recovery tests into building test and dev environments. ● Scale-up/Scale-down, preferably via automated processes. ● Rolling upgrades. ● Coffee, pie, beer. *
  • 16.
  • 17. Partnering with Couchbase Providing remote Architecture, Engineering and DBA services to clients. Vendor neutral operations and scaling expertise for Couchbase clients in need of operators. *
  • 18. Remote Architecture and Engineering Services ● Architecture review and recommendations ● Data modeling ● Data model migration ● Data migration ● Cluster sizing ● Tools development *
  • 19. DBA and Operations Services ● Infrastructure builds and management ● Proactive operational support ● 24x7 operational support with 30 minutes SLA ● System health checks ● Backup and recovery ● Tuning for performance and scale ● Query reviews, indexing, benchmarking ● Capacity reviews *
  • 20. How we can help ● Support your proof of concept ● Migrate you to Couchbase Server ● Support your Couchbase Server clusters *
  • 21. Is Couchbase Server a good fit? ● Architecture review ● Data model review ● Recommendation on moving to Couchbase Server ● Data access best practices *
  • 22. Migrating from a RDBMS to Couchbase Server? ● Data model migration from relational to document ● Data migration from SQL Server to Couchbase Server ● Couchbase Server cluster sizing ● Infrastructure builds *
  • 23. Do you need operational experts? ● 24x7 operational support with 30 minutes SLA ● Multiple Couchbase Server 1.8 clusters ● Wanted Couchbase operational experts ● Escalate to Couchbase for software support *
  • 24.
  • 25. Contact Info Laine Campbell, laine@palominodb.com Charlie Killian, charlie@palominodb.com www.palominodb.com @palominodb on Twitter *