Cassandra.Toolkit
Tools of Cassandra Titans
Making Apache Cassandra Smarter, Faster, Better for Architects, Engineers, Operators
Create and manage global data platforms.
www.anant.us | solutions@anant.us | (855) 262-6826
3 Washington Circle, NW | Suite 301 | Washington, DC 20037
ARCHITECT
noun: architect; chief builder
verb: architect; design or make (COMPUTING)
“We create and manage global platforms that run on Cassandra and related technologies.”
4
Things We Love : Scalable Fast Data
Without DataStax
With DataStax
What are those!!!
Day to Day - Engineer/Operator/Architect
MobaXTerm / DBeaver / Hackolade - Day to Day
1. MobaXTerm - Really good multi-exec for Windows (or SuperPutty).
2. DBeaver - You can rig the free one to work with Cassandra, but the paid one is awesome.
3. Hackolade - Really great tool for general NoSQL data modeling.
Takeaways
1. Do you really want to keep connection info for 100 nodes in PuTTY?
2. Sometimes you just want to help someone out with a simple query without having to jump into cqlsh.
3. Good design tools like Hackolade also produce good docs.
MobaXTerm in Action (multi-exec)
DBeaver in Action for Cassandra
Hackolade in Action
Data Operations - In / Out / Around
DSBulk / Spark - Data Operations
1. DSBulk - Not too shabby a way to get data in / out in CSV or JSON format.
2. Spark Migrator - By Scylla, but totally works for Cassandra to Cassandra.
3. Spark Shell - The way to do updates and deletes on thousands of partitions interactively.
Takeaways
1. DSBulk is free and works with Cassandra.
2. Spark Migrator is free. Just try it out.
3. You don’t need DataStax to have Spark with Cassandra, but it makes it easier.
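The DSBulk in / out workflow can be sketched as a small command builder. This is a minimal sketch, assuming the `dsbulk` binary is on the PATH and using its documented short options (`-c` connector, `-k` keyspace, `-t` table, `-url` file location, `-h` contact point). The helper name `dsbulk_command` and the keyspace, table, and path values are hypothetical; the code only builds and prints the command, it does not run it.

```python
from typing import List


def dsbulk_command(operation: str, keyspace: str, table: str, url: str,
                   host: str = "127.0.0.1", connector: str = "csv") -> List[str]:
    """Build the argument list for a dsbulk load or unload run (hypothetical helper)."""
    if operation not in ("load", "unload"):
        raise ValueError("operation must be 'load' or 'unload'")
    if connector not in ("csv", "json"):
        raise ValueError("connector must be 'csv' or 'json'")
    return [
        "dsbulk", operation,
        "-c", connector,   # file format: csv or json
        "-k", keyspace,    # keyspace to read from or write to
        "-t", table,       # table to read from or write to
        "-url", url,       # file or directory holding the data
        "-h", host,        # contact point of the cluster
    ]


# Hypothetical names: export demo_ks.users to CSV, then load it back as JSON elsewhere.
export_cmd = dsbulk_command("unload", "demo_ks", "users", "/tmp/users_export")
import_cmd = dsbulk_command("load", "demo_ks", "users_copy", "/tmp/users.json",
                            connector="json")

print(" ".join(export_cmd))
print(" ".join(import_cmd))
```

Passing the list to `subprocess.run` would execute it; keeping the builder separate makes the command easy to review before it touches a cluster.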
DSBulk in Action
https://blog.anant.us/apache-cassandra-lunch-43-dsbulk-with-sed-and-awk/
Airflow / Jenkins - Data Operations / Scheduling
1. Airflow - Airflow has become the de facto tool for managing data pipelines / operations.
2. Jenkins - You can use this for data operations if you really want to and you already have it.
Takeaways
1. Airflow helps you run all your jobs for data pipeline processes and manages dependencies in a DAG.
2. Jenkins can also do this, but can’t do complex pipelines.
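To make "manages dependencies in a DAG" concrete, here is a minimal sketch of dependency ordering using only the Python standard library (`graphlib`, Python 3.9+). It is not Airflow's API, and the task names are hypothetical; it only shows how a scheduler can derive a valid run order from declared dependencies.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
pipeline = {
    "extract_from_cassandra": set(),
    "archive_raw_export": {"extract_from_cassandra"},
    "transform_with_spark": {"extract_from_cassandra"},
    "load_to_cassandra": {"transform_with_spark"},
    "report": {"load_to_cassandra", "archive_raw_export"},
}

# static_order() yields every task after all of its dependencies.
run_order = list(TopologicalSorter(pipeline).static_order())

for step, task in enumerate(run_order, start=1):
    print(step, task)
```

A cycle in the dictionary (for example, making the extract step depend on the report) would raise `graphlib.CycleError`, which is the kind of mistake a DAG-based scheduler rejects before anything runs.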
Airflow in Action
https://github.com/Anant/example-cassandra-etl-with-airflow-and-spark
https://blog.anant.us/apache-cassandra-lunch-53-cassandra-etl-with-airflow-and-spark/
Operate - Backup / Restore / Repair
Reaper / Medusa - Automatic Repair / Backup
1. Cassandra Reaper - GUI for managing repairs for Cassandra.
2. Cassandra Medusa - Backup / restore nodes and clusters (to/from S3, GCS, etc.).
Takeaways
1. CRONning your own repairs is not sustainable.
2. CRONning your own backups is not sustainable.
3. Don’t reinvent the wheel.
Cassandra Reaper in Action
Cassandra Medusa in Action
Understand - Diagnostics / Analysis
Cassandra.Vision - Offline Diagnostics
1. Diagnostic-collection - Grab all your logs, configs, etc. for analysis.
2. Cassandra.Vision/CassandraAnalyzer - Visualize diagnostics / logs offline.
3. Cassandra.Vision/TableAnalyzer - Visualize data / traffic / tombstone skew across all tables / keyspaces.
Takeaways
1. If you don’t have ELK online, you can still use it on your desktop using offline tools.
2. Analyzing the tablestats visually in one place can help you avert disaster later.
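One way to think about "skew across all tables" is each table's share of a cluster-wide total. The sketch below is a minimal illustration of that idea, not TableAnalyzer's actual method; the function names, the 50% threshold, and the tombstone counts are all made up for the example.

```python
from typing import Dict, List


def share_of_total(values: Dict[str, float]) -> Dict[str, float]:
    """Return each table's fraction of the summed metric (0.0 if the total is zero)."""
    total = sum(values.values())
    if total == 0:
        return {name: 0.0 for name in values}
    return {name: value / total for name, value in values.items()}


def skewed_tables(values: Dict[str, float], threshold: float = 0.5) -> List[str]:
    """List tables whose share of the total exceeds the threshold, sorted by name."""
    shares = share_of_total(values)
    return sorted(name for name, share in shares.items() if share > threshold)


# Made-up tombstone counts per table, standing in for numbers read from tablestats.
tombstones = {"ks.events": 9000, "ks.users": 500, "ks.sessions": 500}

shares = share_of_total(tombstones)
hot = skewed_tables(tombstones)
print(shares)
print(hot)
```

The same two functions work for any per-table metric (partition size, read count, write count), which is why looking at them side by side in one place is useful.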
Offline Cassandra Analyzer (ELK)
Table Analyzer
Support - Metrics / Logs
MCAC / Prometheus + Grafana - Metrics
1. Metrics Collector for Apache Cassandra (MCAC) - All in one package.
2. Cortex - Prometheus at scale on Cassandra.
3. Prometheus / Grafana - The O.G. of time series system data vis.
Takeaways
1. Don’t have OpsCenter? Get MCAC.
2. Need to keep data for thousands of nodes? Look into Cortex.
3. Prometheus / Grafana work with everything else, so you can’t go wrong.
MCAC all Setup in Action
Cortex - 1 Trillion Samples a Day
Filebeat / Elastic / Kibana - “Free” Log Analytics
1. Filebeat (or Logstash) - Parse and dissect logs before sending them to Elasticsearch.
2. Elasticsearch / Open Distro - Self-explanatory.
3. Kibana - Search your logs in one place, visually.
Takeaways
1. Tailing logs in MobaXTerm works for up to 6, maybe 8 computers.
2. Log aggregation with intelligent parsing helps find patterns faster.
3. Having dashboards set up beforehand makes it even easier.
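The "parse and dissect" step can be sketched with a regular expression. This is a minimal sketch in Python rather than a Filebeat configuration, and it assumes the log lines follow the common Cassandra `system.log` layout of level, thread, timestamp, source file and line, then message. The sample line is made up for illustration.

```python
import re
from typing import Dict, Optional

# Assumed layout: LEVEL [thread] yyyy-mm-dd HH:MM:SS,mmm File.java:123 - message
LOG_LINE = re.compile(
    r"^(?P<level>[A-Z]+)\s+"
    r"\[(?P<thread>[^\]]+)\]\s+"
    r"(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})\s+"
    r"(?P<source>[^\s:]+):(?P<line>\d+)\s+-\s+"
    r"(?P<message>.*)$"
)


def dissect(line: str) -> Optional[Dict[str, str]]:
    """Split one log line into named fields, or return None if it does not match."""
    match = LOG_LINE.match(line)
    return match.groupdict() if match else None


# Made-up sample line in the assumed layout.
sample = ("WARN  [ReadStage-2] 2021-08-04 10:15:30,123 ReadCommand.java:569 - "
          "Read 100 live rows and 2000 tombstone cells")

fields = dissect(sample)
print(fields)
```

Once every line carries fields such as `level`, `thread`, and `source`, questions like "which nodes log the most tombstone warnings" become a filter in a dashboard instead of a grep across machines.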
ELK in Action for Cassandra
DevOps - Infrastructure / Software Configuration / Large Scale Admin
Terraform - Infrastructure as Code
1. Terraform - Manage different clouds with one language.
2. Atlantis - Manage Terraform with GitHub.
3. Terragrunt - Makes your Terraform code simpler for different environments.
Takeaways
1. Terraform is the best way to manage infrastructure as code.
2. It will help operators create and destroy VM clusters / configurations.
3. Scaling clusters up and down is easy with Terraform.
Terraform in Action
Atlantis in Action
Ansible - Configuration Management
1. Ansible - Better organizes the commands that need to be run: set up, configure, run ad-hoc commands.
2. Ansible Semaphore - Open-source GUI for Ansible.
3. AWX - Open-source version of Ansible Tower.
Takeaways
1. Manage configuration consistently across nodes / datacenters.
2. Manage environments more easily with variables / templates.
3. Run rolling commands on a cluster, data center, or multiple clusters.
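The idea behind a rolling command can be sketched in a few lines: act on one node at a time, one data center at a time, and stop at the first failure so the rest of the cluster stays untouched. In Ansible this is typically expressed with the play-level `serial` keyword; the Python below is only an illustration of the control flow, with a hypothetical inventory and a stand-in action.

```python
from typing import Callable, Dict, List


def rolling_run(nodes_by_dc: Dict[str, List[str]],
                action: Callable[[str], bool]) -> List[str]:
    """Run action on one node at a time; return the nodes completed before any failure."""
    completed: List[str] = []
    for dc, nodes in nodes_by_dc.items():
        for node in nodes:
            if not action(node):
                # Stop immediately so a bad change does not spread to more nodes.
                return completed
            completed.append(node)
    return completed


# Hypothetical inventory: two data centers, three nodes in total.
inventory = {"dc1": ["10.0.1.1", "10.0.1.2"], "dc2": ["10.0.2.1"]}


def fake_restart(node: str) -> bool:
    """Stand-in for a real command; pretends the dc2 node fails."""
    return node != "10.0.2.1"


done = rolling_run(inventory, fake_restart)
print(done)
```

A real action would drain the node, apply the change, and wait for the node to report healthy before returning `True`.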
Ansible in Action
Ansible Semaphore in Action
Ansible AWX in Action
Bleeding Edge : Containers
Docker / Kubernetes / K8ssandra / Stargate
1. Docker - Customize your image.
2. Kubernetes - Run your cluster.
3. K8ssandra - Run your cluster, easier.
4. Stargate - C* API layer on day one.
Takeaways
1. Containers are the future, so play with this now.
2. Containers make it easier for people to test things out without as heavy a hardware investment.
3. Developers want APIs when possible.
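As an illustration of what an API layer means for developers, the sketch below builds (but does not send) an HTTP request for a single row. It assumes Stargate's REST v2 path layout (`/v2/keyspaces/<keyspace>/<table>/<primary key>`), its `X-Cassandra-Token` header, and a local endpoint on port 8082; the helper name, keyspace, table, key, and token are hypothetical.

```python
import urllib.parse
import urllib.request


def build_row_request(base_url: str, keyspace: str, table: str,
                      primary_key: str, token: str) -> urllib.request.Request:
    """Build a GET request for one row by primary key (hypothetical helper)."""
    # Percent-encode each path segment so unusual key values stay in one segment.
    path = "/".join(urllib.parse.quote(part, safe="")
                    for part in (keyspace, table, primary_key))
    return urllib.request.Request(
        f"{base_url}/v2/keyspaces/{path}",
        headers={"X-Cassandra-Token": token, "Accept": "application/json"},
        method="GET",
    )


# Assumed local endpoint and made-up names and token.
req = build_row_request("http://localhost:8082", "demo_ks", "users", "42",
                        "example-token")
print(req.get_method(), req.full_url)
```

Calling `urllib.request.urlopen(req)` against a running Stargate instance would send it; no driver or CQL session is involved, which is the appeal for application developers.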
K8ssandra in Action
Stargate in Action
Tools for Cassandra Titans
Any Questions?
Apache Cassandra Lunch #58: Tools for Cassandra Titans