SlideShare una empresa de Scribd logo
1 de 18
DICE Horizon 2020 Project
Grant Agreement no. 644869
http://www.dice-h2020.eu Funded by the Horizon 2020
Framework Programme of the European Union
Monitoring in Big Data
Frameworks
Gabriel Iuhasz
Institute e-Austria Timisoara
26 November 2015
Overview
o Introduction
o Cloud Computing and Big Data
o Monitoring Tools
o Monitoring Requirements and Solutions
o Conclusions
Introduction
o Big Data in Cloud computing
o Volume, Velocity, Variety and Veracity
o Cost Reduction, Rapid provisioning/time to market,
Flexibility/scalability
o DevOps and Cloud
o Development and Operations
o Communication, Collaboration, Integration,
Automation
o DevOps Monitoring
o Measurement is a key aspect of DevOps
Big Data in Cloud Computing
o Challenges of Big Data On Cloud
o Low Latency real-time data
o Virtualization overhead
o Multi-tenancy overhead
o Scalability
o Lack of RDBMS support
o Availability
o Data integrity/privacy
Hadoop Ecosystem
Cloudera
HortonWorks
Monitoring Architecture
o Cross layer monitoring of big data platforms
o Types of metrics are highly dependent on the type of the
application
o Have to be decided on a platform/application basis
o Centralized Monitoring
o All resource states are sent to a centralized monitoring server
o Metrics are continuously polled from monitored components
o Single point of failure
o Lacks scalability
o Decentralized Monitoring
o No single point of failure
o Central authority is diffused
Tools
o Hadoop Performance Monitoring UI
o Lightweight monitoring UI for Hadoop server
o Uses Hadoop metrics (using Sinks)
o SequenceIQ
o Based on ELK stack and Docker containers
o ElasticSearch can be easily scaled horizontally
o Logstash server on client side
o Ganglia
o Scalable distributed monitoring system
o Low per-node overhead
o Focused on System Metrics
o Gmond, gmetad and Web Front-end
Tools II
o Apache Chukwa
o Built on top of HDFS
o Easily scalable
o Potentially high overhead
o Hadoop Vaidya
o Rule Based diagnostic tool for M/R jobs
o Performes post run results analysis
o Nagios
o Plugin based architecture
o Uses a centralized server to collect metrics
o Possible to create a hierarchical deployment
Requirements
o Difficulties in cloud monitoring
o Scale
o Velocity or Timeliness
o Constant changes
o The need for scalability and automation
o Easy re-configurability
o Lightweight metrics collectors
o Identifying pertinent metrics
DICE Overview
Platform-Indep.
Model
Domain
Models
Continuous
Validation
Continuous
Monitoring
Data
Awareness
Architecture
Model
Platform-Specific
Model
Platform
Description
DICE MARTE
Deployment &
Continuous
Integration
DICE IDE
Big Data
QA
Models
DICE Monitoring Platform
o RESTful Web Service
o Used to deploy and configure all core/auxiliary components
o Used to query ElasticSearch
Exports metrics in: JSON, CSV, OSLC Perf. Mon 2.0 (RDF+XML)
o Used for auto-scaling of monitoring solution
o ELK Stack
o Extremely flexible/configurable
o Horizontally scalable
o Can except various input and output formats
o ETL via Logstash server (filters)
o Logstash-forwarder secure transmission (new Beats Data Shippers)
o Visualization using Kibana4
o Collectd
o Statistics collection daemon
o A lot of plugins available
o Simple configuration
DICE Monitoring Platform II
DICE Monitoring Platform Scaled
DICE Monitoring Platform Variant
Conclusions
o We have given a short overview of current
monitoring platforms Identified key requirements for
Big Data Monitoring
o Scaling, Autonomy, Timeliness
o Automation via Chef recipes
o Presented the current Architecture of the DICE
Monitoring Platform
o Currently collecting from: HDFS, YARN, Spark, Storm, Kafka
o In the near future: Cassandra possibly Trident
o Creating the full lambda architecture based anomaly
detection platform
o ElasticSearch used as serving layer
Thank You!
Questions?

Más contenido relacionado

La actualidad más candente

Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Altan Khendup
 

La actualidad más candente (18)

Complex event processing platform handling millions of users - Krzysztof Zarz...
Complex event processing platform handling millions of users - Krzysztof Zarz...Complex event processing platform handling millions of users - Krzysztof Zarz...
Complex event processing platform handling millions of users - Krzysztof Zarz...
 
Migrating Big Data Workloads to the Cloud
Migrating Big Data Workloads to the CloudMigrating Big Data Workloads to the Cloud
Migrating Big Data Workloads to the Cloud
 
Spark Streaming and Expert Systems
Spark Streaming and Expert SystemsSpark Streaming and Expert Systems
Spark Streaming and Expert Systems
 
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
How KeyBank Used Elastic to Build an Enterprise Monitoring SolutionHow KeyBank Used Elastic to Build an Enterprise Monitoring Solution
How KeyBank Used Elastic to Build an Enterprise Monitoring Solution
 
T-Mobile and Elastic
T-Mobile and ElasticT-Mobile and Elastic
T-Mobile and Elastic
 
Batch and Interactive Analytics: From Data to Insight
Batch and Interactive Analytics: From Data to InsightBatch and Interactive Analytics: From Data to Insight
Batch and Interactive Analytics: From Data to Insight
 
Leveraging Spark to Democratize Data for Omni-Commerce with Shafaq Abdullah
Leveraging Spark to Democratize Data for Omni-Commerce with Shafaq AbdullahLeveraging Spark to Democratize Data for Omni-Commerce with Shafaq Abdullah
Leveraging Spark to Democratize Data for Omni-Commerce with Shafaq Abdullah
 
The Rise of Engineering-Driven Analytics by Loren Shure
The Rise of Engineering-Driven Analytics by Loren ShureThe Rise of Engineering-Driven Analytics by Loren Shure
The Rise of Engineering-Driven Analytics by Loren Shure
 
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
Shortening the Feedback Loop: How Spotify’s Big Data Ecosystem has evolved to...
 
Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"Monitoring Big Data Systems - "The Simple Way"
Monitoring Big Data Systems - "The Simple Way"
 
ironSource Atom BigData Berlin
ironSource Atom BigData BerlinironSource Atom BigData Berlin
ironSource Atom BigData Berlin
 
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
Data Apps with the Lambda Architecture - with Real Work Examples on Merging B...
 
Intuit Analytics Cloud 101
Intuit Analytics Cloud 101Intuit Analytics Cloud 101
Intuit Analytics Cloud 101
 
Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!Counting Unique Users in Real-Time: Here's a Challenge for You!
Counting Unique Users in Real-Time: Here's a Challenge for You!
 
Data ops in practice - Swedish style
Data ops in practice - Swedish styleData ops in practice - Swedish style
Data ops in practice - Swedish style
 
Netflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering MeetupNetflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering Meetup
 
Data Pipline Observability meetup
Data Pipline Observability meetup Data Pipline Observability meetup
Data Pipline Observability meetup
 
Lambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big dataLambda Architecture and open source technology stack for real time big data
Lambda Architecture and open source technology stack for real time big data
 

Destacado

Apache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data ProcessingApache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data Processing
DataWorks Summit
 
Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep Dive
DataWorks Summit
 

Destacado (7)

Каталог 17/2016
Каталог 17/2016Каталог 17/2016
Каталог 17/2016
 
Pig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataPig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big Data
 
IDOL presentation
IDOL presentationIDOL presentation
IDOL presentation
 
Apache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data ProcessingApache Tez - A New Chapter in Hadoop Data Processing
Apache Tez - A New Chapter in Hadoop Data Processing
 
Integrating big data into the monitoring and evaluation of development progra...
Integrating big data into the monitoring and evaluation of development progra...Integrating big data into the monitoring and evaluation of development progra...
Integrating big data into the monitoring and evaluation of development progra...
 
Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep Dive
 
From Code to Kubernetes
From Code to KubernetesFrom Code to Kubernetes
From Code to Kubernetes
 

Similar a Monitoring in Big Data Frameworks @ Big Data Meetup, Timisoara, 2015

Cardinality-HL-Overview
Cardinality-HL-OverviewCardinality-HL-Overview
Cardinality-HL-Overview
Harry Frost
 

Similar a Monitoring in Big Data Frameworks @ Big Data Meetup, Timisoara, 2015 (20)

Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iter...
Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iter...Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iter...
Cloud Expo 2015: DICE: Developing Data-Intensive Cloud Applications with Iter...
 
Apache Stratos - Building a PaaS using OSGi and Equinox
Apache Stratos - Building a PaaS using OSGi and EquinoxApache Stratos - Building a PaaS using OSGi and Equinox
Apache Stratos - Building a PaaS using OSGi and Equinox
 
Private, Managed, Public - All Things WSO2 Cloud
Private, Managed, Public - All Things WSO2 CloudPrivate, Managed, Public - All Things WSO2 Cloud
Private, Managed, Public - All Things WSO2 Cloud
 
Istio Service Mesh
Istio Service MeshIstio Service Mesh
Istio Service Mesh
 
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
Virtual BenchLearning - I-BiDaaS - Industrial-Driven Big Data as a Self-Servi...
 
Dataverse in the European Open Science Cloud
Dataverse in the European Open Science CloudDataverse in the European Open Science Cloud
Dataverse in the European Open Science Cloud
 
Cardinality-HL-Overview
Cardinality-HL-OverviewCardinality-HL-Overview
Cardinality-HL-Overview
 
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
Session 2 - A Project Perspective on Big Data Architectural Pipelines and Ben...
 
BigDataEurope @BDVA Summit2016 1: The BDE Platform
BigDataEurope @BDVA Summit2016 1: The BDE PlatformBigDataEurope @BDVA Summit2016 1: The BDE Platform
BigDataEurope @BDVA Summit2016 1: The BDE Platform
 
Dataverse SSHOC enrichment of DDI support at EDDI'19 2
Dataverse SSHOC enrichment of DDI support at EDDI'19 2Dataverse SSHOC enrichment of DDI support at EDDI'19 2
Dataverse SSHOC enrichment of DDI support at EDDI'19 2
 
MISE2015
MISE2015MISE2015
MISE2015
 
DevNation Tech Talk: Getting GitOps
DevNation Tech Talk: Getting GitOpsDevNation Tech Talk: Getting GitOps
DevNation Tech Talk: Getting GitOps
 
Phoenix Data Conference - Big Data Analytics for IoT 11/4/17
Phoenix Data Conference - Big Data Analytics for IoT 11/4/17Phoenix Data Conference - Big Data Analytics for IoT 11/4/17
Phoenix Data Conference - Big Data Analytics for IoT 11/4/17
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
 
DICE & Cloudify – Quality Big Data Made Easy
DICE & Cloudify – Quality Big Data Made EasyDICE & Cloudify – Quality Big Data Made Easy
DICE & Cloudify – Quality Big Data Made Easy
 
Data Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWSData Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWS
 
DICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, Romania
DICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, RomaniaDICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, Romania
DICE @ Innomatch 2015, 3rd Regional Innovation Fair, Arad, Romania
 
WSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product OverviewWSO2 Machine Learner - Product Overview
WSO2 Machine Learner - Product Overview
 
Reliable Data Intestion in BigData / IoT
Reliable Data Intestion in BigData / IoTReliable Data Intestion in BigData / IoT
Reliable Data Intestion in BigData / IoT
 
PaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer Demand
PaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer DemandPaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer Demand
PaaS Lessons: Cisco IT Deploys OpenShift to Meet Developer Demand
 

Último

CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
anilsa9823
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
anilsa9823
 

Último (20)

Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female serviceCALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
CALL ON ➥8923113531 🔝Call Girls Badshah Nagar Lucknow best Female service
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 

Monitoring in Big Data Frameworks @ Big Data Meetup, Timisoara, 2015

  • 1. DICE Horizon 2020 Project Grant Agreement no. 644869 http://www.dice-h2020.eu Funded by the Horizon 2020 Framework Programme of the European Union Monitoring in Big Data Frameworks Gabriel Iuhasz Institute e-Austria Timisoara 26 November 2015
  • 2. Overview o Introduction o Cloud Computing and Big Data o Monitoring Tools o Monitoring Requirements and Solutions o Conclusions
  • 3. Introduction o Big Data in Cloud computing o Volume, Velocity, Variety and Veracity o Cost Reduction, Rapid provisioning/time to market, Flexibility/scalability o DevOps and Cloud o Development and Operations o Communication, Collaboration, Integration, Automation o DevOps Monitoring o Measurement is a key aspect of DevOps
  • 4. Big Data in Cloud Computing o Challenges of Big Data On Cloud o Low Latency real-time data o Virtualization overhead o Multi-tenancy overhead o Scalability o Lack of RDBMS support o Availability o Data integrity/privacy
  • 8. Monitoring Architecture o Cross layer monitoring of big data platforms o Types of metrics are highly dependent on the type of the application o Have to be decided on a platform/application basis o Centralized Monitoring o All resource states are sent to a centralized monitoring server o Metrics are continuously polled from monitored components o Single point of failure o Lacks scalability o Decentralized Monitoring o No single point of failure o Central authority is diffused
  • 9. Tools o Hadoop Performance Monitoring UI o Lightweight monitoring UI for Hadoop server o Uses Hadoop metrics (using Sinks) o SequenceIQ o Based on ELK stack and Docker containers o ElasticSearch can be easily scaled horizontally o Logstash server on client side o Ganglia o Scalable distributed monitoring system o Low per-node overhead o Focused on System Metrics o Gmond, gmetad and Web Front-end
  • 10. Tools II o Apache Chukwa o Built on top of HDFS o Easily scalable o Potentially high overhead o Hadoop Vaidya o Rule Based diagnostic tool for M/R jobs o Performes post run results analysis o Nagios o Plugin based architecture o Uses a centralized server to collect metrics o Possible to create a hierarchical deployment
  • 11. Requirements o Difficulties in cloud monitoring o Scale o Velocity or Timeliness o Constant changes o The need for scalability and automation o Easy re-configurability o Lightweight metrics collectors o Identifying pertinent metrics
  • 13. DICE Monitoring Platform o RESTful Web Service o Used to deploy and configure all core/auxiliary components o Used to query ElasticSearch Exports metrics in: JSON, CSV, OSLC Perf. Mon 2.0 (RDF+XML) o Used for auto-scaling of monitoring solution o ELK Stack o Extremely flexible/configurable o Horizontally scalable o Can except various input and output formats o ETL via Logstash server (filters) o Logstash-forwarder secure transmission (new Beats Data Shippers) o Visualization using Kibana4 o Collectd o Statistics collection daemon o A lot of plugins available o Simple configuration
  • 17. Conclusions o We have given a short overview of current monitoring platforms Identified key requirements for Big Data Monitoring o Scaling, Autonomy, Timeliness o Automation via Chef recipes o Presented the current Architecture of the DICE Monitoring Platform o Currently collecting from: HDFS, YARN, Spark, Storm, Kafka o In the near future: Cassandra possibly Trident o Creating the full lambda architecture based anomaly detection platform o ElasticSearch used as serving layer

Notas del editor

  1. - DevOps is a design philosophy that emphasizes collaboration and communication while automating the process of software delivery and infrastructure changes
  2. RDBMS
  3. Quality-Aware Development for Big Data applications