SlideShare una empresa de Scribd logo
1 de 41
Low-Cost Adaptive Monitoring
Techniques for the Internet of Things
The AdaM Framework
Demetris Trihinas
University of Cyprus
trihinas@cs.ucy.ac.cy
Talk at TU Berlin, 15 mar. 2015
This talk is based on…
AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices, D. Trihinas
and G. Pallis and M. D. Dikaiakos, 2015 IEEE International Conference on Big Data (IEEE BigData
2015), Santa Clara, CA, USA Pages: 717–726, 2015.
Low-Cost Adaptive Monitoring Techniques for the Internet of Things, D. Trihinas and G. Pallis and
M. D. Dikaiakos, Transactions on Big Data, 2016, (In Review).
Demetris Trihinas Talk at TU Berlin, 15 mar. 2015 2
Demetris Trihinas 3
IoT was initially devices sensing and exchanging data streams
with humans or other network-enabled devices
Init
Sense
Transmit
Sleep
Talk at TU Berlin, 15 mar. 2015
Edge-Mining
Demetris Trihinas 4
A term coined to reflect data processing and decision-making on
“smart” devices that sit at the edge of IoT networks
Buy more
detergent
…our devices just got a little bit more “smarter”…
Init
Sense
Transmit
Sleep
Receive
Actuation
Talk at TU Berlin, 15 mar. 2015
The “Big Data” in IoT
Demetris Trihinas 5
[IDC, Big Data in IoT, 2014]
[Cisco, IBSG, Apr 2011]
Talk at TU Berlin, 15 mar. 2015
IoT Monitoring Data
2% of digital universe in
2012
Projections for >10% in
2020
[Gartner, 2015]
Big Data Challenges Transitioning to IoT
Demetris Trihinas 6
• Taming data volume and data velocity with
limited processing and network capabilities
• IoT devices are usually battery-powered which means intense
processing results in increased energy consumption (less battery-life)
Zhang et al., Usenix HotCloud, 2015
Talk at TU Berlin, 15 mar. 2015
Low-Cost Adaptive
Monitoring Techniques
Adaptive Sampling and Filtering
Demetris Trihinas 7Talk at TU Berlin, 15 mar. 2015
Metric Stream
Demetris Trihinas 8
• A metric stream 𝑀 = {𝑠𝑖}𝑖=0
𝑛
published from a monitoring source is
a large sequence of samples, denoted as 𝑠𝑖 , where 𝑖 = 0, 1, … , 𝑛
and 𝑛 → ∞
• Each sample 𝑠𝑖 is a tuple 𝑡𝑖, 𝑣𝑖 described by a timestamp 𝑡𝑖 and a
value 𝑣𝑖
𝑠𝑖
𝑠𝑖+1
Metric Stream M
Talk at TU Berlin, 15 mar. 2015
Periodic Sampling
Demetris Trihinas 9
• The process of triggering the collection mechanism of a monitored source every
𝑇 time units such that the 𝑖 𝑡ℎ sample is collected at time 𝑡𝑖 = 𝑖 ∙ 𝑇
𝑠𝑖
𝑠𝑖+1
Metric Stream M sampled every T = 1s
Compute resources and energy are wasted while generating large data volumes at a high velocity
Metric Stream M sampled every T = 10s
Sudden events and significant insights are missed
Talk at TU Berlin, 15 mar. 2015
Adaptive Sampling
Demetris Trihinas 10
• Dynamically adjust the sampling period 𝑇𝑖 based on some function 𝜌(𝑀),
containing information of the metric stream evolution
𝑠𝑖
𝑠𝑖+1
𝑇𝑖+1
Metric Stream 𝑀′ dynamically sampledMetric Stream 𝑀 with 𝑇 = 1𝑠
• Find the max 𝑇 ∈ [𝑇 𝑚𝑖𝑛, 𝑇 𝑚𝑎𝑥] to collect sample 𝑠𝑖+1 based on an estimation
of the evolution of the metric stream 𝜌(𝑀), such that 𝑀′ differs from 𝑀 less
than a user-defined imprecision value 𝛾 (𝑑𝑖𝑠𝑡 < 𝛾)
Talk at TU Berlin, 15 mar. 2015
Metric Filtering
Demetris Trihinas 11
• The process of suppressing metric value dissemination when
consecutive values do not “change” (e.g. differ less than a
range of values)
• Goal: Reduce data volume and network overhead in favour of
exact precision
• How much “change” is required, depends on the type of filter
applied
• The receiver-side (e.g., base station, monitoring server) assumes
that the values of any unreported metrics remain unchanged
Talk at TU Berlin, 15 mar. 2015
Metric Filtering
• Fixed Range Metric Filters
Demetris Trihinas Talk at TU Berlin, 15 mar. 2015 12
Metric Stream M with fixed filter range 𝑅 = 1%
if (curValue ∈ [prevValue – R, prevValue + R ])
filter(curValue)
No samples are filtered
Can we know this R
beforehand?
Should it keep the
same value forever?
Is it the same for each
and every metric?
Adaptive Filtering
Demetris Trihinas 13
• Dynamically adjusting the filter range 𝑅 based on the current variability
of the metric stream, denoted as 𝑞(𝑀)
Metric Stream 𝑀′ with adaptive RangeMetric Stream 𝑀
𝑠𝑖
𝑠𝑖+1
𝑅𝑖+1
𝑅𝑖
• Find the max filter range 𝑅𝑖+1 ∈ [𝑅 𝑚𝑖𝑛, 𝑅 𝑚𝑎𝑥] for sample 𝑠𝑖+1 such
that 𝑀′ differs from 𝑀 less than a user-defined imprecision value 𝛾
based on the variability of the metric stream 𝑞(𝑀)
Talk at TU Berlin, 15 mar. 2015
The Adaptive Monitoring Framework
Demetris Trihinas 14
• Software library with no external dependencies embeddable on IoT devices
• Reduces processing, network traffic and energy consumption by adapting the
monitoring intensity based on the metric stream evolution and variability
• Achieves a balance between efficiency and accuracy
It’s open-source! Give it a try!
https://github.com/dtrihinas/AdaM
Raspberry Pi
Arduino
Beacons
Talk at TU Berlin, 15 mar. 2015
Sensing
Unit
Processing
Unit
Dissemination
Unit
Adaptive
Sampling
Knowledge
Base
AdaM IoT Device
𝑠𝑖
(𝑇𝑖+1, 𝑐𝑖)
(𝑅𝑖+1, 𝐹𝑖)
𝑠𝑖
Adaptive
Filtering
A
P
I
𝑠𝑖
The AdaM Framework
AdaM’s
Algorithms
Adaptive Sampling & Filtering
Demetris Trihinas 15Talk at TU Berlin, 15 mar. 2015
Adaptive Sampling Algorithm
• Step 1: Compute the distance 𝛿𝑖 between the current two consecutive
sample values
Demetris Trihinas 16
Why use the distance instead of the current value?
𝛿𝑖 = |𝑣𝑖 − 𝑣𝑖−1|
Threshold-base techniques increase sampling rate while sample values
approach a user-defined threshold
• Good for anomaly detection (e.g. DDoS attacks)
• But, what about events away from threshold? (e.g. low rate DDoS attacks)
[Meng, Trans on Computers, 2013]
Talk at TU Berlin, 15 mar. 2015
Adaptive Sampling Algorithm
• Step 2: Compute metric stream evolution based on a Probabilistic
Exponential Weighted Moving Average (PEWMA) to estimate 𝛿𝑖+1
and the standard deviation 𝜎𝑖+1:
𝛿𝑖+1 = 𝜇𝑖 = 𝑎 ∙ 𝜇𝑖−1 + 1 − 𝑎 𝛿𝑖
Demetris Trihinas 17
Looks like an exponential moving average, right?
But weighting is probabilistically applied!
𝑎 = 𝛼 1 − 𝛽P𝑖
Moving standard deviation
with only previous value
knowledge
Talk at TU Berlin, 15 mar. 2015
Why use a Probabilistic EWMA?
Simple EWMA is volatile to abrupt transient changes
• Slow to acknowledge spike after “stable” periods
• If “stable” phases follow sudden spikes, then subsequent
values are overestimated
Demetris Trihinas 18
Sounds a lot like a job for
a Gaussian distribution!
Talk at TU Berlin, 15 mar. 2015
Adaptive Sampling Algorithm
• Step 3: Compute actual standard deviation 𝜎𝑖
• Step 4: Compute the current confidence (𝑐𝑖 ≤ 1) of our approach based
on the actual and estimated standard deviation
𝑐𝑖 = 1 −
| 𝜎𝑖 − 𝜎𝑖|
𝜎𝑖
• Step 5: Compute sampling period 𝑇𝑖+1 based on the current confidence
and the user-defined imprecision γ (e.g. 10% tolerance to errors)
Demetris Trihinas 19
… the more “confident” the algorithm is, the larger the
outputted sampling period 𝑇𝑖+1 can be…
O(1) complexity as all steps only use their previous values!
Talk at TU Berlin, 15 mar. 2015
λ is an
aggressiveness
multiplier
(default λ=1)
Adaptive Filtering Algorithm
• Step 1: Compute the current variability of the metric stream using a moving
Fano Factor 𝐹𝑖 based on exponentially weighted average 𝜇𝑖 and standard
deviation 𝜎𝑖 already computed from adaptive sampling
𝐹𝑖 =
𝜎𝑖
2
𝜇𝑖
𝑅𝑖 = 𝑅𝑖−1 ± 0.01 ∙ 𝑅𝑖−1
Even for a 1% adjustment critical values can be filtered out in biosignal monitoring
Demetris Trihinas 20
…a low 𝐹𝑖 (due to 𝜎𝑖) indicates a currently in-dispersed data
stream which means low variability in the metric stream…
Talk at TU Berlin, 15 mar. 2015
Why use variability and not follow a stepwise approach?
Adaptive Filtering Algorithm
• Step 2: Compute the new filter range 𝑅𝑖+1 based on 𝐹𝑖 and the user-
defined imprecision 𝛾
Demetris Trihinas 21
O(1) complexity as all steps only use their previous values!
𝑠𝑖
𝑠𝑖+1
𝑅𝑖+1
𝑠𝑖+2
𝑠𝑖+3
𝐹𝑖𝑙𝑡𝑒𝑟𝑒𝑑!
𝑅𝑖+2
𝑅𝑖
𝐹 < 𝛾
Talk at TU Berlin, 15 mar. 2015
𝐹 > 𝛾
𝐹𝑘−1 > 𝐹𝑘
𝑠 𝑘
Evaluation
Demetris Trihinas 22Talk at TU Berlin, 15 mar. 2015
AdaM vs State-of-the-Art
• i-EWMA [1]: an EWMA adapting sampling period by 1 time unit when
the estimated error (ε) is under/over user-defined imprecision
• L-SIP [2]: a linear algorithm using a double EWMA to produce
estimates of current data distribution based on rate values change
• Slow to react to highly transient and abrupt fluctuations in the metric stream
• FAST [3]: an (aggressive) framework using a PID controller to compute
(large) sampling periods accompanied by a Kalman Filter to predict
values at non sampling points
Demetris Trihinas 23
[1] Trihinas et al., CCGrid, 2014 [2] Gaura et al., IEEE Sensors Journal, 2013 [3] Fan et al., ACM CIKM, 2012
Talk at TU Berlin, 15 mar. 2015
Datasets
Demetris Trihinas 24
Memory Trace
Java Sorting Benchmark
CPU Trace
Carnegie Mellon RainMon Project
Disk I/O Trace
Carnegie Mellon RainMon Project
TCP Port Monitoring Trace
Cyber Defence SANS Tech Institute
Step Trace
Fitbit Charge HR Wearable
Talk at TU Berlin, 15 mar. 2015
Heart Trace
Fitbit Charge HR Wearable
Our Small “Big Data” Testbeds
Demetris Trihinas 25
Raspberry Pi 1st Gen. Model B
512MB RAM, Single Core ARM 700MHz
Daemon on OS emulates traces while feeding
samples to each algorithm
Android Emulator + SensorSimulator
128MB RAM, Single Core ARM 32MHz
SensorSimulator script emulates traces by
feeding samples to Android Wear emulator for
Step and Heartrate processing
Daemon
Talk at TU Berlin, 15 mar. 2015
Evaluation Metrics
• Estimation Accuracy – Mean Absolute Percentage Error (MAPE)
• CPU Cycles
• Outgoing Network Traffic
• Edge Device Energy Consumption*
Demetris Trihinas 26
*Xiao et al., ACM DAC, 2010
**Brooks, ACM SIGARCH, 2000
perf
nethogs
Powerstat and
Other than error evaluation no other study goes through an overhead study!
Talk at TU Berlin, 15 mar. 2015
Wattch**
Error Comparison
Demetris Trihinas 27
Memory Trace CPU Trace Disk I/O Trace
TCP Port Monitoring Trace Fitbit Step Trace
For all traces AdaM has the smallest inaccuracy (<10%)
Even in a more aggressive configuration (λ=2) AdaM is still comparable to L-SIP
AdaM AdaM AdaM
AdaM AdaM
User-defined accepted imprecision set to γ = 0.1
Talk at TU Berlin, 15 mar. 2015
Fitbit Heartrate Trace
AdaM
AdaM γ-Parameter Evaluation
For a wide range of γ-parameter values we compute AdaM’s
MAPE per trace
Demetris Trihinas 28
Even for extreme imprecision values (>0.3) AdaM can still take correct
decisions signifying the importance of the confidence metric
Talk at TU Berlin, 15 mar. 2015
So How Well Does AdaM Perform?
Demetris Trihinas 29
Memory Trace
Java Sorting Benchmark
CPU Trace
Carnegie Mellon RainMon Project
Disk I/O Trace
Carnegie Mellon RainMon Project
TCP Port Monitoring Trace
Cyber Defence SANS Tech Institute
Step Trace
Fitbit Charge HR Wearable
Talk at TU Berlin, 15 mar. 2015
Heartrate Trace
Fitbit Charge HR Wearable
Overhead Comparison (1)
Demetris Trihinas 30
Network Traffic
Energy Consumption
CPU Cycles
Filtering adds an overhead <2% to AdaM but
benefits network traffic reduction by at least
32% and energy consumption by 12%
Talk at TU Berlin, 15 mar. 2015
Overhead Comparison (2)
Demetris Trihinas 31
Network Traffic
Energy Consumption
CPU Cycles
Talk at TU Berlin, 15 mar. 2015
Data volume is reduced by 74%, energy
consumption by 71%, while accuracy is always
above 89% and 83% with filtering enabled
Overhead Comparison (2)
Demetris Trihinas 32
Network Traffic Energy Consumption
Error (MAPE)
FAST’s aggressiveness results in slightly lower
energy consumption and network traffic
But, this does not come for free…
FAST features a large inaccuracy
AdaM achieves a balance between
efficiency and accuracy
Talk at TU Berlin, 15 mar. 2015
Overhead Comparison (3)
Demetris Trihinas 33Talk at TU Berlin, 15 mar. 2015
Energy Consumption
In the case of heartrate monitoring where signal
analysis on AC wavelets reflected on wrist arteries is
needed, AdaM reduces energy consumption by 86%
Calorie Counting is based on human body indicators
(age, weight, height) and heartrate monitoring
Error (MAPE)
AdaM’s MAPE grows from 6.42% in heartrate monitoring to 9.07% in calorie counting
in contrast to FAST with 13.61% and 21.83% respectively
Overhead Comparison (4)
Demetris Trihinas 34Talk at TU Berlin, 15 mar. 2015
+ AdaM
3 - 5 days
7-8 days
Scalability Evaluation (1)
• Integrated AdaM to a data streaming
system
• JCatascopia Cloud Monitoring System
• JCatascopia Agents (data sources) use AdaM
to adapt monitoring intensity
• Archiving time is measured at JCatascopia
Server to evaluate data velocity
• Compare AdaM over Periodic Sampling
Demetris Trihinas 35
https://github.com/dtrihinas/JCatascopia
DB
Agent
Agent
Agent
.
.
.
JCatascopia
AdaM
AdaM
metrics
Data velocity is reduced
to a linear growth
TALK AT TU BERLIN, 15 MAR. 2015
JCatascopia: Monitoring Elastically Adaptive Applications in the Cloud. Trihinas, D.; Pallis, G.; and Dikaiakos, M. D. In the 14th IEEE/ACM
International Symposium on Cluster, Cloud and Grid Computing (CCGRID 2014), pages 226-235, May, Chicago, IL, USA, 2014.
AdaM
Agent
Scalability Evaluation (2)
• Dublin Smart City Intelligent Transportation Service (Dublin ITS)
Demetris Trihinas 36Talk at TU Berlin, 15 mar. 2015
.
.
.
.
.
.
1000 Buses* with GPS tracking
sending updates to ITS with 16
params (e.g. busID, location,
current route delay)
*Real data from Jan. 2014
Apache Kafka queuing
service on x-large VM
(16VCPU, 16GB RAM,
100GB Disk)
Apache Spark cluster with 5 workers on large VMs
(8VCPU, 8GB RAM, 40GB Disk)
Alerts ITS operators when more than 10 buses in a
Dublin city area, per 5min window, are reporting delays
over 1 standard deviation from their weekly average
Scalability Evaluation (3)
• Spark total delay (processing + scheduling) for T=1, 5, 10 intervals and
AdaM with max imprecision γ = 0.15
Demetris Trihinas 37Talk at TU Berlin, 15 mar. 2015
After 572 buses deployment
becomes unstable
After 746 buses deployment
becomes unstable
Deployment remains stable
barely rising above threshold
at 971 buses
Deployment remains stable
but rises above threshold at
817 buses
Scalability Evaluation (4)
Demetris Trihinas 38Talk at TU Berlin, 15 mar. 2015
AdaM achieves an 83% average accuracy with >85% in all major Dublin areas
compared to 46% for T=5s and 17% for T=10s
High variety of sampling rates used throughout the hours of the day showing the
need of adaptive sampling
Conclusions
Demetris Trihinas 39
• Edge-mining on IoT devices is both resource and
energy intensive
• Big data streaming engines struggle to cope as the
volume and velocity of IoT-generated data keep
increasing
• Adapts the monitoring intensity based on current metric
evolution and variability
• Reduces processing, network traffic and energy
consumption on IoT devices and the IoT network
• Achieves a balance between efficiency and accuracy
Talk at TU Berlin, 15 mar. 2015
The AdaM Framework
Acknowledgements
Demetris Trihinas Talk at TU Berlin, 15 mar. 2015 40
co-funded by the
European Commission
Demetris Trihinas 41
Laboratory for Internet Computing
Department of Computer Science
University of Cyprus
http://linc.ucy.ac.cy
Talk at TU Berlin, 15 mar. 2015

Más contenido relacionado

Similar a Low-Cost Adaptive Monitoring Techniques for the Internet of Things

AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices
AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT DevicesAdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices
AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices
Demetris Trihinas
 
Exploiting clustering techniques
Exploiting clustering techniquesExploiting clustering techniques
Exploiting clustering techniques
mejjagiri
 

Similar a Low-Cost Adaptive Monitoring Techniques for the Internet of Things (20)

AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices
AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT DevicesAdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices
AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices
 
Low-Cost Approximate and Adaptive Monitoring Techniques for the Internet of T...
Low-Cost Approximate and Adaptive Monitoring Techniques for the Internet of T...Low-Cost Approximate and Adaptive Monitoring Techniques for the Internet of T...
Low-Cost Approximate and Adaptive Monitoring Techniques for the Internet of T...
 
Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics Big Data Value in Mobility and Logistics
Big Data Value in Mobility and Logistics
 
DTN
DTNDTN
DTN
 
Data Streaming (in a Nutshell) ... and Spark's window operations
Data Streaming (in a Nutshell) ... and Spark's window operationsData Streaming (in a Nutshell) ... and Spark's window operations
Data Streaming (in a Nutshell) ... and Spark's window operations
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
 
Introduction to Data streaming - 05/12/2014
Introduction to Data streaming - 05/12/2014Introduction to Data streaming - 05/12/2014
Introduction to Data streaming - 05/12/2014
 
2015/12/16 Participatory Urban Sensing
2015/12/16 Participatory Urban Sensing2015/12/16 Participatory Urban Sensing
2015/12/16 Participatory Urban Sensing
 
Exploiting clustering techniques
Exploiting clustering techniquesExploiting clustering techniques
Exploiting clustering techniques
 
Topic 4 Data Processing.pptx
Topic 4 Data Processing.pptxTopic 4 Data Processing.pptx
Topic 4 Data Processing.pptx
 
Incremental and Streaming Model Transformations
Incremental and Streaming Model TransformationsIncremental and Streaming Model Transformations
Incremental and Streaming Model Transformations
 
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policiesSelf-adaptive container monitoring with performance-aware Load-Shedding policies
Self-adaptive container monitoring with performance-aware Load-Shedding policies
 
5.1 mining data streams
5.1 mining data streams5.1 mining data streams
5.1 mining data streams
 
rerngvit_phd_seminar
rerngvit_phd_seminarrerngvit_phd_seminar
rerngvit_phd_seminar
 
Pipeline leak detection system
Pipeline leak detection systemPipeline leak detection system
Pipeline leak detection system
 
Seminar sib final
Seminar sib finalSeminar sib final
Seminar sib final
 
TINET_FRnOG_2008_public
TINET_FRnOG_2008_publicTINET_FRnOG_2008_public
TINET_FRnOG_2008_public
 
Shikha fdp 62_14july2017
Shikha fdp 62_14july2017Shikha fdp 62_14july2017
Shikha fdp 62_14july2017
 
Ppt on speech processing by ranbeer
Ppt on speech processing by ranbeerPpt on speech processing by ranbeer
Ppt on speech processing by ranbeer
 
Queing theory and delay analysis
Queing theory and delay analysisQueing theory and delay analysis
Queing theory and delay analysis
 

Más de Demetris Trihinas

Más de Demetris Trihinas (16)

Rapidly Testing ML-Driven Drone Applications - The FlockAI Framework
Rapidly Testing ML-Driven Drone Applications - The FlockAI FrameworkRapidly Testing ML-Driven Drone Applications - The FlockAI Framework
Rapidly Testing ML-Driven Drone Applications - The FlockAI Framework
 
Towards Energy and Carbon Footprint and Testing for AI-driven IoT Services
Towards Energy and Carbon Footprint and Testing for AI-driven IoT ServicesTowards Energy and Carbon Footprint and Testing for AI-driven IoT Services
Towards Energy and Carbon Footprint and Testing for AI-driven IoT Services
 
StreamSight: A Query-Driven Framework Extending Streaming IoT Analytics to th...
StreamSight: A Query-Driven Framework Extending Streaming IoT Analytics to th...StreamSight: A Query-Driven Framework Extending Streaming IoT Analytics to th...
StreamSight: A Query-Driven Framework Extending Streaming IoT Analytics to th...
 
Composable Energy Modeling for ML-Driven Drone Applications
Composable Energy Modeling for ML-Driven Drone ApplicationsComposable Energy Modeling for ML-Driven Drone Applications
Composable Energy Modeling for ML-Driven Drone Applications
 
Low-Cost Approximate and Adaptive Techniques for the Internet of Things
Low-Cost Approximate and Adaptive Techniques for the Internet of ThingsLow-Cost Approximate and Adaptive Techniques for the Internet of Things
Low-Cost Approximate and Adaptive Techniques for the Internet of Things
 
Telling a Story – or Even Propaganda – Through Data Visualization
Telling a Story – or Even Propaganda – Through Data VisualizationTelling a Story – or Even Propaganda – Through Data Visualization
Telling a Story – or Even Propaganda – Through Data Visualization
 
StreamSight - Query-Driven Descriptive Analytics for IoT and Edge Computing
StreamSight - Query-Driven Descriptive Analytics for IoT and Edge ComputingStreamSight - Query-Driven Descriptive Analytics for IoT and Edge Computing
StreamSight - Query-Driven Descriptive Analytics for IoT and Edge Computing
 
Machine Learning Introduction
Machine Learning IntroductionMachine Learning Introduction
Machine Learning Introduction
 
Απεικόνιση και Αλληλεπίδραση Δεδομένων Μεγάλου Όγκου με Διαδραστικούς Χάρτες
Απεικόνιση και Αλληλεπίδραση Δεδομένων Μεγάλου Όγκου με Διαδραστικούς ΧάρτεςΑπεικόνιση και Αλληλεπίδραση Δεδομένων Μεγάλου Όγκου με Διαδραστικούς Χάρτες
Απεικόνιση και Αλληλεπίδραση Δεδομένων Μεγάλου Όγκου με Διαδραστικούς Χάρτες
 
The Data Science Process: From Mining Raw Data to Story Visualization
The Data Science Process: From Mining Raw Data to Story VisualizationThe Data Science Process: From Mining Raw Data to Story Visualization
The Data Science Process: From Mining Raw Data to Story Visualization
 
From Mining Raw Data to Story Visualization
From Mining Raw Data to Story VisualizationFrom Mining Raw Data to Story Visualization
From Mining Raw Data to Story Visualization
 
Designing Scalable and Secure Microservices by Embracing DevOps-as-a-Service ...
Designing Scalable and Secure Microservices by Embracing DevOps-as-a-Service ...Designing Scalable and Secure Microservices by Embracing DevOps-as-a-Service ...
Designing Scalable and Secure Microservices by Embracing DevOps-as-a-Service ...
 
Find A Project
Find A ProjectFind A Project
Find A Project
 
Cloud Elasticity and the CELAR Project
Cloud Elasticity and the CELAR ProjectCloud Elasticity and the CELAR Project
Cloud Elasticity and the CELAR Project
 
[ccgrid2014] JCatascopia: Monitoring Elastically Adaptive Applications in the...
[ccgrid2014] JCatascopia: Monitoring Elastically Adaptive Applications in the...[ccgrid2014] JCatascopia: Monitoring Elastically Adaptive Applications in the...
[ccgrid2014] JCatascopia: Monitoring Elastically Adaptive Applications in the...
 
[SummerSoc 2014] Monitoring Elastic Cloud Services
[SummerSoc 2014] Monitoring Elastic Cloud Services[SummerSoc 2014] Monitoring Elastic Cloud Services
[SummerSoc 2014] Monitoring Elastic Cloud Services
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Último (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Low-Cost Adaptive Monitoring Techniques for the Internet of Things

  • 1. Low-Cost Adaptive Monitoring Techniques for the Internet of Things The AdaM Framework Demetris Trihinas University of Cyprus trihinas@cs.ucy.ac.cy Talk at TU Berlin, 15 mar. 2015
  • 2. This talk is based on… AdaM: an Adaptive Monitoring Framework for Sampling and Filtering on IoT Devices, D. Trihinas and G. Pallis and M. D. Dikaiakos, 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA Pages: 717–726, 2015. Low-Cost Adaptive Monitoring Techniques for the Internet of Things, D. Trihinas and G. Pallis and M. D. Dikaiakos, Transactions on Big Data, 2016, (In Review). Demetris Trihinas Talk at TU Berlin, 15 mar. 2015 2
  • 3. Demetris Trihinas 3 IoT was initially devices sensing and exchanging data streams with humans or other network-enabled devices Init Sense Transmit Sleep Talk at TU Berlin, 15 mar. 2015
  • 4. Edge-Mining Demetris Trihinas 4 A term coined to reflect data processing and decision-making on “smart” devices that sit at the edge of IoT networks Buy more detergent …our devices just got a little bit more “smarter”… Init Sense Transmit Sleep Receive Actuation Talk at TU Berlin, 15 mar. 2015
  • 5. The “Big Data” in IoT Demetris Trihinas 5 [IDC, Big Data in IoT, 2014] [Cisco, IBSG, Apr 2011] Talk at TU Berlin, 15 mar. 2015 IoT Monitoring Data 2% of digital universe in 2012 Projections for >10% in 2020 [Gartner, 2015]
  • 6. Big Data Challenges Transitioning to IoT Demetris Trihinas 6 • Taming data volume and data velocity with limited processing and network capabilities • IoT devices are usually battery-powered which means intense processing results in increased energy consumption (less battery-life) Zhang et al., Usenix HotCloud, 2015 Talk at TU Berlin, 15 mar. 2015
  • 7. Low-Cost Adaptive Monitoring Techniques Adaptive Sampling and Filtering Demetris Trihinas 7Talk at TU Berlin, 15 mar. 2015
  • 8. Metric Stream Demetris Trihinas 8 • A metric stream 𝑀 = {𝑠𝑖}𝑖=0 𝑛 published from a monitoring source is a large sequence of samples, denoted as 𝑠𝑖 , where 𝑖 = 0, 1, … , 𝑛 and 𝑛 → ∞ • Each sample 𝑠𝑖 is a tuple 𝑡𝑖, 𝑣𝑖 described by a timestamp 𝑡𝑖 and a value 𝑣𝑖 𝑠𝑖 𝑠𝑖+1 Metric Stream M Talk at TU Berlin, 15 mar. 2015
  • 9. Periodic Sampling Demetris Trihinas 9 • The process of triggering the collection mechanism of a monitored source every 𝑇 time units such that the 𝑖 𝑡ℎ sample is collected at time 𝑡𝑖 = 𝑖 ∙ 𝑇 𝑠𝑖 𝑠𝑖+1 Metric Stream M sampled every T = 1s Compute resources and energy are wasted while generating large data volumes at a high velocity Metric Stream M sampled every T = 10s Sudden events and significant insights are missed Talk at TU Berlin, 15 mar. 2015
  • 10. Adaptive Sampling Demetris Trihinas 10 • Dynamically adjust the sampling period 𝑇𝑖 based on some function 𝜌(𝑀), containing information of the metric stream evolution 𝑠𝑖 𝑠𝑖+1 𝑇𝑖+1 Metric Stream 𝑀′ dynamically sampledMetric Stream 𝑀 with 𝑇 = 1𝑠 • Find the max 𝑇 ∈ [𝑇 𝑚𝑖𝑛, 𝑇 𝑚𝑎𝑥] to collect sample 𝑠𝑖+1 based on an estimation of the evolution of the metric stream 𝜌(𝑀), such that 𝑀′ differs from 𝑀 less than a user-defined imprecision value 𝛾 (𝑑𝑖𝑠𝑡 < 𝛾) Talk at TU Berlin, 15 mar. 2015
  • 11. Metric Filtering Demetris Trihinas 11 • The process of suppressing metric value dissemination when consecutive values do not “change” (e.g. differ less than a range of values) • Goal: Reduce data volume and network overhead in favour of exact precision • How much “change” is required, depends on the type of filter applied • The receiver-side (e.g., base station, monitoring server) assumes that the values of any unreported metrics remain unchanged Talk at TU Berlin, 15 mar. 2015
  • 12. Metric Filtering • Fixed Range Metric Filters Demetris Trihinas Talk at TU Berlin, 15 mar. 2015 12 Metric Stream M with fixed filter range 𝑅 = 1% if (curValue ∈ [prevValue – R, prevValue + R ]) filter(curValue) No samples are filtered Can we know this R beforehand? Should it keep the same value forever? Is it the same for each and every metric?
  • 13. Adaptive Filtering Demetris Trihinas 13 • Dynamically adjusting the filter range 𝑅 based on the current variability of the metric stream, denoted as 𝑞(𝑀) Metric Stream 𝑀′ with adaptive RangeMetric Stream 𝑀 𝑠𝑖 𝑠𝑖+1 𝑅𝑖+1 𝑅𝑖 • Find the max filter range 𝑅𝑖+1 ∈ [𝑅 𝑚𝑖𝑛, 𝑅 𝑚𝑎𝑥] for sample 𝑠𝑖+1 such that 𝑀′ differs from 𝑀 less than a user-defined imprecision value 𝛾 based on the variability of the metric stream 𝑞(𝑀) Talk at TU Berlin, 15 mar. 2015
  • 14. The Adaptive Monitoring Framework Demetris Trihinas 14 • Software library with no external dependencies embeddable on IoT devices • Reduces processing, network traffic and energy consumption by adapting the monitoring intensity based on the metric stream evolution and variability • Achieves a balance between efficiency and accuracy It’s open-source! Give it a try! https://github.com/dtrihinas/AdaM Raspberry Pi Arduino Beacons Talk at TU Berlin, 15 mar. 2015 Sensing Unit Processing Unit Dissemination Unit Adaptive Sampling Knowledge Base AdaM IoT Device 𝑠𝑖 (𝑇𝑖+1, 𝑐𝑖) (𝑅𝑖+1, 𝐹𝑖) 𝑠𝑖 Adaptive Filtering A P I 𝑠𝑖 The AdaM Framework
  • 15. AdaM’s Algorithms Adaptive Sampling & Filtering Demetris Trihinas 15Talk at TU Berlin, 15 mar. 2015
  • 16. Adaptive Sampling Algorithm • Step 1: Compute the distance 𝛿𝑖 between the current two consecutive sample values Demetris Trihinas 16 Why use the distance instead of the current value? 𝛿𝑖 = |𝑣𝑖 − 𝑣𝑖−1| Threshold-base techniques increase sampling rate while sample values approach a user-defined threshold • Good for anomaly detection (e.g. DDoS attacks) • But, what about events away from threshold? (e.g. low rate DDoS attacks) [Meng, Trans on Computers, 2013] Talk at TU Berlin, 15 mar. 2015
  • 17. Adaptive Sampling Algorithm • Step 2: Compute metric stream evolution based on a Probabilistic Exponential Weighted Moving Average (PEWMA) to estimate 𝛿𝑖+1 and the standard deviation 𝜎𝑖+1: 𝛿𝑖+1 = 𝜇𝑖 = 𝑎 ∙ 𝜇𝑖−1 + 1 − 𝑎 𝛿𝑖 Demetris Trihinas 17 Looks like an exponential moving average, right? But weighting is probabilistically applied! 𝑎 = 𝛼 1 − 𝛽P𝑖 Moving standard deviation with only previous value knowledge Talk at TU Berlin, 15 mar. 2015
  • 18. Why use a Probabilistic EWMA? Simple EWMA is volatile to abrupt transient changes • Slow to acknowledge spike after “stable” periods • If “stable” phases follow sudden spikes, then subsequent values are overestimated Demetris Trihinas 18 Sounds a lot like a job for a Gaussian distribution! Talk at TU Berlin, 15 mar. 2015
  • 19. Adaptive Sampling Algorithm • Step 3: Compute actual standard deviation 𝜎𝑖 • Step 4: Compute the current confidence (𝑐𝑖 ≤ 1) of our approach based on the actual and estimated standard deviation 𝑐𝑖 = 1 − | 𝜎𝑖 − 𝜎𝑖| 𝜎𝑖 • Step 5: Compute sampling period 𝑇𝑖+1 based on the current confidence and the user-defined imprecision γ (e.g. 10% tolerance to errors) Demetris Trihinas 19 … the more “confident” the algorithm is, the larger the outputted sampling period 𝑇𝑖+1 can be… O(1) complexity as all steps only use their previous values! Talk at TU Berlin, 15 mar. 2015 λ is an aggressiveness multiplier (default λ=1)
  • 20. Adaptive Filtering Algorithm • Step 1: Compute the current variability of the metric stream using a moving Fano Factor 𝐹𝑖 based on exponentially weighted average 𝜇𝑖 and standard deviation 𝜎𝑖 already computed from adaptive sampling 𝐹𝑖 = 𝜎𝑖 2 𝜇𝑖 𝑅𝑖 = 𝑅𝑖−1 ± 0.01 ∙ 𝑅𝑖−1 Even for a 1% adjustment critical values can be filtered out in biosignal monitoring Demetris Trihinas 20 …a low 𝐹𝑖 (due to 𝜎𝑖) indicates a currently in-dispersed data stream which means low variability in the metric stream… Talk at TU Berlin, 15 mar. 2015 Why use variability and not follow a stepwise approach?
  • 21. Adaptive Filtering Algorithm • Step 2: Compute the new filter range 𝑅𝑖+1 based on 𝐹𝑖 and the user- defined imprecision 𝛾 Demetris Trihinas 21 O(1) complexity as all steps only use their previous values! 𝑠𝑖 𝑠𝑖+1 𝑅𝑖+1 𝑠𝑖+2 𝑠𝑖+3 𝐹𝑖𝑙𝑡𝑒𝑟𝑒𝑑! 𝑅𝑖+2 𝑅𝑖 𝐹 < 𝛾 Talk at TU Berlin, 15 mar. 2015 𝐹 > 𝛾 𝐹𝑘−1 > 𝐹𝑘 𝑠 𝑘
  • 22. Evaluation Demetris Trihinas 22Talk at TU Berlin, 15 mar. 2015
  • 23. AdaM vs State-of-the-Art • i-EWMA [1]: an EWMA adapting sampling period by 1 time unit when the estimated error (ε) is under/over user-defined imprecision • L-SIP [2]: a linear algorithm using a double EWMA to produce estimates of current data distribution based on rate values change • Slow to react to highly transient and abrupt fluctuations in the metric stream • FAST [3]: an (aggressive) framework using a PID controller to compute (large) sampling periods accompanied by a Kalman Filter to predict values at non sampling points Demetris Trihinas 23 [1] Trihinas et al., CCGrid, 2014 [2] Gaura et al., IEEE Sensors Journal, 2013 [3] Fan et al., ACM CIKM, 2012 Talk at TU Berlin, 15 mar. 2015
  • 24. Datasets Demetris Trihinas 24 Memory Trace Java Sorting Benchmark CPU Trace Carnegie Mellon RainMon Project Disk I/O Trace Carnegie Mellon RainMon Project TCP Port Monitoring Trace Cyber Defence SANS Tech Institute Step Trace Fitbit Charge HR Wearable Talk at TU Berlin, 15 mar. 2015 Heart Trace Fitbit Charge HR Wearable
  • 25. Our Small “Big Data” Testbeds Demetris Trihinas 25 Raspberry Pi 1st Gen. Model B 512MB RAM, Single Core ARM 700MHz Daemon on OS emulates traces while feeding samples to each algorithm Android Emulator + SensorSimulator 128MB RAM, Single Core ARM 32MHz SensorSimulator script emulates traces by feeding samples to Android Wear emulator for Step and Heartrate processing Daemon Talk at TU Berlin, 15 mar. 2015
  • 26. Evaluation Metrics • Estimation Accuracy – Mean Absolute Percentage Error (MAPE) • CPU Cycles • Outgoing Network Traffic • Edge Device Energy Consumption* Demetris Trihinas 26 *Xiao et al., ACM DAC, 2010 **Brooks, ACM SIGARCH, 2000 perf nethogs Powerstat and Other than error evaluation no other study goes through an overhead study! Talk at TU Berlin, 15 mar. 2015 Wattch**
  • 27. Error Comparison Demetris Trihinas 27 Memory Trace CPU Trace Disk I/O Trace TCP Port Monitoring Trace Fitbit Step Trace For all traces AdaM has the smallest inaccuracy (<10%) Even in a more aggressive configuration (λ=2) AdaM is still comparable to L-SIP AdaM AdaM AdaM AdaM AdaM User-defined accepted imprecision set to γ = 0.1 Talk at TU Berlin, 15 mar. 2015 Fitbit Heartrate Trace AdaM
  • 28. AdaM γ-Parameter Evaluation For a wide range of γ-parameter values we compute AdaM’s MAPE per trace Demetris Trihinas 28 Even for extreme imprecision values (>0.3) AdaM can still take correct decisions signifying the importance of the confidence metric Talk at TU Berlin, 15 mar. 2015
  • 29. So How Well Does AdaM Perform? Demetris Trihinas 29 Memory Trace Java Sorting Benchmark CPU Trace Carnegie Mellon RainMon Project Disk I/O Trace Carnegie Mellon RainMon Project TCP Port Monitoring Trace Cyber Defence SANS Tech Institute Step Trace Fitbit Charge HR Wearable Talk at TU Berlin, 15 mar. 2015 Heartrate Trace Fitbit Charge HR Wearable
  • 30. Overhead Comparison (1) Demetris Trihinas 30 Network Traffic Energy Consumption CPU Cycles Filtering adds an overhead <2% to AdaM but benefits network traffic reduction by at least 32% and energy consumption by 12% Talk at TU Berlin, 15 mar. 2015
  • 31. Overhead Comparison (2) Demetris Trihinas 31 Network Traffic Energy Consumption CPU Cycles Talk at TU Berlin, 15 mar. 2015 Data volume is reduced by 74%, energy consumption by 71%, while accuracy is always above 89% and 83% with filtering enabled
  • 32. Overhead Comparison (2) Demetris Trihinas 32 Network Traffic Energy Consumption Error (MAPE) FAST’s aggressiveness results in slightly lower energy consumption and network traffic But, this does not come for free… FAST features a large inaccuracy AdaM achieves a balance between efficiency and accuracy Talk at TU Berlin, 15 mar. 2015
  • 33. Overhead Comparison (3) Demetris Trihinas 33Talk at TU Berlin, 15 mar. 2015 Energy Consumption In the case of heartrate monitoring where signal analysis on AC wavelets reflected on wrist arteries is needed, AdaM reduces energy consumption by 86% Calorie Counting is based on human body indicators (age, weight, height) and heartrate monitoring Error (MAPE) AdaM’s MAPE grows from 6.42% in heartrate monitoring to 9.07% in calorie counting in contrast to FAST with 13.61% and 21.83% respectively
  • 34. Overhead Comparison (4) Demetris Trihinas 34Talk at TU Berlin, 15 mar. 2015 + AdaM 3 - 5 days 7-8 days
  • 35. Scalability Evaluation (1) • Integrated AdaM to a data streaming system • JCatascopia Cloud Monitoring System • JCatascopia Agents (data sources) use AdaM to adapt monitoring intensity • Archiving time is measured at JCatascopia Server to evaluate data velocity • Compare AdaM over Periodic Sampling Demetris Trihinas 35 https://github.com/dtrihinas/JCatascopia DB Agent Agent Agent . . . JCatascopia AdaM AdaM metrics Data velocity is reduced to a linear growth TALK AT TU BERLIN, 15 MAR. 2015 JCatascopia: Monitoring Elastically Adaptive Applications in the Cloud. Trihinas, D.; Pallis, G.; and Dikaiakos, M. D. In the 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID 2014), pages 226-235, May, Chicago, IL, USA, 2014. AdaM Agent
  • 36. Scalability Evaluation (2) • Dublin Smart City Intelligent Transportation Service (Dublin ITS) Demetris Trihinas 36Talk at TU Berlin, 15 mar. 2015 . . . . . . 1000 Buses* with GPS tracking sending updates to ITS with 16 params (e.g. busID, location, current route delay) *Real data from Jan. 2014 Apache Kafka queuing service on x-large VM (16VCPU, 16GB RAM, 100GB Disk) Apache Spark cluster with 5 workers on large VMs (8VCPU, 8GB RAM, 40GB Disk) Alerts ITS operators when more than 10 buses in a Dublin city area, per 5min window, are reporting delays over 1 standard deviation from their weekly average
  • 37. Scalability Evaluation (3) • Spark total delay (processing + scheduling) for T=1, 5, 10 intervals and AdaM with max imprecision γ = 0.15 Demetris Trihinas 37Talk at TU Berlin, 15 mar. 2015 After 572 buses deployment becomes unstable After 746 buses deployment becomes unstable Deployment remains stable barely rising above threshold at 971 buses Deployment remains stable but rises above threshold at 817 buses
  • 38. Scalability Evaluation (4) Demetris Trihinas 38Talk at TU Berlin, 15 mar. 2015 AdaM achieves an 83% average accuracy with >85% in all major Dublin areas compared to 46% for T=5s and 17% for T=10s High variety of sampling rates used throughout the hours of the day showing the need of adaptive sampling
  • 39. Conclusions Demetris Trihinas 39 • Edge-mining on IoT devices is both resource and energy intensive • Big data streaming engines struggle to cope as the volume and velocity of IoT-generated data keep increasing • Adapts the monitoring intensity based on current metric evolution and variability • Reduces processing, network traffic and energy consumption on IoT devices and the IoT network • Achieves a balance between efficiency and accuracy Talk at TU Berlin, 15 mar. 2015 The AdaM Framework
  • 40. Acknowledgements Demetris Trihinas Talk at TU Berlin, 15 mar. 2015 40 co-funded by the European Commission
  • 41. Demetris Trihinas 41 Laboratory for Internet Computing Department of Computer Science University of Cyprus http://linc.ucy.ac.cy Talk at TU Berlin, 15 mar. 2015

Notas del editor

  1. 45million wearables sold in us just for 2015 holiday season
  2. Why gaussian? If an expected value is served next then we get a high probability returned. If not, then we get a smaller value. So for spikes that do not preserve we get a small value, we account for it in the current estimation but its effect dissolves quicker. If this spike turns out to be a shift in the evolution and not a spike then the bell form of the gauss model will shift as well and provide higher value output
  3. Skip this and go to its testbed **Archiving time = time to process and store received metrics