SlideShare una empresa de Scribd logo
1 de 67
1
CORRELATION ANALYSIS
ON LIVE DATA STREAMS
ARUN KEJARIWAL, DHRUV CHOUDHARY, FRANCOIS ORSINI
2
STREAMING
Serve data at rest
LIVE
Serve data as it is being generated
STREAMING vs. LIVE
MOBILE DATA TRAFFIC
17% of total IP traffic by 2021
VR/AR Traffic
CAGR of 82% from 2016-2021
ANNUAL GLOBAL IP TRAFFIC
3.3 ZB by 2021
BROADBAND SPEEDS
Reach 53 Mbps by 2021
INTERNET VIDEO SURVEILLANCE
3.4% of all Internet video traffic
by 2021
LIVE INTERNET VIDEO
13% of Internet video traffic
by 2021
THE NUMBERS
3
https://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/complete-white-paper-c11-481360.html
4
DATA FUSION
CHARACTERISTICS
DISTRIBUTED
HETEROGENEOUS
NON-LINEAR
INTERNET OF THINGS
Sensors and Actuators
SMART CITIES
SMART HEALTH CARE
POLLUTON MONITORING
INSIGHTS
PROBABILISTIC METHODS
THEORY OF EVIDENCE
MACHINE LEARNING
FUZZY LOGIC
CHALLENGES
DATA: INCONSISTENCY
MISSINGNESS, NOISE
LATENCY
BATTERY CONSUMPTION
INTELLIGENT DECISION MAKING
5
TRAFFIC SURVEILLANCE
CONGESTION DETECTION
ACCIDENT DETECTION
HEALTH EMERGENCY
RISK ALERT
RAIN WATER CLOGGING, HYDROPLANING
MUDSLIDE
AN EXAMPLE APPLICATION OF DATA FUSION
CORRELATION ANALYSIS
6
CORRELATION ANALYSIS
A LONG HISTORY!
7
EARLY FLAVOR
DATES BACK to 1885!
8
APPLICATION DOMAINS
9
BIOLOGY ANTHROPLOGY AGRICULTURE PHILOSOPHY PSYCHOLOGY
FINANCE ECONOMETRICS STATISTICS NETWORKING OPERATIONS
Common representation learning
Example: Across audio and video modalities
in a single live stream
Multimodal
Linear/non-linear correlation
Example: Across univariate time series of
operations data
Unimodal
CORRELATION ANALYSIS
FLAVORS: AT A HIGH LEVEL
10
CROSS-MODAL ANALYSIS
Joint Representation of Text-
Audio-Video
INTERPRETABILITY
Comparative analysis of deep
representations
AUTHENTICITY
Live streams vs. Recording
OBJECT IDENTIFICATION
Multiple cameras
MULTI-STREAM
SYNCHRONIZATION
Different vantage points
Unreliable timestamps
LOCALIZATION
Video ⟷ Audio
Audio ⟷ Text
CORRELATION ANALYSIS
WHY BOTHER?
11
Pearson
Spearman 𝜌
Kendall 𝜏
Goodman and Kruskal
COMMON TYPES OF CORRELATION
12
Ties are dropped
CORRELATION ANALYSIS
A REAL LIFE EXAMPLE
13
Root cause analysis
Expose investment avenues
Surface optimization opportunities
Risk minimization
Medical diagnosis
Learning
MULTIPLE TIME SERIES
14
15
Symmetric
Lack of context
Spurious correlations
Non-actionable
Network topology
CORRELATION MATRIX
Scalability
Hundreds of millions of time series
Use cases
Multiple Regression
Discriminant Analysis
Mahalanobis Distance
CORRELATION MATRIX
16
* Figure borrowed from [Mueen et al. 2010]
*
Thresholded(=0.5)CorrelationMatrix
350x350CorrelationMatrix
MULTIPLE CORRELATION
1957
MULTIVARIATE CASE
17
Proportion of variance of xj that cannot be
explained by other independent variables
Depends on the choice of
the dependent variable
Correlation
Matrix
2-D case
18
Time
Varying
ROLLING CORRELATION
CONSTANTLY EVOLVING DATA
19
STOCHASTICITY
20
Why bother? Multiple flavors
If X, Y are independent, then 𝜌(X, Y) = 0
However, the converse is not true, as only
the first two moments are considered
e.g., 𝜌(X, Y) = 0 even when Y = X2
Not invariant under nonlinear strictly
increasing transformations
Pearson’s correlation only recognizes linear
dependence
var(X) and var(Y) have to be finite
Non-stationary random processes
Stochastic correlation process
Applications
Finance (Brownian motion), biology
Time-varying correlation
Rolling correlation - lagged indicator
Dynamic Conditional Correlation [Engle’02]
Local Correlation [Langnau’09]
Wishart autoregressive process [Gourieroux’09]
Transformations [van Emmerich’06]
arctan and
Modified Jacobi process [Ma’09]
tanh transformation [Teng’16]
SPURIOUS CORRELATIONS
SPURIOUS CORRELATION
* Figure borrowed from [Anscombe 1973]
*
WHY CONTEXT IS IMPORTANT?
22
Identical r=0.816
Clearly spurious
Linear correlation is perhaps not the right metic
Identical summary statistics
*
ROBUSTNESS
r =0.8 r =0.2
r is highly sensitive to slight change,as measured
by Kolmogorov distance, in one of the marginal
distributions
Bivariate Normal
Distribution
Contaminated Normal
Distribution
PEARSON CORRELATION
23
* Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand. R. Wilcox
Influence Function (IF) of Pearson’s Correlation
Unbounded
Pearson’s correlation does not have infinitesimal
robustness
x x
y y
zz
Recall, first order approximation of sample IF of r is:
r-i : correlation coefficient based on all but the ith
observation
24
ROBUSTNESS
*
* Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand. R. Wilcox
Sensitivity to anomalies
Linear relationship between x & y but r = -0.21
PEARSON CORRELATION
Quadrant (signum) correlation coefficient#
# [Blomqvist, 1950] ^[Pasman and Shevlyakov, 1987]
Correlation median estimator^
ROBUSTNESS
PEARSON CORRELATION
25
Based on Robust Principal Variables
Alternatives
ROBUSTNESS
PEARSON CORRELATION
26
Percentage Bend Correlation
27
SPEED-ACCURACY TRADE-OFF
Live Data
STREAMING CORRELATION
Incremental, One pass
CHARACTERISTICS
29
Other correlation measures are not amenable to incremental
computation
Applications
Security
Correlation power analysis
30
APPROXIMATION ALGORITHMS
STREAMING CORRELATION
Wide Spectrum of Approaches
Sliding Windows, Damped Windows
Reduction
Smoothing
Down sampling
DFT [Agrawal et al. 1993, Zhu and Shasha 2002,
Qiu et al. 2018]
DWT [Chan and Fu 1999, Popivanov & Miller 2002]
PCA, SVD
PAA [Faloutsos and Yi 2000]
APAC [Chakrabarti et al. 2001]
Random projections [Grellmann et al. 2016]
LSH [Sundaram et al. 2013]
DATA SKETCHES
31
Bursts
Page views, Clicks, Retweets
Correlated burstiness (e.g., data center operations)
Root-cause analysis
STREAMING CORRELATION
* Figure borrowed from [Sakurai et al. 2005]
*
Lag between time series
Lagged correlation/Cross-correlation
ACTIONABLE INSIGHTS
32
[Zhu and Shasha 2002, Levine et al. 2016, Wu et al. 2017]
[Vlachos et al. 2008, Kotov et al. 2011, Shafer et al. 2012, Kusmierczyk and Norvag 2015]
33
Motion
Inertia
Motion
Occlusion
Velocity
Estimation
Motion-Outlier
Detection
Blur
Removal
VIDEO CORRELATION
TEMPORAL COHERENCE
PREDICTION MOTION
1952
“… prediction motion is the continuation of tracking motion
after a target disappears from view.”
1955
1962
Unlike duration of target presentation, target speed
exerts an influence on prediction accuracy.
EXPLORED >5 DECADES BACK!
34
PREDICTION MOTION
35
Human Motion/Robotics
[Bütepage et al. 2017, Martinez et a al. 2017, Byravan & Fox]
Traffic Prediction
[Hermes et al. 2010, Walker et al. 2014, Yu et al. 2017]
360-Degree Video (AR/VR)
[Bao et al. 2017, Vishwanath et al., 2017]
Potpourri: Deep Learning Based Approaches
[Oh et al. 2015, Mathieu et al. 2016, Liang et al. 2017]
AUDIO-VIDEO CORRELATION
A LONG HISTORY: EXPERIMENTAL PSYCHOLOGY -> DEEP LEARNING
36
AUDIO-VIDEO CORRELATION
1952
People utilize visual and postural experiences in perceiving the position
of an object in the field, of the whole field.
1897
Localization of sounds varied, being different when the source of sound was in sight from what it
was when this was out of sight, and also in the latter case differing with different directions of
attention, or with different suggestions as to the direction from which the sound came.
1941
SOUND LOCALIZATION: AN APPLICATION
37
AUDIO-VIDEO CORRELATION
1977
1960
1976
LIP READING: AN APPLICATION
38
Demonstrated the influence of vision on speech perception
McGurk and MacDonald
Established the relationship of the visually perceived
symbols to the underlying linguistic system
Use visual information as an aid when white noise made speech difficult to hear
1954
There’s a great opportunity for the visual contribution
at low speech-to-noise ratios
AUDIO-VIDEO CORRELATION
2007
2004
Combined acoustic and visual feature vectors to distinguish live synchronous audio-video
recordings from replay attacks that use audio with a still photo.
LIVENESS AND SYNCHRONIZATION: AN APPLICATION
39
Extracted the correlated components of audio and lip features
based on Canonical Correlation Analysis.
2009
Showed that there exists a relationship between perception of video presented
in screen and accompanying audio signals, both stereo and spatial
40
AUDIO-VIDEO-TEXT
Speaker identification in multi-speaker scenarios
[Huang & Kingsbury 2013, Chung & Zisserman 2016, Torfi et al. 2017]
Cross-Modal Correlation Learning in Audio and Lyrics
[Yu et al. 2017, Tang et al. 2017]
Speech enhancement
[Xu et al. 2014, Hou et al. 2017, Kolbæk et al. 2017]
Action Recognition and Video Highlight Detection
[Wu et al. 2013, Sun et al. 2013, Takahashi et al. 2017]
DEEP LEARNING WAY
Emotion Recognition
[Tzirakis et al. 2017, Pini et al. 2017]
41
OVERVIEW
Satori is the only live data platform that enables immediate integration, interaction, correlation, and intelligent response
at high throughput and ultra-low latency.
SATORI
A UNIFIED LIVE DATA PLATFORM
42
SATORI LIVE DATA SERVICES
43
SATORI LIVE DATA LIFECYCLE
44
OPEN PROBLEMS
45
Online anomaly detection: speed-accuracy trade-off
On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data*
by Choudhary et al. 2017
Breakouts/Changepoints
Skew in location of anomalies
SUSCEPTIBILITY TO ANOMALIES
* https://arxiv.org/pdf/1710.04735.pdf
46
HETEROSCEDASTICITY
Methods
(Adjusted) Percentile Bootstrap [Wilcox 1996]
Nested Bootstrap [DiCiccio et al. 1992]
Challenge: Detecting non-linear correlation
2001
47
Potential sources: Event based monitoring via sensors, Occlusion in a video
Techniques: Resampling
Lomb-Scargle Fourier Transform [Andersson 2007, Rehfeld et al. 2011]
Kernel - such as Laplacian, Gaussian - based methods
IRREGULARLY SPACED TIME SERIES
TIME VARYING CORRELATION
48
* Figure borrowed from [Fu et al. 2013]
TVCC: Time Varying Correlation Coefficient
fMRI time-courses of 4 Regions of Interests (ROI)
✦ Time varying joint distribution
✦ Parameter non-constancy/Instability
F Test [Chow’60]
SupF Test [Quandt’60]
Lagrange Multiplier Test
[Nabeya and Tanaka’88, Nylom’89]
✦ Co-integrated processes
I(1) [Hansen’90]
✦ Stochastic vs. Deterministic
✦ Dynamic Conditional Correlation (DCC)
[Engle’02]
*
Post stimulus period with significant difference
(p<0.05) w.r.t. pre-stimulus interval
STREAMING CORRELATION
Missing data
Packets being dropped owing to, say, unexpected
high traffic
Data collection: Every, say, 5 seconds
How to scale analysis to milli-seconds granularity?
Unequal length time series
Different sampling rates
Small samples
Bootstrapping
Low SNR (Signal to Noise Ratio)
49
NEXT FRONTIER
LEVERAGING MULTIMODAL & CONTINUOUSLY EVOLVING CORRELATION FOR SELF-LEARNING
50
SELF-LEARNING MACHINES
A LONG HISTORY!
52
SELF-LEARNING
EARLY WORK: GAMES
19501914
constructed a device which played an end game of king
and rook against king. The machine played the side with king and rook and would
force checkmate in a few moves however its human opponent played.
Gerald Tesauro
199519592002
2002
2017
Amongst the first and most famous was the
chess-playing automaton constructed in 1769
by Baron Kempelen …
1953
Alan Turing
2012
1970
GAME PLAYING
POTPOURRI
53
19962007
The game of checkers has roughly 500 billion billion
possible positions (5 × 1020)
SELF-LEARNING
2016
REINFORCEMENT LEARNING
54
2016201720172018
AlphaGo
Mastering the game of Go with deep neural networks and tree search
by Silver et al.
AlphaGo Zero
Mastering the game of Go without Human knowledge
by Silver et al.
Alpha Zero
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
by Silver et al.
Libratus
Superhuman AI for heads-up no-limit poker: Libratus beats top professionals
by Brown and Sandholm
57
Rank Correlation Methods
by Kendall and Gibbons
READINGS
Correlation and Regression
by Bobko
Correlation
by Garson
Robust Correlation: Theory and Applications
by Shevlyakov and Oja
A Mathematical Theory of Evidence
by Shafer
BOOKS
58
Fast Approximate Correlation of Massive Time-series Data
by Queen et al., 2010
READINGS
Local Correlation Detection with Linearity Enhancement in
Streaming Data
by Xie et al., 2013
Fast Distributed Correlation Discovery Over Streaming
Time-Series Data
by Guo et al., 2015
Random Projection of Fast and Efficient Multivariate
Correlation Analysis of High-Dimensional Data: A New Approach
by Grellmann et al., 2016
Detection of Highly Correlated Live Data Streams
by Alseghayer et al., 2017
STREAMING CORRELATION
59
Perception of body position and of the position of the visual
field
by Witkin, 1949
READINGS
Autocorrelation, a principle for the evaluation of sensory
information by the nervous system
by Reichardt, 1961
EARLY RESEARCH IN AUDIO-VISUAL CORRELATION
Binocular cross-correlation in time and space
by Tyler and Julesz, 1978
Neurontropy. an entropy like measure of neural correlation
by Julesz and Tyler, 1976
Cross correlation of sensory stimuli and electroencephalogram
by Morgan, 1969
60
Patch to the future: Unsupervised visual prediction
by Walker et al., 2014
READINGS
Motion-Prediction-based Multicast for 360-Degree Video
Transmissions
by Bao et al., 2017
Dual Motion GAN for Future-Flow Embedded Video
Prediction

by Liang et al., 2017
Deep representation learning for human motion prediction
and classification
by Bütepage et al., 2017
On human motion prediction using recurrent neural
networks
by Martinez, 2017
RECENT WORKS IN PREDICTION MOTION
61
On Deep Multi-View Representation Learning
by Wang et al., 2015
READINGS
Correlational Neural Networks
by Chandar et al., 2017
Common Representation Learning Using Step-based
Correlation Multi-Modal CNN

by Bhatt et al., 2017
Objects that Sound
by Arandjelović and Zisserman, 2017
Deep Correlation Feature Learning for Face Verification in
the Wild
by Deng et al., 2017
COMMON REPRESENTATION LEARNING
62
READINGS
Audiovisual Synchronization and Fusion Using Canonical
Correlation Analysis
by Sargin et al., 2007
The predictive power of trajectory motion
by Watamaniuk, 2005
Seeing motion behind occluders
by Watamaniuk and McKee, 1995
Temporal Coherence Theory For The Detection And
Measurement Of Visual Motion
by Grzywacz et al.,1995
Probabilistic Motion Estimation Based On Temporal Coherence
by Burgi et al., 2000
AUDIO-VISUAL RESEARCH
63
A New Method of Audio-Visual Correlation Analysis
by Kunka and Kostek, 2009
READINGS
Uncertainty in ontologies: Dempster–Shafer theory for data
fusion applications
by Bellenger and Gatepaille, 2011
City Data Fusion: Sensor Data Fusion in the Internet of Things
by Wang et al., 2015
Data Fusion and IoT for Smart Ubiquitous Environments: A
Survey
by Alam et al., 2017
Correlation Analysis of Audio and Video Contents: A Metadata
based Approach
by Algur et al., 2015
POTPOURRI
64
READINGS
POTPOURRI
Correlation detection as a general mechanism for
multisensory integration
by Parise and Ernst, 2016
Origin of information-limiting noise correlations
by Kanitscheidera et al., 2015
Temporal structure and complexity affect audio-visual
correspondence detection
by Denison et al., 2013
Correlation versus causation in multisensory perception
by Mitterer, Jesse, 2010
65
READINGS
MATHEMATICS
A Bayesian approach to problems in stochastic estimation
and control
by Ho and Lee, 1964
A Generalization of Bayesian Inference
by Dempster, 1968
Multidimensional Scaling
by Kruskal and Wish, 1978
66
RESOURCES
http://www.tylervigen.com/spurious-correlations
http://www.sumsar.net/blog/2013/08/robust-bayesian-
estimation-of-correlation/
http://www.sumsar.net/blog/2013/08/bayesian-estimation-of-
correlation/
JAGS: http://mcmc-jags.sourceforge.net/
BUGS: http://www.openbugs.net
https://blog.quantopian.com/bayesian-correlation-estimation/
https://jimgrange.wordpress.com/2017/11/19/bayesian-
estimation-of-partial-correlations/
67
https://www.mouser.com/applications/sensor-fusion-iot/
RESOURCES
https://cacm.acm.org/magazines/2017/11/222180-heads-up-
limit-holdem-poker-is-solved

Más contenido relacionado

La actualidad más candente

Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology Comparison
Kai Wähner
 

La actualidad más candente (20)

Kappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology ComparisonKappa vs Lambda Architectures and Technology Comparison
Kappa vs Lambda Architectures and Technology Comparison
 
Time series deep learning
Time series   deep learningTime series   deep learning
Time series deep learning
 
BDA311 Introduction to AWS Glue
BDA311 Introduction to AWS GlueBDA311 Introduction to AWS Glue
BDA311 Introduction to AWS Glue
 
Cassandra at eBay - Cassandra Summit 2012
Cassandra at eBay - Cassandra Summit 2012Cassandra at eBay - Cassandra Summit 2012
Cassandra at eBay - Cassandra Summit 2012
 
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
 
Introduction to Structured Streaming
Introduction to Structured StreamingIntroduction to Structured Streaming
Introduction to Structured Streaming
 
Monitoring Apache Kafka with Confluent Control Center
Monitoring Apache Kafka with Confluent Control Center   Monitoring Apache Kafka with Confluent Control Center
Monitoring Apache Kafka with Confluent Control Center
 
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
Whoops, The Numbers Are Wrong! Scaling Data Quality @ NetflixWhoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
Whoops, The Numbers Are Wrong! Scaling Data Quality @ Netflix
 
Elk with Openstack
Elk with OpenstackElk with Openstack
Elk with Openstack
 
밑바닥부터 시작하는딥러닝 8장
밑바닥부터 시작하는딥러닝 8장밑바닥부터 시작하는딥러닝 8장
밑바닥부터 시작하는딥러닝 8장
 
Amazon Athena, w/ benchmark against Redshift - Pop-up Loft TLV 2017
Amazon Athena, w/ benchmark against Redshift - Pop-up Loft TLV 2017Amazon Athena, w/ benchmark against Redshift - Pop-up Loft TLV 2017
Amazon Athena, w/ benchmark against Redshift - Pop-up Loft TLV 2017
 
INTRODUCTION TO NLP, RNN, LSTM, GRU
INTRODUCTION TO NLP, RNN, LSTM, GRUINTRODUCTION TO NLP, RNN, LSTM, GRU
INTRODUCTION TO NLP, RNN, LSTM, GRU
 
Apache Flink 101 - the rise of stream processing and beyond
Apache Flink 101 - the rise of stream processing and beyondApache Flink 101 - the rise of stream processing and beyond
Apache Flink 101 - the rise of stream processing and beyond
 
Anomaly detection with machine learning at scale
Anomaly detection with machine learning at scaleAnomaly detection with machine learning at scale
Anomaly detection with machine learning at scale
 
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live ImplementationAnomaly Detection - Real World Scenarios, Approaches and Live Implementation
Anomaly Detection - Real World Scenarios, Approaches and Live Implementation
 
Ray Serve: A new scalable machine learning model serving library on Ray
Ray Serve: A new scalable machine learning model serving library on RayRay Serve: A new scalable machine learning model serving library on Ray
Ray Serve: A new scalable machine learning model serving library on Ray
 
Introductions to Online Machine Learning Algorithms
Introductions to Online Machine Learning AlgorithmsIntroductions to Online Machine Learning Algorithms
Introductions to Online Machine Learning Algorithms
 
Apache kafka performance(throughput) - without data loss and guaranteeing dat...
Apache kafka performance(throughput) - without data loss and guaranteeing dat...Apache kafka performance(throughput) - without data loss and guaranteeing dat...
Apache kafka performance(throughput) - without data loss and guaranteeing dat...
 
Understanding Bagging and Boosting
Understanding Bagging and BoostingUnderstanding Bagging and Boosting
Understanding Bagging and Boosting
 
Mongo DB
Mongo DBMongo DB
Mongo DB
 

Similar a Correlation Analysis on Live Data Streams

VR2015-revision-v7-embedded
VR2015-revision-v7-embeddedVR2015-revision-v7-embedded
VR2015-revision-v7-embedded
Yun Suk Chang
 
Predicting growth of urban agglomerations through fractal analysis of geo spa...
Predicting growth of urban agglomerations through fractal analysis of geo spa...Predicting growth of urban agglomerations through fractal analysis of geo spa...
Predicting growth of urban agglomerations through fractal analysis of geo spa...
Indicus Analytics Private Limited
 

Similar a Correlation Analysis on Live Data Streams (20)

Correlation Analysis on Live Data Streams
Correlation Analysis on Live Data StreamsCorrelation Analysis on Live Data Streams
Correlation Analysis on Live Data Streams
 
IRJET- A Review on Moving Object Detection in Video Forensics
IRJET- A Review on Moving Object Detection in Video Forensics IRJET- A Review on Moving Object Detection in Video Forensics
IRJET- A Review on Moving Object Detection in Video Forensics
 
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTORCHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
CHARACTERIZING HUMAN BEHAVIOURS USING STATISTICAL MOTION DESCRIPTOR
 
HUMAN ACTION RECOGNITION IN VIDEOS USING STABLE FEATURES
HUMAN ACTION RECOGNITION IN VIDEOS USING STABLE FEATURES HUMAN ACTION RECOGNITION IN VIDEOS USING STABLE FEATURES
HUMAN ACTION RECOGNITION IN VIDEOS USING STABLE FEATURES
 
Understanding user interactivity for immersive communications and its impact ...
Understanding user interactivity for immersive communications and its impact ...Understanding user interactivity for immersive communications and its impact ...
Understanding user interactivity for immersive communications and its impact ...
 
Understanding user interactivity for immersive communications and its impact ...
Understanding user interactivity for immersive communications and its impact ...Understanding user interactivity for immersive communications and its impact ...
Understanding user interactivity for immersive communications and its impact ...
 
A New Approach for video denoising and enhancement using optical flow Estimation
A New Approach for video denoising and enhancement using optical flow EstimationA New Approach for video denoising and enhancement using optical flow Estimation
A New Approach for video denoising and enhancement using optical flow Estimation
 
Object Detection using SURF features
Object Detection using SURF featuresObject Detection using SURF features
Object Detection using SURF features
 
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
Video Analysis with Convolutional Neural Networks (Master Computer Vision Bar...
 
Raskar stanfordextremecompuimagingapr2016
Raskar stanfordextremecompuimagingapr2016Raskar stanfordextremecompuimagingapr2016
Raskar stanfordextremecompuimagingapr2016
 
Pedestrian Counting in Video Sequences based on Optical Flow Clustering
Pedestrian Counting in Video Sequences based on Optical Flow ClusteringPedestrian Counting in Video Sequences based on Optical Flow Clustering
Pedestrian Counting in Video Sequences based on Optical Flow Clustering
 
Detecting and Shadows in the HSV Color Space using Dynamic Thresholds
Detecting and Shadows in the HSV Color Space using  Dynamic Thresholds Detecting and Shadows in the HSV Color Space using  Dynamic Thresholds
Detecting and Shadows in the HSV Color Space using Dynamic Thresholds
 
Video Browsing By Direct Manipulation - Draft 1
Video Browsing By Direct Manipulation - Draft 1Video Browsing By Direct Manipulation - Draft 1
Video Browsing By Direct Manipulation - Draft 1
 
VR2015-revision-v7-embedded
VR2015-revision-v7-embeddedVR2015-revision-v7-embedded
VR2015-revision-v7-embedded
 
A Hybrid Virtual Reality Simulation System for Wave Energy Conversion
A Hybrid Virtual Reality Simulation System for Wave Energy ConversionA Hybrid Virtual Reality Simulation System for Wave Energy Conversion
A Hybrid Virtual Reality Simulation System for Wave Energy Conversion
 
HUMAN IDENTIFIER WITH MANNERISM USING DEEP LEARNING
HUMAN IDENTIFIER WITH MANNERISM USING DEEP LEARNINGHUMAN IDENTIFIER WITH MANNERISM USING DEEP LEARNING
HUMAN IDENTIFIER WITH MANNERISM USING DEEP LEARNING
 
Shot Boundary Detection using Radon Projection Method
Shot Boundary Detection using Radon Projection MethodShot Boundary Detection using Radon Projection Method
Shot Boundary Detection using Radon Projection Method
 
Predicting growth of urban agglomerations through fractal analysis of geo spa...
Predicting growth of urban agglomerations through fractal analysis of geo spa...Predicting growth of urban agglomerations through fractal analysis of geo spa...
Predicting growth of urban agglomerations through fractal analysis of geo spa...
 
50120140502009
5012014050200950120140502009
50120140502009
 
50120140504012
5012014050401250120140504012
50120140504012
 

Más de Arun Kejariwal

Anomaly Detection At The Edge
Anomaly Detection At The EdgeAnomaly Detection At The Edge
Anomaly Detection At The Edge
Arun Kejariwal
 
Serverless Streaming Architectures and Algorithms for the Enterprise
Serverless Streaming Architectures and Algorithms for the EnterpriseServerless Streaming Architectures and Algorithms for the Enterprise
Serverless Streaming Architectures and Algorithms for the Enterprise
Arun Kejariwal
 
Designing Modern Streaming Data Applications
Designing Modern Streaming Data ApplicationsDesigning Modern Streaming Data Applications
Designing Modern Streaming Data Applications
Arun Kejariwal
 
Data Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action UponData Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action Upon
Arun Kejariwal
 
Finding bad apples early: Minimizing performance impact
Finding bad apples early: Minimizing performance impactFinding bad apples early: Minimizing performance impact
Finding bad apples early: Minimizing performance impact
Arun Kejariwal
 
Days In Green (DIG): Forecasting the life of a healthy service
Days In Green (DIG): Forecasting the life of a healthy serviceDays In Green (DIG): Forecasting the life of a healthy service
Days In Green (DIG): Forecasting the life of a healthy service
Arun Kejariwal
 
Gimme More! Supporting User Growth in a Performant and Efficient Fashion
Gimme More! Supporting User Growth in a Performant and Efficient FashionGimme More! Supporting User Growth in a Performant and Efficient Fashion
Gimme More! Supporting User Growth in a Performant and Efficient Fashion
Arun Kejariwal
 
Techniques for Minimizing Cloud Footprint
Techniques for Minimizing Cloud FootprintTechniques for Minimizing Cloud Footprint
Techniques for Minimizing Cloud Footprint
Arun Kejariwal
 

Más de Arun Kejariwal (20)

Anomaly Detection At The Edge
Anomaly Detection At The EdgeAnomaly Detection At The Edge
Anomaly Detection At The Edge
 
Serverless Streaming Architectures and Algorithms for the Enterprise
Serverless Streaming Architectures and Algorithms for the EnterpriseServerless Streaming Architectures and Algorithms for the Enterprise
Serverless Streaming Architectures and Algorithms for the Enterprise
 
Sequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesSequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time Series
 
Sequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time SeriesSequence-to-Sequence Modeling for Time Series
Sequence-to-Sequence Modeling for Time Series
 
Model Serving via Pulsar Functions
Model Serving via Pulsar FunctionsModel Serving via Pulsar Functions
Model Serving via Pulsar Functions
 
Designing Modern Streaming Data Applications
Designing Modern Streaming Data ApplicationsDesigning Modern Streaming Data Applications
Designing Modern Streaming Data Applications
 
Deep Learning for Time Series Data
Deep Learning for Time Series DataDeep Learning for Time Series Data
Deep Learning for Time Series Data
 
Live Anomaly Detection
Live Anomaly DetectionLive Anomaly Detection
Live Anomaly Detection
 
Modern real-time streaming architectures
Modern real-time streaming architecturesModern real-time streaming architectures
Modern real-time streaming architectures
 
Anomaly detection in real-time data streams using Heron
Anomaly detection in real-time data streams using HeronAnomaly detection in real-time data streams using Heron
Anomaly detection in real-time data streams using Heron
 
Data Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action UponData Data Everywhere: Not An Insight to Take Action Upon
Data Data Everywhere: Not An Insight to Take Action Upon
 
Real Time Analytics: Algorithms and Systems
Real Time Analytics: Algorithms and SystemsReal Time Analytics: Algorithms and Systems
Real Time Analytics: Algorithms and Systems
 
Finding bad apples early: Minimizing performance impact
Finding bad apples early: Minimizing performance impactFinding bad apples early: Minimizing performance impact
Finding bad apples early: Minimizing performance impact
 
Velocity 2015-final
Velocity 2015-finalVelocity 2015-final
Velocity 2015-final
 
Statistical Learning Based Anomaly Detection @ Twitter
Statistical Learning Based Anomaly Detection @ TwitterStatistical Learning Based Anomaly Detection @ Twitter
Statistical Learning Based Anomaly Detection @ Twitter
 
Days In Green (DIG): Forecasting the life of a healthy service
Days In Green (DIG): Forecasting the life of a healthy serviceDays In Green (DIG): Forecasting the life of a healthy service
Days In Green (DIG): Forecasting the life of a healthy service
 
Gimme More! Supporting User Growth in a Performant and Efficient Fashion
Gimme More! Supporting User Growth in a Performant and Efficient FashionGimme More! Supporting User Growth in a Performant and Efficient Fashion
Gimme More! Supporting User Growth in a Performant and Efficient Fashion
 
A Systematic Approach to Capacity Planning in the Real World
A Systematic Approach to Capacity Planning in the Real WorldA Systematic Approach to Capacity Planning in the Real World
A Systematic Approach to Capacity Planning in the Real World
 
Isolating Events from the Fail Whale
Isolating Events from the Fail WhaleIsolating Events from the Fail Whale
Isolating Events from the Fail Whale
 
Techniques for Minimizing Cloud Footprint
Techniques for Minimizing Cloud FootprintTechniques for Minimizing Cloud Footprint
Techniques for Minimizing Cloud Footprint
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 

Correlation Analysis on Live Data Streams

  • 1. 1 CORRELATION ANALYSIS ON LIVE DATA STREAMS ARUN KEJARIWAL, DHRUV CHOUDHARY, FRANCOIS ORSINI
  • 2. 2 STREAMING Serve data at rest LIVE Serve data as it is being generated STREAMING vs. LIVE
  • 3. MOBILE DATA TRAFFIC 17% of total IP traffic by 2021 VR/AR Traffic CAGR of 82% from 2016-2021 ANNUAL GLOBAL IP TRAFFIC 3.3 ZB by 2021 BROADBAND SPEEDS Reach 53 Mbps by 2021 INTERNET VIDEO SURVEILLANCE 3.4% of all Internet video traffic by 2021 LIVE INTERNET VIDEO 13% of Internet video traffic by 2021 THE NUMBERS 3 https://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/complete-white-paper-c11-481360.html
  • 4. 4 DATA FUSION CHARACTERISTICS DISTRIBUTED HETEROGENEOUS NON-LINEAR INTERNET OF THINGS Sensors and Actuators SMART CITIES SMART HEALTH CARE POLLUTON MONITORING INSIGHTS PROBABILISTIC METHODS THEORY OF EVIDENCE MACHINE LEARNING FUZZY LOGIC CHALLENGES DATA: INCONSISTENCY MISSINGNESS, NOISE LATENCY BATTERY CONSUMPTION INTELLIGENT DECISION MAKING
  • 5. 5 TRAFFIC SURVEILLANCE CONGESTION DETECTION ACCIDENT DETECTION HEALTH EMERGENCY RISK ALERT RAIN WATER CLOGGING, HYDROPLANING MUDSLIDE AN EXAMPLE APPLICATION OF DATA FUSION
  • 9. APPLICATION DOMAINS 9 BIOLOGY ANTHROPLOGY AGRICULTURE PHILOSOPHY PSYCHOLOGY FINANCE ECONOMETRICS STATISTICS NETWORKING OPERATIONS
  • 10. Common representation learning Example: Across audio and video modalities in a single live stream Multimodal Linear/non-linear correlation Example: Across univariate time series of operations data Unimodal CORRELATION ANALYSIS FLAVORS: AT A HIGH LEVEL 10
  • 11. CROSS-MODAL ANALYSIS Joint Representation of Text- Audio-Video INTERPRETABILITY Comparative analysis of deep representations AUTHENTICITY Live streams vs. Recording OBJECT IDENTIFICATION Multiple cameras MULTI-STREAM SYNCHRONIZATION Different vantage points Unreliable timestamps LOCALIZATION Video ⟷ Audio Audio ⟷ Text CORRELATION ANALYSIS WHY BOTHER? 11
  • 12. Pearson Spearman 𝜌 Kendall 𝜏 Goodman and Kruskal COMMON TYPES OF CORRELATION 12 Ties are dropped
  • 13. CORRELATION ANALYSIS A REAL LIFE EXAMPLE 13 Root cause analysis Expose investment avenues Surface optimization opportunities Risk minimization Medical diagnosis Learning
  • 15. 15 Symmetric Lack of context Spurious correlations Non-actionable Network topology CORRELATION MATRIX Scalability Hundreds of millions of time series Use cases Multiple Regression Discriminant Analysis Mahalanobis Distance
  • 16. CORRELATION MATRIX 16 * Figure borrowed from [Mueen et al. 2010] * Thresholded(=0.5)CorrelationMatrix 350x350CorrelationMatrix
  • 17. MULTIPLE CORRELATION 1957 MULTIVARIATE CASE 17 Proportion of variance of xj that cannot be explained by other independent variables Depends on the choice of the dependent variable Correlation Matrix 2-D case
  • 20. STOCHASTICITY 20 Why bother? Multiple flavors If X, Y are independent, then 𝜌(X, Y) = 0 However, the converse is not true, as only the first two moments are considered e.g., 𝜌(X, Y) = 0 even when Y = X2 Not invariant under nonlinear strictly increasing transformations Pearson’s correlation only recognizes linear dependence var(X) and var(Y) have to be finite Non-stationary random processes Stochastic correlation process Applications Finance (Brownian motion), biology Time-varying correlation Rolling correlation - lagged indicator Dynamic Conditional Correlation [Engle’02] Local Correlation [Langnau’09] Wishart autoregressive process [Gourieroux’09] Transformations [van Emmerich’06] arctan and Modified Jacobi process [Ma’09] tanh transformation [Teng’16]
  • 22. SPURIOUS CORRELATION * Figure borrowed from [Anscombe 1973] * WHY CONTEXT IS IMPORTANT? 22 Identical r=0.816 Clearly spurious Linear correlation is perhaps not the right metic Identical summary statistics
  • 23. * ROBUSTNESS r =0.8 r =0.2 r is highly sensitive to slight change,as measured by Kolmogorov distance, in one of the marginal distributions Bivariate Normal Distribution Contaminated Normal Distribution PEARSON CORRELATION 23 * Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand. R. Wilcox Influence Function (IF) of Pearson’s Correlation Unbounded Pearson’s correlation does not have infinitesimal robustness x x y y zz Recall, first order approximation of sample IF of r is: r-i : correlation coefficient based on all but the ith observation
  • 24. 24 ROBUSTNESS * * Figure borrowed from “Introduction to Robust Estimation and Hypothesis Testing” by Rand. R. Wilcox Sensitivity to anomalies Linear relationship between x & y but r = -0.21 PEARSON CORRELATION Quadrant (signum) correlation coefficient# # [Blomqvist, 1950] ^[Pasman and Shevlyakov, 1987] Correlation median estimator^
  • 25. ROBUSTNESS PEARSON CORRELATION 25 Based on Robust Principal Variables Alternatives
  • 29. STREAMING CORRELATION Incremental, One pass CHARACTERISTICS 29 Other correlation measures are not amenable to incremental computation Applications Security Correlation power analysis
  • 31. STREAMING CORRELATION Wide Spectrum of Approaches Sliding Windows, Damped Windows Reduction Smoothing Down sampling DFT [Agrawal et al. 1993, Zhu and Shasha 2002, Qiu et al. 2018] DWT [Chan and Fu 1999, Popivanov & Miller 2002] PCA, SVD PAA [Faloutsos and Yi 2000] APAC [Chakrabarti et al. 2001] Random projections [Grellmann et al. 2016] LSH [Sundaram et al. 2013] DATA SKETCHES 31
  • 32. Bursts Page views, Clicks, Retweets Correlated burstiness (e.g., data center operations) Root-cause analysis STREAMING CORRELATION * Figure borrowed from [Sakurai et al. 2005] * Lag between time series Lagged correlation/Cross-correlation ACTIONABLE INSIGHTS 32 [Zhu and Shasha 2002, Levine et al. 2016, Wu et al. 2017] [Vlachos et al. 2008, Kotov et al. 2011, Shafer et al. 2012, Kusmierczyk and Norvag 2015]
  • 34. PREDICTION MOTION 1952 “… prediction motion is the continuation of tracking motion after a target disappears from view.” 1955 1962 Unlike duration of target presentation, target speed exerts an influence on prediction accuracy. EXPLORED >5 DECADES BACK! 34
  • 35. PREDICTION MOTION 35 Human Motion/Robotics [Bütepage et al. 2017, Martinez et a al. 2017, Byravan & Fox] Traffic Prediction [Hermes et al. 2010, Walker et al. 2014, Yu et al. 2017] 360-Degree Video (AR/VR) [Bao et al. 2017, Vishwanath et al., 2017] Potpourri: Deep Learning Based Approaches [Oh et al. 2015, Mathieu et al. 2016, Liang et al. 2017]
  • 36. AUDIO-VIDEO CORRELATION A LONG HISTORY: EXPERIMENTAL PSYCHOLOGY -> DEEP LEARNING 36
  • 37. AUDIO-VIDEO CORRELATION 1952 People utilize visual and postural experiences in perceiving the position of an object in the field, of the whole field. 1897 Localization of sounds varied, being different when the source of sound was in sight from what it was when this was out of sight, and also in the latter case differing with different directions of attention, or with different suggestions as to the direction from which the sound came. 1941 SOUND LOCALIZATION: AN APPLICATION 37
  • 38. AUDIO-VIDEO CORRELATION 1977 1960 1976 LIP READING: AN APPLICATION 38 Demonstrated the influence of vision on speech perception McGurk and MacDonald Established the relationship of the visually perceived symbols to the underlying linguistic system Use visual information as an aid when white noise made speech difficult to hear 1954 There’s a great opportunity for the visual contribution at low speech-to-noise ratios
  • 39. AUDIO-VIDEO CORRELATION 2007 2004 Combined acoustic and visual feature vectors to distinguish live synchronous audio-video recordings from replay attacks that use audio with a still photo. LIVENESS AND SYNCHRONIZATION: AN APPLICATION 39 Extracted the correlated components of audio and lip features based on Canonical Correlation Analysis. 2009 Showed that there exists a relationship between perception of video presented in screen and accompanying audio signals, both stereo and spatial
  • 40. 40 AUDIO-VIDEO-TEXT Speaker identification in multi-speaker scenarios [Huang & Kingsbury 2013, Chung & Zisserman 2016, Torfi et al. 2017] Cross-Modal Correlation Learning in Audio and Lyrics [Yu et al. 2017, Tang et al. 2017] Speech enhancement [Xu et al. 2014, Hou et al. 2017, Kolbæk et al. 2017] Action Recognition and Video Highlight Detection [Wu et al. 2013, Sun et al. 2013, Takahashi et al. 2017] DEEP LEARNING WAY Emotion Recognition [Tzirakis et al. 2017, Pini et al. 2017]
  • 41. 41 OVERVIEW Satori is the only live data platform that enables immediate integration, interaction, correlation, and intelligent response at high throughput and ultra-low latency. SATORI A UNIFIED LIVE DATA PLATFORM
  • 43. 43 SATORI LIVE DATA LIFECYCLE
  • 45. 45 Online anomaly detection: speed-accuracy trade-off On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data* by Choudhary et al. 2017 Breakouts/Changepoints Skew in location of anomalies SUSCEPTIBILITY TO ANOMALIES * https://arxiv.org/pdf/1710.04735.pdf
  • 46. 46 HETEROSCEDASTICITY Methods (Adjusted) Percentile Bootstrap [Wilcox 1996] Nested Bootstrap [DiCiccio et al. 1992] Challenge: Detecting non-linear correlation 2001
  • 47. 47 Potential sources: Event based monitoring via sensors, Occlusion in a video Techniques: Resampling Lomb-Scargle Fourier Transform [Andersson 2007, Rehfeld et al. 2011] Kernel - such as Laplacian, Gaussian - based methods IRREGULARLY SPACED TIME SERIES
  • 48. TIME VARYING CORRELATION 48 * Figure borrowed from [Fu et al. 2013] TVCC: Time Varying Correlation Coefficient fMRI time-courses of 4 Regions of Interests (ROI) ✦ Time varying joint distribution ✦ Parameter non-constancy/Instability F Test [Chow’60] SupF Test [Quandt’60] Lagrange Multiplier Test [Nabeya and Tanaka’88, Nylom’89] ✦ Co-integrated processes I(1) [Hansen’90] ✦ Stochastic vs. Deterministic ✦ Dynamic Conditional Correlation (DCC) [Engle’02] * Post stimulus period with significant difference (p<0.05) w.r.t. pre-stimulus interval
  • 49. STREAMING CORRELATION Missing data Packets being dropped owing to, say, unexpected high traffic Data collection: Every, say, 5 seconds How to scale analysis to milli-seconds granularity? Unequal length time series Different sampling rates Small samples Bootstrapping Low SNR (Signal to Noise Ratio) 49
  • 50. NEXT FRONTIER LEVERAGING MULTIMODAL & CONTINUOUSLY EVOLVING CORRELATION FOR SELF-LEARNING 50
  • 52. 52 SELF-LEARNING EARLY WORK: GAMES 19501914 constructed a device which played an end game of king and rook against king. The machine played the side with king and rook and would force checkmate in a few moves however its human opponent played. Gerald Tesauro 199519592002
  • 53. 2002 2017 Amongst the first and most famous was the chess-playing automaton constructed in 1769 by Baron Kempelen … 1953 Alan Turing 2012 1970 GAME PLAYING POTPOURRI 53 19962007 The game of checkers has roughly 500 billion billion possible positions (5 × 1020)
  • 54. SELF-LEARNING 2016 REINFORCEMENT LEARNING 54 2016201720172018 AlphaGo Mastering the game of Go with deep neural networks and tree search by Silver et al. AlphaGo Zero Mastering the game of Go without Human knowledge by Silver et al. Alpha Zero Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm by Silver et al. Libratus Superhuman AI for heads-up no-limit poker: Libratus beats top professionals by Brown and Sandholm
  • 55.
  • 56.
  • 57. 57 Rank Correlation Methods by Kendall and Gibbons READINGS Correlation and Regression by Bobko Correlation by Garson Robust Correlation: Theory and Applications by Shevlyakov and Oja A Mathematical Theory of Evidence by Shafer BOOKS
  • 58. 58 Fast Approximate Correlation of Massive Time-series Data by Queen et al., 2010 READINGS Local Correlation Detection with Linearity Enhancement in Streaming Data by Xie et al., 2013 Fast Distributed Correlation Discovery Over Streaming Time-Series Data by Guo et al., 2015 Random Projection of Fast and Efficient Multivariate Correlation Analysis of High-Dimensional Data: A New Approach by Grellmann et al., 2016 Detection of Highly Correlated Live Data Streams by Alseghayer et al., 2017 STREAMING CORRELATION
  • 59. 59 Perception of body position and of the position of the visual field by Witkin, 1949 READINGS Autocorrelation, a principle for the evaluation of sensory information by the nervous system by Reichardt, 1961 EARLY RESEARCH IN AUDIO-VISUAL CORRELATION Binocular cross-correlation in time and space by Tyler and Julesz, 1978 Neurontropy. an entropy like measure of neural correlation by Julesz and Tyler, 1976 Cross correlation of sensory stimuli and electroencephalogram by Morgan, 1969
  • 60. 60 Patch to the future: Unsupervised visual prediction by Walker et al., 2014 READINGS Motion-Prediction-based Multicast for 360-Degree Video Transmissions by Bao et al., 2017 Dual Motion GAN for Future-Flow Embedded Video Prediction by Liang et al., 2017 Deep representation learning for human motion prediction and classification by Bütepage et al., 2017 On human motion prediction using recurrent neural networks by Martinez, 2017 RECENT WORKS IN PREDICTION MOTION
  • 61. 61 On Deep Multi-View Representation Learning by Wang et al., 2015 READINGS Correlational Neural Networks by Chandar et al., 2017 Common Representation Learning Using Step-based Correlation Multi-Modal CNN by Bhatt et al., 2017 Objects that Sound by Arandjelović and Zisserman, 2017 Deep Correlation Feature Learning for Face Verification in the Wild by Deng et al., 2017 COMMON REPRESENTATION LEARNING
  • 62. 62 READINGS Audiovisual Synchronization and Fusion Using Canonical Correlation Analysis by Sargin et al., 2007 The predictive power of trajectory motion by Watamaniuk, 2005 Seeing motion behind occluders by Watamaniuk and McKee, 1995 Temporal Coherence Theory For The Detection And Measurement Of Visual Motion by Grzywacz et al.,1995 Probabilistic Motion Estimation Based On Temporal Coherence by Burgi et al., 2000 AUDIO-VISUAL RESEARCH
  • 63. 63 A New Method of Audio-Visual Correlation Analysis by Kunka and Kostek, 2009 READINGS Uncertainty in ontologies: Dempster–Shafer theory for data fusion applications by Bellenger and Gatepaille, 2011 City Data Fusion: Sensor Data Fusion in the Internet of Things by Wang et al., 2015 Data Fusion and IoT for Smart Ubiquitous Environments: A Survey by Alam et al., 2017 Correlation Analysis of Audio and Video Contents: A Metadata based Approach by Algur et al., 2015 POTPOURRI
  • 64. 64 READINGS POTPOURRI Correlation detection as a general mechanism for multisensory integration by Parise and Ernst, 2016 Origin of information-limiting noise correlations by Kanitscheidera et al., 2015 Temporal structure and complexity affect audio-visual correspondence detection by Denison et al., 2013 Correlation versus causation in multisensory perception by Mitterer, Jesse, 2010
  • 65. 65 READINGS MATHEMATICS A Bayesian approach to problems in stochastic estimation and control by Ho and Lee, 1964 A Generalization of Bayesian Inference by Dempster, 1968 Multidimensional Scaling by Kruskal and Wish, 1978