SlideShare una empresa de Scribd logo
1 de 18
Descargar para leer sin conexión
Introduction
Statistical distances
Optimal Transport vs. Fisher-Rao distance
between Copulas
IEEE SSP 2016
G. Marti, S. Andler, F. Nielsen, P. Donnat
June 28, 2016
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Clustering of Time Series
We need a distance Dij between time series xi and xj
If we look for ‘correlation’, Dij is a decreasing function of ρij ,
a measure of ‘correlation’
Several choices are available for ρij . . .
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Copulas
Sklar’s Theorem:
F(xi , xj ) = Cij (Fi (xi ), Fj (xj ))
Cij , the copula, encodes the dependence structure
Fr´echet-Hoeffding bounds:
max{ui + uj − 1, 0} ≤ Cij (ui , uj ) ≤ min{ui , uj }
(left) lower-bound, (mid) independence, (right) upper-bound copulas
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Copulas - Gaussian Example
Gaussian copula: CGauss
R (ui , uj ) = ΦR(Φ−1(ui ), Φ−1(uj ))
The distribution is parametrized by a correlation matrix R.
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
The Target/Forget (copula-based) Dependence Coefficient
Dependence is measured as the relative distance from independence to
the nearest target-dependence: comonotonicity or counter-monotonicity
Which distances are appropriate between copulas for the task of
clustering (copulas and time series)?
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Definitions - Fisher-Rao geodesic distance
Metrization of the paramater space {θ ∈ Rd | p(X; θ)dx = 1}.
Consider the metric gjk(θ) = − ∂2 log p(x,θ)
∂θj ∂θk
p(x, θ)dx,
the infinitesimal length ds(θ) = ( θ) G(θ) θ,
the Fisher-Rao geodesic distance
FR(θ1, θ2) =
θ2
θ1
ds(θ).
f -divergences induce infinitesimal length proportional to
Fisher-Rao infinitesimal length:
Df (θ θ + dθ) =
1
2
( θ) G(θ) θ.
Thus, they have the same local behaviour [1].
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Definitions - Optimal Transport distances
Wasserstein metric
Wp(µ, ν)p
= inf
γ∈Γ(µ,ν) M×M
d(x, y)p
dγ(x, y)
Image from Optimal Transport for Image Processing, Papadakis
Other transportation distances: regularized discrete optimal
transport [3], Sinkhorn distances [2], . . .
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Geometry of covariances
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Distances between Gaussian copulas
Copulas C1, C2, C3 encoding a correlation of 0.5, 0.99, 0.9999 respectively;
Which pair of copulas is the nearest?
- For Fisher-Rao, Kullback-Leibler, Hellinger and related divergences:
D(C1, C2) ≤ D(C2, C3);
- For Wasserstein: W2(C2, C3) ≤ W2(C1, C2)
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Distances as a function of (ρ1, ρ2)
Distance heatmap and surface as a function of (ρ1, ρ2)
for Fisher-Rao for Wasserstein W2
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Distances impact on clustering
Datasets of bivariate time series are generated from six Gaussian copulas
with correlation .1, .2, .6, .7, .99, .9999
Distance heatmaps for Fisher-Rao (left), W2 (right); Using Ward
clustering, Fisher-Rao yields clusters of copulas with correlations
{.1, .2, .6, .7}, {.99}, {.9999}, W2 yields {.1, .2}, {.6, .7}, {.99, .9999}
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Fisher metric and the Cram´er–Rao lower bound
Cram´er–Rao lower bound (CRLB)
The variance of any unbiased estimator ˆθ of θ is bounded by the
reciprocal of the Fisher information G(θ):
var(ˆθ) ≥
1
G(θ)
.
In the bivariate Gaussian copula case,
var(ˆρ) ≥
(ρ − 1)2(ρ + 1)2
3(ρ2 + 1)
.
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Fisher metric and the Cram´er–Rao lower bound
We consider the set of 2 × 2 correlation matrices C =
1 θ
θ 1
parameterized by θ.
Let x =
x1
x2
∈ R2
.
f (x; θ) = 1
2π 1−θ2
exp − 1
2
x C−1
x = 1
2π 1−θ2
exp − 1
2(1−θ2)
(x2
1 + x2
2 − 2θx1x2)
log f (x; θ) = − log(2π 1 − θ2) − 1
2(1−θ2)
(x2
1 + x2
2 − 2θx1x2)
∂2 log f (x;θ)
∂θ2 = − θ2+1
(θ2−1)2 −
x2
1
2(θ+1)3 +
x2
1
2(θ−1)3 −
x2
2
2(θ+1)3 +
x2
2
2(θ−1)3 −
x1x2
(θ+1)3 −
x1x2
(θ−1)3
Then, we compute ∞
−∞
∂2 log f (x;θ)
∂θ2 f (x; θ)dx.
Since E[x1] = E[x2] = 0, E[x1x2] = θ, E[x2
1 ] = E[x2
2 ] = 1, we get
∞
−∞
∂2 log f (x;θ)
∂θ2 f (x; θ)dx =
− θ2+1
(θ2−1)2 − 1
2(θ+1)3 + 1
2(θ−1)3 − 1
2(θ+1)3 + 1
2(θ−1)3 − θ
(θ+1)3 − θ
(θ−1)3 = −
3(θ2+1)
(θ−1)2(θ+1)2
Thus,
G(θ) =
3(θ2
+ 1)
(θ − 1)2(θ + 1)2
.
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Fisher metric and the Cram´er–Rao lower bound
In the bivariate Gaussian copula case,
var(ˆρ) ≥
(ρ − 1)2(ρ + 1)2
3(ρ2 + 1)
.
Recall that locally Fisher-Rao and the f -divergences are a
quadratic form of the Fisher metric ( θ) G(θ) θ. So, the
discriminative power of these distances is well calibrated with
respect to statistical uncertainty. For this purpose, they induce the
appropriate curvature on the parameter space.
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Properties of these distances
In addition, for clustering we prefer OT since:
in a parametric setting:
Fisher-Rao and f -divergences are defined on density manifolds,
but some important copulas (such as the Fr´echet-Hoeffding
upper bound) do not belong to these manifolds;
Thus, in case of closed-form formulas (such as in the Gaussian
case), they are ill-defined for these copulas (for perfect
dependence, covariance is not invertible)
in a non-parametric/empirical setting:
f -divergences are defined for absolutely continuous measures,
thus require a pre-processing KDE
they are not aware of the support geometry, thus badly handle
noise on the support
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Barycenters
OT is defined for both discrete/empirical and continuous measures
and is support-geometry aware:
0 0.5 1
0
0.5
1
0.0000
0.0015
0.0030
0.0045
0.0060
0.0075
0.0090
0.0105
0.0120
0 0.5 1
0
0.5
1
0.0000
0.0015
0.0030
0.0045
0.0060
0.0075
0.0090
0.0105
0.0120
0 0.5 1
0
0.5
1
0.0000
0.0008
0.0016
0.0024
0.0032
0.0040
0.0048
0.0056
0 0.5 1
0
0.5
1
0.0000
0.0015
0.0030
0.0045
0.0060
0.0075
0.0090
0.0105
0.0120
0 0.5 1
0
0.5
1
0.0000
0.0015
0.0030
0.0045
0.0060
0.0075
0.0090
0.0105
0.0120
5 copulas describing the dependence between X ∼ U([0, 1]) and
Y ∼ (X ± i )2
, where i is a constant noise specific for each distribution
0 0.5 1
0
0.5
1 Wasserstein barycenter copula
0.0000
0.0004
0.0008
0.0012
0.0016
0.0020
0.0024
0.0028
0.0032
Barycenter of the 5 copulas for a divergence and OT
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Future Research
Develop further geometries of copulas
using Optimal Transport: show that dependence-clustering of
time series is improved over standard correlations
using f -divergences: detect efficiently dependence-regime
switching in multivariate time series (cf. Fr´ed´eric Barbaresco’s
work on radar signal processing)
Numerical experiments and code:
https://www.datagrapple.com/Tech/fisher-vs-ot.html
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
Introduction
Statistical distances
Shun-ichi Amari and Andrzej Cichocki.
Information geometry of divergence functions.
Bulletin of the Polish Academy of Sciences: Technical
Sciences, 58(1):183–195, 2010.
Marco Cuturi.
Sinkhorn distances: Lightspeed computation of optimal
transport.
In Advances in Neural Information Processing Systems, pages
2292–2300, 2013.
Sira Ferradans, Nicolas Papadakis, Julien Rabin, Gabriel Peyr´e,
and Jean-Fran¸cois Aujol.
Regularized discrete optimal transport.
Springer, 2013.
Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas

Más contenido relacionado

La actualidad más candente

人間と人工知能(前篇)
人間と人工知能(前篇)人間と人工知能(前篇)
人間と人工知能(前篇)Youichiro Miyake
 
Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...
	 Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...	 Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...
Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...Fernando A. B. Sabino da Silva
 
Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015Chris Ohk
 
InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...홍배 김
 
Presentation on Time Travel
Presentation on Time TravelPresentation on Time Travel
Presentation on Time TravelSourav Dey
 
Introduction to Capsule Networks (CapsNets)
Introduction to Capsule Networks (CapsNets)Introduction to Capsule Networks (CapsNets)
Introduction to Capsule Networks (CapsNets)Aurélien Géron
 
Object recognition of CIFAR - 10
Object recognition of CIFAR  - 10Object recognition of CIFAR  - 10
Object recognition of CIFAR - 10Ratul Alahy
 
The Scientific Legacy of Stephen Hawking
The Scientific Legacy of Stephen HawkingThe Scientific Legacy of Stephen Hawking
The Scientific Legacy of Stephen HawkingDanielBaumann11
 
The Physics Of Time Travel
The Physics Of Time TravelThe Physics Of Time Travel
The Physics Of Time Travelguest433bdee
 
[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun YooJaeJun Yoo
 

La actualidad más candente (16)

人間と人工知能(前篇)
人間と人工知能(前篇)人間と人工知能(前篇)
人間と人工知能(前篇)
 
Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...
	 Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...	 Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...
Pairs Trading: Optimizing via Mixed Copula versus Distance Method for S&P 5...
 
Time travel
Time travel Time travel
Time travel
 
Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015Trust Region Policy Optimization, Schulman et al, 2015
Trust Region Policy Optimization, Schulman et al, 2015
 
Time travel
Time travelTime travel
Time travel
 
InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...
InfoGAN: Interpretable Representation Learning by Information Maximizing Gene...
 
Time travel
Time travelTime travel
Time travel
 
time travel
time traveltime travel
time travel
 
Presentation on Time Travel
Presentation on Time TravelPresentation on Time Travel
Presentation on Time Travel
 
Time travel
Time travelTime travel
Time travel
 
Introduction to Capsule Networks (CapsNets)
Introduction to Capsule Networks (CapsNets)Introduction to Capsule Networks (CapsNets)
Introduction to Capsule Networks (CapsNets)
 
Object recognition of CIFAR - 10
Object recognition of CIFAR  - 10Object recognition of CIFAR  - 10
Object recognition of CIFAR - 10
 
The Scientific Legacy of Stephen Hawking
The Scientific Legacy of Stephen HawkingThe Scientific Legacy of Stephen Hawking
The Scientific Legacy of Stephen Hawking
 
The Physics Of Time Travel
The Physics Of Time TravelThe Physics Of Time Travel
The Physics Of Time Travel
 
Time travel
Time travelTime travel
Time travel
 
[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo[PR12] Capsule Networks - Jaejun Yoo
[PR12] Capsule Networks - Jaejun Yoo
 

Destacado

Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesGautier Marti
 
A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlationsGautier Marti
 
Proximal Splitting and Optimal Transport
Proximal Splitting and Optimal TransportProximal Splitting and Optimal Transport
Proximal Splitting and Optimal TransportGabriel Peyré
 
Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...
Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...
Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...Gabriel Peyré
 
Diapo bourse aux sports
Diapo bourse aux sportsDiapo bourse aux sports
Diapo bourse aux sportsmfrfye
 
Fernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando M. Imperiale
 
Here be dragons
Here be dragonsHere be dragons
Here be dragonsdeelay1
 
On the stability of clustering financial time series
On the stability of clustering financial time seriesOn the stability of clustering financial time series
On the stability of clustering financial time seriesGautier Marti
 
Neurological considerations
Neurological considerationsNeurological considerations
Neurological considerationsJess Sarabia
 
Fernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajarFernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajarFernando M. Imperiale
 
Searching for the grey gold - 2013
Searching for the grey gold - 2013Searching for the grey gold - 2013
Searching for the grey gold - 2013Olle Bergendahl
 
IBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESIBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESFernando M. Imperiale
 
Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015Bestr
 
Cormac Ferrick Sociology 204 Final Presentation
Cormac Ferrick Sociology 204 Final PresentationCormac Ferrick Sociology 204 Final Presentation
Cormac Ferrick Sociology 204 Final PresentationMac Ferrick
 

Destacado (19)

Optimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time SeriesOptimal Transport between Copulas for Clustering Time Series
Optimal Transport between Copulas for Clustering Time Series
 
A closer look at correlations
A closer look at correlationsA closer look at correlations
A closer look at correlations
 
Proximal Splitting and Optimal Transport
Proximal Splitting and Optimal TransportProximal Splitting and Optimal Transport
Proximal Splitting and Optimal Transport
 
Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...
Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...
Low Complexity Regularization of Inverse Problems - Course #3 Proximal Splitt...
 
NSO_cv_20160511
NSO_cv_20160511NSO_cv_20160511
NSO_cv_20160511
 
Diapo bourse aux sports
Diapo bourse aux sportsDiapo bourse aux sports
Diapo bourse aux sports
 
Prabhu Sundaramurthi (4)
Prabhu Sundaramurthi (4)Prabhu Sundaramurthi (4)
Prabhu Sundaramurthi (4)
 
Fernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMESFernando Imperiale - Security Intelligence para PYMES
Fernando Imperiale - Security Intelligence para PYMES
 
Here be dragons
Here be dragonsHere be dragons
Here be dragons
 
Prezentacja1
Prezentacja1Prezentacja1
Prezentacja1
 
On the stability of clustering financial time series
On the stability of clustering financial time seriesOn the stability of clustering financial time series
On the stability of clustering financial time series
 
Neurological considerations
Neurological considerationsNeurological considerations
Neurological considerations
 
Fernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajarFernando Imperiale - Una aguja en el pajar
Fernando Imperiale - Una aguja en el pajar
 
EColi_CaseStudyRoughDraft.docx
EColi_CaseStudyRoughDraft.docxEColi_CaseStudyRoughDraft.docx
EColi_CaseStudyRoughDraft.docx
 
Searching for the grey gold - 2013
Searching for the grey gold - 2013Searching for the grey gold - 2013
Searching for the grey gold - 2013
 
IBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMESIBM - Security Intelligence para PYMES
IBM - Security Intelligence para PYMES
 
Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015Carla Casilli - Cineca + open badges - May 2015
Carla Casilli - Cineca + open badges - May 2015
 
bala.resume
bala.resumebala.resume
bala.resume
 
Cormac Ferrick Sociology 204 Final Presentation
Cormac Ferrick Sociology 204 Final PresentationCormac Ferrick Sociology 204 Final Presentation
Cormac Ferrick Sociology 204 Final Presentation
 

Similar a Optimal Transport vs. Fisher-Rao distance between Copulas

Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time SeriesGautier Marti
 
Bayesian phylogenetic inference_big4_ws_2016-10-10
Bayesian phylogenetic inference_big4_ws_2016-10-10Bayesian phylogenetic inference_big4_ws_2016-10-10
Bayesian phylogenetic inference_big4_ws_2016-10-10FredrikRonquist
 
Continuous and Discrete-Time Analysis of SGD
Continuous and Discrete-Time Analysis of SGDContinuous and Discrete-Time Analysis of SGD
Continuous and Discrete-Time Analysis of SGDValentin De Bortoli
 
Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Valentin De Bortoli
 
The role of kalman filter in improving the accuracy of gps kinematic technique
The role of kalman filter in improving the accuracy of gps kinematic techniqueThe role of kalman filter in improving the accuracy of gps kinematic technique
The role of kalman filter in improving the accuracy of gps kinematic techniqueIAEME Publication
 
random forests for ABC model choice and parameter estimation
random forests for ABC model choice and parameter estimationrandom forests for ABC model choice and parameter estimation
random forests for ABC model choice and parameter estimationChristian Robert
 
Natalini nse slide_giu2013
Natalini nse slide_giu2013Natalini nse slide_giu2013
Natalini nse slide_giu2013Madd Maths
 
International conference "QP 34 -- Quantum Probability and Related Topics"
International conference "QP 34 -- Quantum Probability and Related Topics"International conference "QP 34 -- Quantum Probability and Related Topics"
International conference "QP 34 -- Quantum Probability and Related Topics"Medhi Corneille Famibelle*
 
ABC based on Wasserstein distances
ABC based on Wasserstein distancesABC based on Wasserstein distances
ABC based on Wasserstein distancesChristian Robert
 
Q-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeQ-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeMagdi Mohamed
 
Q-Metrics in Theory And Practice
Q-Metrics in Theory And PracticeQ-Metrics in Theory And Practice
Q-Metrics in Theory And Practiceguest3550292
 
ABC with Wasserstein distances
ABC with Wasserstein distancesABC with Wasserstein distances
ABC with Wasserstein distancesChristian Robert
 

Similar a Optimal Transport vs. Fisher-Rao distance between Copulas (20)

Clustering Random Walk Time Series
Clustering Random Walk Time SeriesClustering Random Walk Time Series
Clustering Random Walk Time Series
 
Bayesian phylogenetic inference_big4_ws_2016-10-10
Bayesian phylogenetic inference_big4_ws_2016-10-10Bayesian phylogenetic inference_big4_ws_2016-10-10
Bayesian phylogenetic inference_big4_ws_2016-10-10
 
Continuous and Discrete-Time Analysis of SGD
Continuous and Discrete-Time Analysis of SGDContinuous and Discrete-Time Analysis of SGD
Continuous and Discrete-Time Analysis of SGD
 
MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...
MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...
MUMS Opening Workshop - Panel Discussion: Facts About Some Statisitcal Models...
 
Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...Maximum likelihood estimation of regularisation parameters in inverse problem...
Maximum likelihood estimation of regularisation parameters in inverse problem...
 
The role of kalman filter in improving the accuracy of gps kinematic technique
The role of kalman filter in improving the accuracy of gps kinematic techniqueThe role of kalman filter in improving the accuracy of gps kinematic technique
The role of kalman filter in improving the accuracy of gps kinematic technique
 
Climate Extremes Workshop - A Semiparametric Bayesian Clustering Model for S...
Climate Extremes Workshop -  A Semiparametric Bayesian Clustering Model for S...Climate Extremes Workshop -  A Semiparametric Bayesian Clustering Model for S...
Climate Extremes Workshop - A Semiparametric Bayesian Clustering Model for S...
 
random forests for ABC model choice and parameter estimation
random forests for ABC model choice and parameter estimationrandom forests for ABC model choice and parameter estimation
random forests for ABC model choice and parameter estimation
 
the ABC of ABC
the ABC of ABCthe ABC of ABC
the ABC of ABC
 
Climate Extremes Workshop - Semiparametric Estimation of Heavy Tailed Density...
Climate Extremes Workshop - Semiparametric Estimation of Heavy Tailed Density...Climate Extremes Workshop - Semiparametric Estimation of Heavy Tailed Density...
Climate Extremes Workshop - Semiparametric Estimation of Heavy Tailed Density...
 
Natalini nse slide_giu2013
Natalini nse slide_giu2013Natalini nse slide_giu2013
Natalini nse slide_giu2013
 
International conference "QP 34 -- Quantum Probability and Related Topics"
International conference "QP 34 -- Quantum Probability and Related Topics"International conference "QP 34 -- Quantum Probability and Related Topics"
International conference "QP 34 -- Quantum Probability and Related Topics"
 
CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...
CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...
CLIM Fall 2017 Course: Statistics for Climate Research, Statistics of Climate...
 
Bayesian computation with INLA
Bayesian computation with INLABayesian computation with INLA
Bayesian computation with INLA
 
ABC based on Wasserstein distances
ABC based on Wasserstein distancesABC based on Wasserstein distances
ABC based on Wasserstein distances
 
QMC: Operator Splitting Workshop, Proximal Algorithms in Probability Spaces -...
QMC: Operator Splitting Workshop, Proximal Algorithms in Probability Spaces -...QMC: Operator Splitting Workshop, Proximal Algorithms in Probability Spaces -...
QMC: Operator Splitting Workshop, Proximal Algorithms in Probability Spaces -...
 
Q-Metrics in Theory and Practice
Q-Metrics in Theory and PracticeQ-Metrics in Theory and Practice
Q-Metrics in Theory and Practice
 
Q-Metrics in Theory And Practice
Q-Metrics in Theory And PracticeQ-Metrics in Theory And Practice
Q-Metrics in Theory And Practice
 
ABC with Wasserstein distances
ABC with Wasserstein distancesABC with Wasserstein distances
ABC with Wasserstein distances
 
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
Program on Quasi-Monte Carlo and High-Dimensional Sampling Methods for Applie...
 

Más de Gautier Marti

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeGautier Marti
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...Gautier Marti
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsGautier Marti
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...Gautier Marti
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?Gautier Marti
 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGautier Marti
 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in FinanceGautier Marti
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsGautier Marti
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaGautier Marti
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...Gautier Marti
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesGautier Marti
 
Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Gautier Marti
 
Clustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesClustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesGautier Marti
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Gautier Marti
 
Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Gautier Marti
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationGautier Marti
 
On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...Gautier Marti
 

Más de Gautier Marti (17)

Using Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of CodeUsing Large Language Models in 10 Lines of Code
Using Large Language Models in 10 Lines of Code
 
What deep learning can bring to...
What deep learning can bring to...What deep learning can bring to...
What deep learning can bring to...
 
A quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptionsA quick demo of Top2Vec With application on 2020 10-K business descriptions
A quick demo of Top2Vec With application on 2020 10-K business descriptions
 
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
cCorrGAN: Conditional Correlation GAN for Learning Empirical Conditional Dist...
 
How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?How deep generative models can help quants reduce the risk of overfitting?
How deep generative models can help quants reduce the risk of overfitting?
 
Generating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in FinanceGenerating Realistic Synthetic Data in Finance
Generating Realistic Synthetic Data in Finance
 
Applications of GANs in Finance
Applications of GANs in FinanceApplications of GANs in Finance
Applications of GANs in Finance
 
My recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returnsMy recent attempts at using GANs for simulating realistic stocks returns
My recent attempts at using GANs for simulating realistic stocks returns
 
Takeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, CaliforniaTakeaways from ICML 2019, Long Beach, California
Takeaways from ICML 2019, Long Beach, California
 
A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...A review of two decades of correlations, hierarchies, networks and clustering...
A review of two decades of correlations, hierarchies, networks and clustering...
 
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time SeriesAutoregressive Convolutional Neural Networks for Asynchronous Time Series
Autoregressive Convolutional Neural Networks for Asynchronous Time Series
 
Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...Some contributions to the clustering of financial time series - Applications ...
Some contributions to the clustering of financial time series - Applications ...
 
Clustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence ratesClustering CDS: algorithms, distances, stability and convergence rates
Clustering CDS: algorithms, distances, stability and convergence rates
 
Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...Clustering Financial Time Series using their Correlations and their Distribut...
Clustering Financial Time Series using their Correlations and their Distribut...
 
Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?Clustering Financial Time Series: How Long is Enough?
Clustering Financial Time Series: How Long is Enough?
 
On Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond CorrelationOn Clustering Financial Time Series - Beyond Correlation
On Clustering Financial Time Series - Beyond Correlation
 
On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...On clustering financial time series - A need for distances between dependent ...
On clustering financial time series - A need for distances between dependent ...
 

Último

办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 

Último (20)

办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 

Optimal Transport vs. Fisher-Rao distance between Copulas

  • 1. Introduction Statistical distances Optimal Transport vs. Fisher-Rao distance between Copulas IEEE SSP 2016 G. Marti, S. Andler, F. Nielsen, P. Donnat June 28, 2016 Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 2. Introduction Statistical distances Clustering of Time Series We need a distance Dij between time series xi and xj If we look for ‘correlation’, Dij is a decreasing function of ρij , a measure of ‘correlation’ Several choices are available for ρij . . . Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 3. Introduction Statistical distances Copulas Sklar’s Theorem: F(xi , xj ) = Cij (Fi (xi ), Fj (xj )) Cij , the copula, encodes the dependence structure Fr´echet-Hoeffding bounds: max{ui + uj − 1, 0} ≤ Cij (ui , uj ) ≤ min{ui , uj } (left) lower-bound, (mid) independence, (right) upper-bound copulas Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 4. Introduction Statistical distances Copulas - Gaussian Example Gaussian copula: CGauss R (ui , uj ) = ΦR(Φ−1(ui ), Φ−1(uj )) The distribution is parametrized by a correlation matrix R. Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 5. Introduction Statistical distances The Target/Forget (copula-based) Dependence Coefficient Dependence is measured as the relative distance from independence to the nearest target-dependence: comonotonicity or counter-monotonicity Which distances are appropriate between copulas for the task of clustering (copulas and time series)? Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 6. Introduction Statistical distances Definitions - Fisher-Rao geodesic distance Metrization of the paramater space {θ ∈ Rd | p(X; θ)dx = 1}. Consider the metric gjk(θ) = − ∂2 log p(x,θ) ∂θj ∂θk p(x, θ)dx, the infinitesimal length ds(θ) = ( θ) G(θ) θ, the Fisher-Rao geodesic distance FR(θ1, θ2) = θ2 θ1 ds(θ). f -divergences induce infinitesimal length proportional to Fisher-Rao infinitesimal length: Df (θ θ + dθ) = 1 2 ( θ) G(θ) θ. Thus, they have the same local behaviour [1]. Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 7. Introduction Statistical distances Definitions - Optimal Transport distances Wasserstein metric Wp(µ, ν)p = inf γ∈Γ(µ,ν) M×M d(x, y)p dγ(x, y) Image from Optimal Transport for Image Processing, Papadakis Other transportation distances: regularized discrete optimal transport [3], Sinkhorn distances [2], . . . Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 8. Introduction Statistical distances Geometry of covariances Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 9. Introduction Statistical distances Distances between Gaussian copulas Copulas C1, C2, C3 encoding a correlation of 0.5, 0.99, 0.9999 respectively; Which pair of copulas is the nearest? - For Fisher-Rao, Kullback-Leibler, Hellinger and related divergences: D(C1, C2) ≤ D(C2, C3); - For Wasserstein: W2(C2, C3) ≤ W2(C1, C2) Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 10. Introduction Statistical distances Distances as a function of (ρ1, ρ2) Distance heatmap and surface as a function of (ρ1, ρ2) for Fisher-Rao for Wasserstein W2 Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 11. Introduction Statistical distances Distances impact on clustering Datasets of bivariate time series are generated from six Gaussian copulas with correlation .1, .2, .6, .7, .99, .9999 Distance heatmaps for Fisher-Rao (left), W2 (right); Using Ward clustering, Fisher-Rao yields clusters of copulas with correlations {.1, .2, .6, .7}, {.99}, {.9999}, W2 yields {.1, .2}, {.6, .7}, {.99, .9999} Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 12. Introduction Statistical distances Fisher metric and the Cram´er–Rao lower bound Cram´er–Rao lower bound (CRLB) The variance of any unbiased estimator ˆθ of θ is bounded by the reciprocal of the Fisher information G(θ): var(ˆθ) ≥ 1 G(θ) . In the bivariate Gaussian copula case, var(ˆρ) ≥ (ρ − 1)2(ρ + 1)2 3(ρ2 + 1) . Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 13. Introduction Statistical distances Fisher metric and the Cram´er–Rao lower bound We consider the set of 2 × 2 correlation matrices C = 1 θ θ 1 parameterized by θ. Let x = x1 x2 ∈ R2 . f (x; θ) = 1 2π 1−θ2 exp − 1 2 x C−1 x = 1 2π 1−θ2 exp − 1 2(1−θ2) (x2 1 + x2 2 − 2θx1x2) log f (x; θ) = − log(2π 1 − θ2) − 1 2(1−θ2) (x2 1 + x2 2 − 2θx1x2) ∂2 log f (x;θ) ∂θ2 = − θ2+1 (θ2−1)2 − x2 1 2(θ+1)3 + x2 1 2(θ−1)3 − x2 2 2(θ+1)3 + x2 2 2(θ−1)3 − x1x2 (θ+1)3 − x1x2 (θ−1)3 Then, we compute ∞ −∞ ∂2 log f (x;θ) ∂θ2 f (x; θ)dx. Since E[x1] = E[x2] = 0, E[x1x2] = θ, E[x2 1 ] = E[x2 2 ] = 1, we get ∞ −∞ ∂2 log f (x;θ) ∂θ2 f (x; θ)dx = − θ2+1 (θ2−1)2 − 1 2(θ+1)3 + 1 2(θ−1)3 − 1 2(θ+1)3 + 1 2(θ−1)3 − θ (θ+1)3 − θ (θ−1)3 = − 3(θ2+1) (θ−1)2(θ+1)2 Thus, G(θ) = 3(θ2 + 1) (θ − 1)2(θ + 1)2 . Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 14. Introduction Statistical distances Fisher metric and the Cram´er–Rao lower bound In the bivariate Gaussian copula case, var(ˆρ) ≥ (ρ − 1)2(ρ + 1)2 3(ρ2 + 1) . Recall that locally Fisher-Rao and the f -divergences are a quadratic form of the Fisher metric ( θ) G(θ) θ. So, the discriminative power of these distances is well calibrated with respect to statistical uncertainty. For this purpose, they induce the appropriate curvature on the parameter space. Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 15. Introduction Statistical distances Properties of these distances In addition, for clustering we prefer OT since: in a parametric setting: Fisher-Rao and f -divergences are defined on density manifolds, but some important copulas (such as the Fr´echet-Hoeffding upper bound) do not belong to these manifolds; Thus, in case of closed-form formulas (such as in the Gaussian case), they are ill-defined for these copulas (for perfect dependence, covariance is not invertible) in a non-parametric/empirical setting: f -divergences are defined for absolutely continuous measures, thus require a pre-processing KDE they are not aware of the support geometry, thus badly handle noise on the support Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 16. Introduction Statistical distances Barycenters OT is defined for both discrete/empirical and continuous measures and is support-geometry aware: 0 0.5 1 0 0.5 1 0.0000 0.0015 0.0030 0.0045 0.0060 0.0075 0.0090 0.0105 0.0120 0 0.5 1 0 0.5 1 0.0000 0.0015 0.0030 0.0045 0.0060 0.0075 0.0090 0.0105 0.0120 0 0.5 1 0 0.5 1 0.0000 0.0008 0.0016 0.0024 0.0032 0.0040 0.0048 0.0056 0 0.5 1 0 0.5 1 0.0000 0.0015 0.0030 0.0045 0.0060 0.0075 0.0090 0.0105 0.0120 0 0.5 1 0 0.5 1 0.0000 0.0015 0.0030 0.0045 0.0060 0.0075 0.0090 0.0105 0.0120 5 copulas describing the dependence between X ∼ U([0, 1]) and Y ∼ (X ± i )2 , where i is a constant noise specific for each distribution 0 0.5 1 0 0.5 1 Wasserstein barycenter copula 0.0000 0.0004 0.0008 0.0012 0.0016 0.0020 0.0024 0.0028 0.0032 Barycenter of the 5 copulas for a divergence and OT Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 17. Introduction Statistical distances Future Research Develop further geometries of copulas using Optimal Transport: show that dependence-clustering of time series is improved over standard correlations using f -divergences: detect efficiently dependence-regime switching in multivariate time series (cf. Fr´ed´eric Barbaresco’s work on radar signal processing) Numerical experiments and code: https://www.datagrapple.com/Tech/fisher-vs-ot.html Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas
  • 18. Introduction Statistical distances Shun-ichi Amari and Andrzej Cichocki. Information geometry of divergence functions. Bulletin of the Polish Academy of Sciences: Technical Sciences, 58(1):183–195, 2010. Marco Cuturi. Sinkhorn distances: Lightspeed computation of optimal transport. In Advances in Neural Information Processing Systems, pages 2292–2300, 2013. Sira Ferradans, Nicolas Papadakis, Julien Rabin, Gabriel Peyr´e, and Jean-Fran¸cois Aujol. Regularized discrete optimal transport. Springer, 2013. Gautier Marti Optimal Transport vs. Fisher-Rao distance between Copulas