Journal Club – Bayes Estimators for Phylogenetic Reconstruction
Syst. Biol. 60(4), 528–540, 2011, doi:10.1093/sysbio/syr021

Leonardo de O. Martins
University of Vigo
July 22, 2011

Leo Martins (Univ. Vigo)           Journal Club               22/7   1 / 12
Outline


1 Distance as a penalty


2 Distances, everywhere


3 No phylogenetics, yet...


4 Trees as points in space


5 To the paper, then




Statistical Risk

The risk $\rho$ associated with a decision $\hat{\theta}$ is the expected loss of this decision $\hat{\theta}$ (which can be, for instance, an estimate of $\theta$):

$$\rho(\hat{\theta}) = \int L(\theta, \hat{\theta}) \, P(\theta \mid \text{data}) \, d\theta$$

(properly called the posterior expected loss)

The loss function $L(\theta, \hat{\theta})$ is a penalty we give for "deciding" away from the parameter. Examples are the squared loss and the absolute loss.

For some loss functions, we can calculate the best decision (i.e. the one that minimizes the risk, for any data).
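As a minimal sketch of the definition above, the integral can be approximated by averaging the loss over draws from the posterior. The draws and the function names (`posterior_risk`, `squared_loss`, `absolute_loss`) are hypothetical, chosen only for illustration.

```python
from statistics import fmean

def squared_loss(theta, decision):
    return (theta - decision) ** 2

def absolute_loss(theta, decision):
    return abs(theta - decision)

def posterior_risk(decision, posterior_draws, loss):
    """Monte Carlo estimate of the posterior expected loss of `decision`."""
    return fmean([loss(theta, decision) for theta in posterior_draws])

# Hypothetical draws from P(theta | data), for illustration only.
draws = [1.0, 2.0, 2.0, 3.0, 7.0]

# Compare two candidate decisions: 3.0 is the sample mean, 2.0 the sample median.
for decision in (2.0, 3.0):
    print(decision,
          posterior_risk(decision, draws, squared_loss),
          posterior_risk(decision, draws, absolute_loss))
```

With these draws the mean has the smaller squared-loss risk and the median has the smaller absolute-loss risk, so the choice of loss decides which decision is "best".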
How to summarise a collection of objects?

[Figure: scatter plot of the points generated by the code below, shown three times with a different summary each time]

    scattered points
    centroid: minimizes the summed squared distance to all points
    regression line: minimizes the summed squared vertical distance to all points

  library(MASS)
  # Draw 1000 correlated points; the covariance matrix must be symmetric
  x <- mvrnorm(n = 1000, mu = c(0, 0), Sigma = matrix(c(1, 0.9, 0.9, 1), 2, 2, byrow = TRUE))
  plot(x[, 1], x[, 2], pch = ".", cex = 2, xlab = "x", ylab = "y")

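A rough sketch of the two summaries as minimizers, using only the standard library. The simulated sample is an assumption made here (it is not the `mvrnorm` draw from the slide), and "distance" is taken to mean summed squared Euclidean distance for the centroid and summed squared vertical distance for the regression line.

```python
import random

random.seed(1)
# Hypothetical correlated 2-D sample, for illustration only.
xs = [random.gauss(0, 1) for _ in range(1000)]
ys = [0.9 * x + random.gauss(0, 0.4) for x in xs]
n = len(xs)

# Centroid: the point minimizing the sum of squared Euclidean distances.
cx, cy = sum(xs) / n, sum(ys) / n

def sum_sq_dist_to_point(px, py):
    return sum((x - px) ** 2 + (y - py) ** 2 for x, y in zip(xs, ys))

# Least-squares line: minimizes the sum of squared vertical distances.
sxx = sum((x - cx) ** 2 for x in xs)
sxy = sum((x - cx) * (y - cy) for x, y in zip(xs, ys))
slope = sxy / sxx
intercept = cy - slope * cx

def sum_sq_residuals(a, b):
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))

print("centroid:", cx, cy)
print("line: y =", intercept, "+", slope, "* x")
# Moving away from either summary increases its penalty.
print(sum_sq_dist_to_point(cx, cy) < sum_sq_dist_to_point(cx + 0.1, cy))
print(sum_sq_residuals(intercept, slope) < sum_sq_residuals(intercept, slope + 0.05))
```

Both summaries are answers to the same question, "which object is closest to all points?", and differ only in the kind of object and the distance used.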
How to summarise the posterior distribution P(X)?

Posterior mean
Minimize the expected loss under a squared loss function

$$L(\theta, \hat{\theta}) = (\theta - \hat{\theta})^2$$

(squared Euclidean distance)

Posterior median
Minimize the expected loss under a linear loss function

$$L(\theta, \hat{\theta}) = |\theta - \hat{\theta}|$$

(Manhattan distance)

Posterior mode
a.k.a. Maximum A Posteriori (MAP) estimate.
Minimize the expected loss under a delta loss function

$$L(\theta, \hat{\theta}) = \begin{cases} 0, & \text{for } \theta = \hat{\theta} \\ 1, & \text{for } \theta \neq \hat{\theta} \end{cases}$$

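A short sketch of why each loss leads to its summary. The shorthand $p(\theta) = P(\theta \mid \text{data})$ is introduced here and is not on the slides; the first two lines assume a continuous posterior that allows differentiating under the integral, and the last line assumes a discrete parameter (such as a tree topology).

```latex
\begin{align*}
\text{squared loss:}\quad
  \frac{d}{d\hat{\theta}} \int (\theta - \hat{\theta})^2 \, p(\theta) \, d\theta
  &= -2 \int (\theta - \hat{\theta}) \, p(\theta) \, d\theta = 0
  \;\Rightarrow\; \hat{\theta} = \int \theta \, p(\theta) \, d\theta
  \quad \text{(posterior mean)} \\
\text{linear loss:}\quad
  \frac{d}{d\hat{\theta}} \int |\theta - \hat{\theta}| \, p(\theta) \, d\theta
  &= P(\theta < \hat{\theta} \mid \text{data}) - P(\theta > \hat{\theta} \mid \text{data}) = 0
  \quad \text{(posterior median)} \\
\text{delta loss:}\quad
  \sum_{\theta} L(\theta, \hat{\theta}) \, P(\theta \mid \text{data})
  &= 1 - P(\hat{\theta} \mid \text{data})
  \quad \text{(smallest at the posterior mode)}
\end{align*}
```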
Distances between trees

[Figure: two trees on the leaves A, B, C, D and E (trees from the article)]

RF distance
    DE|ABC and CD|ABE
    total 2 branches

Quartet distance
    AC|DE and AE|CD
    BC|DE and BE|CD
    4 quartets are different

Path difference (number of speciations between trees)
    path from A to E is one edge longer in one tree than the other
    (...)
    the overall difference is 6

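The three counts above can be reproduced with a small sketch. The topologies are an assumption: the unrooted trees ((A,B),C,(D,E)) and ((A,B),E,(C,D)) were chosen because they are consistent with the splits and quartets listed, and the internal node names `u`, `v`, `w` are hypothetical. The path difference is taken here as the sum of absolute differences in edge counts between leaf pairs.

```python
from itertools import combinations

# Assumed topologies: each internal node lists its three neighbours.
TREE1 = {"u": ["A", "B", "v"], "v": ["u", "C", "w"], "w": ["v", "D", "E"]}
TREE2 = {"u": ["A", "B", "v"], "v": ["u", "E", "w"], "w": ["v", "C", "D"]}
LEAVES = "ABCDE"

def adjacency(tree):
    """Turn the internal-node description into a symmetric adjacency map."""
    adj = {}
    for node, neighbours in tree.items():
        for other in neighbours:
            adj.setdefault(node, set()).add(other)
            adj.setdefault(other, set()).add(node)
    return adj

def path_lengths(tree):
    """Number of edges between every pair of leaves (breadth-first search)."""
    adj, lengths = adjacency(tree), {}
    for start in LEAVES:
        dist, queue = {start: 0}, [start]
        for node in queue:  # the queue grows while we iterate over it
            for nxt in adj[node]:
                if nxt not in dist:
                    dist[nxt] = dist[node] + 1
                    queue.append(nxt)
        for end in LEAVES:
            if start < end:
                lengths[start, end] = dist[end]
    return lengths

def splits(tree):
    """Non-trivial bipartitions of the leaves, one per internal edge."""
    adj, result = adjacency(tree), set()
    for a in tree:
        for b in adj[a]:
            if b in tree and a < b:  # internal edge a-b, visited once
                seen, stack, side = {a, b}, [a], set()
                while stack:  # leaves reachable from a without crossing a-b
                    for nxt in adj[stack.pop()]:
                        if nxt not in seen:
                            seen.add(nxt)
                            stack.append(nxt)
                            if nxt in LEAVES:
                                side.add(nxt)
                result.add(frozenset([frozenset(side), frozenset(set(LEAVES) - side)]))
    return result

def quartets(tree):
    """Resolved topology of every four-leaf subset (four-point condition)."""
    d = path_lengths(tree)
    dist = lambda x, y: d[min(x, y), max(x, y)]
    result = set()
    for a, b, c, e in combinations(LEAVES, 4):
        pairings = [((a, b), (c, e)), ((a, c), (b, e)), ((a, e), (b, c))]
        best = min(pairings, key=lambda p: dist(*p[0]) + dist(*p[1]))
        result.add(frozenset(map(frozenset, best)))
    return result

rf = len(splits(TREE1) ^ splits(TREE2))
quartet = len(quartets(TREE1) ^ quartets(TREE2))
p1, p2 = path_lengths(TREE1), path_lengths(TREE2)
path = sum(abs(p1[pair] - p2[pair]) for pair in p1)
print(rf, quartet, path)  # 2 4 6
```

Each distance is a symmetric difference or a vector difference between two encodings of the same trees (splits, quartets, path lengths), which is what makes them usable as loss functions.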
If there is a distance, there is a Bayes estimator

For points in $\mathbb{R}^n$, we know that the mean minimizes the expected squared Euclidean
distance, etc.

For phylogenies:

     there are several Euclidean distances
     the mean does not work, since the mean of several trees is generally not a valid tree

But some distances between trees also lead to "analytical" solutions:

     the consensus tree minimizes the Robinson-Foulds distance to
     the samples
     quartet puzzling minimizes the quartet distance
     the Buneman tree minimizes (I think) the dissimilarity map distance
     some of these are hard to solve as well

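A small sketch of the consensus claim, taking "consensus" to mean the majority-rule consensus (an assumption about which variant is meant). Trees are written as sets of non-trivial splits, each split named by the side that does not contain leaf A; the sample of five trees is hypothetical.

```python
from collections import Counter

# Hypothetical posterior sample on leaves A..E, for illustration only.
T1 = frozenset({frozenset("DE"), frozenset("CDE")})  # ((A,B),C,(D,E))
T2 = frozenset({frozenset("CD"), frozenset("CDE")})  # ((A,B),E,(C,D))
T3 = frozenset({frozenset("DE"), frozenset("BDE")})  # ((A,C),B,(D,E))
STAR = frozenset()                                   # fully unresolved tree
sample = [T1, T2, T2, T3, T3]

def rf(tree_a, tree_b):
    """Robinson-Foulds distance: splits present in exactly one of the trees."""
    return len(tree_a ^ tree_b)

def total_rf(tree, trees):
    """Summed RF distance from `tree` to every sampled tree."""
    return sum(rf(tree, other) for other in trees)

def majority_rule(trees):
    """Keep the splits found in more than half of the sampled trees."""
    counts = Counter(split for tree in trees for split in tree)
    return frozenset(s for s, c in counts.items() if c > len(trees) / 2)

consensus = majority_rule(sample)
print(sorted("".join(sorted(s)) for s in consensus))  # ['CDE', 'DE']
for name, tree in [("consensus", consensus), ("T1", T1), ("T2", T2),
                   ("T3", T3), ("star", STAR)]:
    print(name, total_rf(tree, sample))
```

In this sample the consensus coincides with T1, the least frequent tree, yet it has the smallest summed RF distance (8 against 10 for the others), so the RF-based estimate need not be the most frequently sampled topology.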
How do they find, then, the Bayes estimates?

    like many other programs: hill-climbing on the space of possible
    topologies
    their input data is a sample from the posterior distribution of trees, from MrBayes
    the starting tree can be NJ, MAP tree, ML...
    apply a branch swap (NNI) to the current optimal tree, then check its distance
    to all samples
           the distance used is the path difference (matrix subtraction)
           no need to recalculate the distance to all samples, just to the matrix with
           average values

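The shortcut in the last point can be checked on a toy case. This sketch assumes the loss is the squared Euclidean norm of the path-difference vector, for which the summed loss over N samples equals N times the loss to the averaged vector plus a constant. It also swaps the NNI hill-climbing for an exhaustive search over the 15 unrooted binary trees on five leaves, which is only feasible for such tiny examples. The sample and all function names are hypothetical.

```python
from itertools import combinations

LEAVES = "ABCDE"
PAIRS = list(combinations(LEAVES, 2))

def all_trees():
    """The 15 unrooted binary trees on five leaves: a middle leaf plus two cherries."""
    trees = []
    for middle in LEAVES:
        rest = [x for x in LEAVES if x != middle]
        for partner in rest[1:]:
            cherry1 = frozenset((rest[0], partner))
            trees.append((middle, cherry1, frozenset(rest) - cherry1))
    return trees

def path_vector(tree):
    """Edges between leaf pairs: 2 within a cherry, 3 to the middle leaf, 4 across cherries."""
    middle, cherry1, cherry2 = tree
    vec = []
    for a, b in PAIRS:
        if middle in (a, b):
            vec.append(3)
        elif {a, b} == cherry1 or {a, b} == cherry2:
            vec.append(2)
        else:
            vec.append(4)
    return vec

def sq_dist(u, v):
    return sum((x - y) ** 2 for x, y in zip(u, v))

# Hypothetical posterior sample, for illustration only.
t1 = ("C", frozenset("AB"), frozenset("DE"))
t2 = ("E", frozenset("AB"), frozenset("CD"))
t3 = ("B", frozenset("AC"), frozenset("DE"))
sample = [t1, t1, t1, t2, t2, t3]
vectors = [path_vector(t) for t in sample]

# Averaged path-length matrix, stored as one value per leaf pair.
mean_vec = [sum(col) / len(vectors) for col in zip(*vectors)]

def loss_all_samples(tree):
    """Summed squared path difference to every sampled tree."""
    v = path_vector(tree)
    return sum(sq_dist(v, s) for s in vectors)

def loss_to_mean(tree):
    """Squared path difference to the averaged matrix only."""
    return sq_dist(path_vector(tree), mean_vec)

trees = all_trees()
best_full = min(trees, key=loss_all_samples)
best_mean = min(trees, key=loss_to_mean)
print(best_full == best_mean, best_mean[0], sorted(map(sorted, best_mean[1:])))
```

Scoring a candidate against the averaged matrix costs one vector comparison instead of one per sample, which matters when every branch swap has to be scored against thousands of sampled trees.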

Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 

Recently uploaded (20)

GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 

Journal Club @ UVigo 2011.07.22

  • 1. Journal Club – Bayes Estimators for Phylogenetic Reconstruction. Syst. Biol. 60(4), 528-540, 2011, doi 10.1093/sysbio/syr021. Leonardo de O. Martins, University of Vigo, July 22, 2011. Leo Martins (Univ. Vigo) Journal Club 22/7 1 / 12
  • 2. Outline: (1) Distance as a penalty; (2) Distances, everywhere; (3) No phylogenetics, yet...; (4) Trees as points in space; (5) To the paper, then.
  • 3-6. Statistical Risk. The risk $\rho$ associated with a decision $\hat{\theta}$ (which can be, for instance, an estimate of $\theta$) is the expected loss of this decision: $\rho(\hat{\theta}) = \int L(\theta, \hat{\theta})\, P(\theta \mid \text{data})\, d\theta$ (properly called the posterior expected loss). The loss function $L(\theta, \hat{\theta})$ is a penalty we give for "deciding" away from the parameter; examples are the squared loss and the absolute loss. For some loss functions we can calculate the best decision, i.e. the one that minimizes the risk for any data.
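The definition of risk on the slide above can be tried out numerically. The following is only a sketch under stated assumptions: the posterior is replaced by a made-up sample from an exponential distribution, the integral by an average over that sample, and the names `expected_loss`, `squared_loss` and `absolute_loss` are hypothetical helpers, not anything from the paper.

```python
import random
import statistics

def expected_loss(decision, samples, loss):
    """Monte Carlo estimate of the posterior expected loss of `decision`."""
    return sum(loss(theta, decision) for theta in samples) / len(samples)

def squared_loss(theta, decision):
    return (theta - decision) ** 2

def absolute_loss(theta, decision):
    return abs(theta - decision)

# Hypothetical posterior sample: a skewed (exponential) distribution,
# chosen so that the mean and the median differ visibly.
rng = random.Random(1)
samples = [rng.expovariate(1.0) for _ in range(2000)]

# Candidate decisions on a coarse grid from 0.00 to 3.00.
grid = [i / 100 for i in range(301)]
best_squared = min(grid, key=lambda d: expected_loss(d, samples, squared_loss))
best_absolute = min(grid, key=lambda d: expected_loss(d, samples, absolute_loss))

print("squared loss  minimizer:", best_squared, "| sample mean:  ", statistics.mean(samples))
print("absolute loss minimizer:", best_absolute, "| sample median:", statistics.median(samples))
```

Up to the grid resolution, the two minimizers should land on the sample mean and the sample median respectively.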
  • 8-10. How to summarise a collection of objects? Three captions accompany the same plotting code: scattered points; centroid: minimizes a distance to all points; regression line: minimizes a distance to all points. The R code shown is:
      library(MASS)
      # Sigma must be a symmetric covariance matrix; the slide had off-diagonals 0.8 and 0.9, and 0.9 is used for both here (an assumption)
      x <- mvrnorm(n = 1000, mu = c(0, 0), Sigma = matrix(c(1, 0.9, 0.9, 1), 2, 2, byrow = TRUE))
      # scatter plot of the simulated bivariate normal sample
      plot(x[, 1], x[, 2], pch = ".", cex = 2, xlab = "x", ylab = "y")
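The claim that both summaries minimize a distance to all points can be checked directly. This is a minimal sketch, assuming squared Euclidean distance for the centroid and squared vertical distance (ordinary least squares) for the regression line; the slides do not say which distance was meant, and the simulated cloud below is a stand-in for the one plotted there.

```python
import random

rng = random.Random(0)
# Hypothetical correlated 2-D sample (y depends linearly on x plus noise).
xs = [rng.gauss(0, 1) for _ in range(1000)]
ys = [0.9 * x + rng.gauss(0, 0.4) for x in xs]
n = len(xs)

def sum_sq_to_point(px, py):
    """Sum of squared Euclidean distances from all points to (px, py)."""
    return sum((x - px) ** 2 + (y - py) ** 2 for x, y in zip(xs, ys))

def sum_sq_to_line(intercept, slope):
    """Sum of squared vertical distances from all points to the line."""
    return sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))

# Centroid: the coordinate-wise mean.
cx, cy = sum(xs) / n, sum(ys) / n

# Ordinary least-squares line through the cloud.
sxx = sum((x - cx) ** 2 for x in xs)
sxy = sum((x - cx) * (y - cy) for x, y in zip(xs, ys))
slope = sxy / sxx
intercept = cy - slope * cx

# Moving either summary away from its optimum increases the penalty.
print(sum_sq_to_point(cx, cy) < sum_sq_to_point(cx + 0.1, cy - 0.1))
print(sum_sq_to_line(intercept, slope) < sum_sq_to_line(intercept + 0.1, slope))
print(sum_sq_to_line(intercept, slope) < sum_sq_to_line(intercept, slope + 0.1))
```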
  • 12-13. How to summarise the posterior distribution P(X)? Posterior mean: minimizes the expected loss under a squared loss function, $L(\theta, \hat{\theta}) = (\theta - \hat{\theta})^2$ (squared Euclidean distance).
  • 14. How to summarise the posterior distribution P(X)? Posterior median: minimizes the expected loss under a linear (absolute) loss function, $L(\theta, \hat{\theta}) = |\theta - \hat{\theta}|$ (Manhattan distance).
  • 15. How to summarise the posterior distribution P(X)? Posterior mode, a.k.a. maximum a posteriori (MAP) estimate: minimizes the expected loss under a delta (0-1) loss function, $L(\theta, \hat{\theta}) = 0$ for $\hat{\theta} = \theta$ and $L(\theta, \hat{\theta}) = 1$ for $\hat{\theta} \neq \theta$.
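Why the mean and the median appear as minimizers can be seen by differentiating the posterior expected loss with respect to the decision. The short derivation below is a sketch, assuming a continuous posterior with a density and that differentiation under the integral sign is allowed.

```latex
\begin{align*}
\frac{\partial}{\partial\hat\theta}\int (\theta-\hat\theta)^2\,P(\theta\mid\text{data})\,d\theta
  &= -2\int (\theta-\hat\theta)\,P(\theta\mid\text{data})\,d\theta = 0
  &&\Longrightarrow\quad \hat\theta = \int \theta\,P(\theta\mid\text{data})\,d\theta = E[\theta\mid\text{data}],\\
\frac{\partial}{\partial\hat\theta}\int |\theta-\hat\theta|\,P(\theta\mid\text{data})\,d\theta
  &= P(\theta<\hat\theta\mid\text{data}) - P(\theta>\hat\theta\mid\text{data}) = 0
  &&\Longrightarrow\quad P(\theta<\hat\theta\mid\text{data}) = \tfrac12 .
\end{align*}
```

The first line gives the posterior mean, the second the posterior median.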
  • 17-21. Distances between trees. [Figure: two unrooted trees on the taxa A, B, C, D and E; trees from the article.] RF distance: the splits DE|ABC and CD|ABE each occur in only one of the trees, for a total of 2 branches. Quartet distance: AC|DE versus AE|CD, and BC|DE versus BE|CD; 4 quartets are different. Path difference (based on the number of speciations separating each pair of taxa): the path from A to E is one edge longer in one tree than in the other (...); the overall difference is 6.
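The three numbers quoted on the slides above can be reproduced in a few lines. This is a sketch under one explicit assumption: the figure did not survive extraction, so the two trees are taken to be ((A,B),C,(D,E)) and ((A,B),E,(C,D)), which is consistent with the splits and quartets listed. The internal node names `u`, `v`, `w` and all function names are hypothetical. Differences are counted as symmetric differences of split and quartet sets, and the path difference as the sum of absolute differences of path lengths (here every non-zero difference is 1, so the sum of squared differences gives the same value).

```python
from collections import deque
from itertools import combinations

# Assumed topologies, written as edge lists.
TREE_1 = [("A", "u"), ("B", "u"), ("u", "v"), ("C", "v"), ("v", "w"), ("D", "w"), ("E", "w")]
TREE_2 = [("A", "u"), ("B", "u"), ("u", "v"), ("E", "v"), ("v", "w"), ("C", "w"), ("D", "w")]
TAXA = "ABCDE"

def adjacency(edges):
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    return adj

def hops_from(adj, start, blocked=None):
    """Breadth-first search: edges from `start` to each reachable node,
    optionally ignoring one (undirected) blocked edge."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adj[node]:
            if blocked and {node, nxt} == set(blocked):
                continue
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    return dist

def splits(edges):
    """Non-trivial bipartitions of the taxa, one per internal edge."""
    adj = adjacency(edges)
    found = set()
    for a, b in edges:
        side = frozenset(t for t in hops_from(adj, a, blocked=(a, b)) if t in TAXA)
        other = frozenset(TAXA) - side
        if len(side) > 1 and len(other) > 1:
            found.add(frozenset([side, other]))
    return found

def path_lengths(edges):
    """Number of edges on the path between every pair of taxa."""
    adj = adjacency(edges)
    return {(s, t): hops_from(adj, s)[t] for s, t in combinations(TAXA, 2)}

def quartets(edges):
    """Quartet topologies, found with the four-point condition."""
    d = path_lengths(edges)

    def dist(s, t):
        return d[tuple(sorted((s, t)))]

    found = set()
    for a, b, c, e in combinations(TAXA, 4):
        pairings = [((a, b), (c, e)), ((a, c), (b, e)), ((a, e), (b, c))]
        # the pairing displayed by the tree has the smallest within-pair sum
        best = min(pairings, key=lambda p: dist(*p[0]) + dist(*p[1]))
        found.add(frozenset([frozenset(best[0]), frozenset(best[1])]))
    return found

rf_distance = len(splits(TREE_1) ^ splits(TREE_2))
quartet_distance = len(quartets(TREE_1) ^ quartets(TREE_2))
p1, p2 = path_lengths(TREE_1), path_lengths(TREE_2)
path_difference = sum(abs(p1[k] - p2[k]) for k in p1)
print(rf_distance, quartet_distance, path_difference)
```

With the assumed topologies the script prints `2 4 6`, matching the values on the slides.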
  • 23-28. If there is a distance, there is a Bayes estimator. For points in $R^n$, we know that the mean minimizes the expected squared Euclidean distance, etc. For phylogenies: there are several Euclidean distances, but the mean does not work since a tree has restrictions (the average of several trees need not be a tree). Some distances between trees nevertheless lead to "analytical" solutions: the (majority-rule) consensus tree minimizes the Robinson-Foulds distance to the samples; quartet puzzling minimizes the quartet distance; the Buneman tree minimizes (I think) the dissimilarity map distance. Some of these are hard to solve as well.
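Written out for trees, the statement "if there is a distance, there is a Bayes estimator" amounts to the following. This is a sketch in assumed notation: $d$ stands for whichever tree distance is used as the loss, and $T_1, \dots, T_N$ for trees sampled from the posterior; neither symbol appears on the slides.

```latex
\[
\hat{T} \;=\; \arg\min_{T}\; \sum_{T'} d(T, T')\, P(T' \mid \text{data})
\;\approx\; \arg\min_{T}\; \frac{1}{N}\sum_{i=1}^{N} d(T, T_i)
\]
```

The sum on the left runs over all possible trees; the approximation on the right replaces it with an average over the posterior sample.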
  • 29-34. How do they find, then, the Bayes estimates? Like many other programs, by hill-climbing on the space of possible topologies: (1) the input data is the posterior distribution of trees from MrBayes; (2) the starting tree can be NJ, the MAP tree, ML...; (3) apply a branch swap (NNI) to the current optimal tree, then verify the distance to all samples; (4) the distance used is the path difference (matrix subtraction); (5) there is no need to recalculate the distance to all samples, just to the matrix with average values.
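The last two points of the slide above can be illustrated on a toy case. What follows is a sketch, not the authors' software: it assumes the loss is the squared Euclidean norm of the path-difference vector, in which case the mean squared distance to all samples equals the squared distance to the average vector plus a constant that does not depend on the candidate tree. It is restricted to five taxa, where every unrooted binary tree has the shape ((a,b),c,(d,e)) and each NNI move swaps the central taxon with one cherry leaf. The posterior sample, its frequencies, the starting tree and all function names are made up.

```python
from itertools import combinations

TAXA = "ABCDE"
PAIRS = list(combinations(TAXA, 2))

def make_tree(cherry1, center, cherry2):
    """A 5-taxon unrooted tree ((a,b),c,(d,e)) in a canonical, hashable form."""
    return (frozenset([frozenset(cherry1), frozenset(cherry2)]), center)

def path_vector(tree):
    """Path lengths (in edges) per taxon pair: 2 within a cherry,
    3 to the central taxon, 4 across the two cherries."""
    cherries, center = tree
    vec = []
    for s, t in PAIRS:
        if center in (s, t):
            vec.append(3)
        elif any(s in ch and t in ch for ch in cherries):
            vec.append(2)
        else:
            vec.append(4)
    return vec

def nni_neighbours(tree):
    """All four NNI neighbours: swap the central taxon with one cherry leaf."""
    cherries, center = tree
    ch1, ch2 = (sorted(ch) for ch in cherries)
    out = []
    for mine, other in ((ch1, ch2), (ch2, ch1)):
        for leaf in mine:
            kept = [x for x in mine if x != leaf]
            out.append(make_tree(kept + [center], leaf, other))
    return out

def sq_dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

def hill_climb(start, target):
    """Greedy search: move to the best NNI neighbour while it is closer to `target`."""
    current = start
    while True:
        best = min(nni_neighbours(current), key=lambda t: sq_dist(path_vector(t), target))
        if sq_dist(path_vector(best), target) >= sq_dist(path_vector(current), target):
            return current
        current = best

# Hypothetical posterior sample of topologies (made-up frequencies).
samples = (6 * [make_tree("AB", "C", "DE")]
           + 3 * [make_tree("AB", "E", "CD")]
           + 1 * [make_tree("AC", "B", "DE")])
vectors = [path_vector(t) for t in samples]
average = [sum(col) / len(vectors) for col in zip(*vectors)]

estimate = hill_climb(make_tree("AD", "B", "CE"), average)

def mean_sq_to_samples(tree):
    """The expensive score: average squared distance to every sampled tree."""
    v = path_vector(tree)
    return sum(sq_dist(v, s) for s in vectors) / len(vectors)

# Constant term: spread of the samples around their average vector.
spread = sum(sq_dist(s, average) for s in vectors) / len(vectors)
print(mean_sq_to_samples(estimate), sq_dist(path_vector(estimate), average) + spread)
```

The two printed numbers agree up to rounding, which is why each candidate only needs to be compared with the averaged matrix. As with any hill-climbing search, the result is a local optimum and may depend on the starting tree.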