SlideShare una empresa de Scribd logo
1 de 38
1




MACHINE LEARNING FOR
MEDICAL IMAGING DATA
Yiou (Leo) Li
Background
2



    Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory,
    University of California San Francisco
    •   Developed unsupervised learning method for feature extraction of brain
        imaging data
    •   Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive
        modeling of brain trauma
    •   Designed batch data processing protocol to perform image registration,
        segmentation, band-pass filtering, smoothing, and linear model fitting


    Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal
    processing Laboratory, University of Maryland Baltimore County
    •   Developed the effective degrees of freedom of random process and applied it
        to the model order selection by Information Theoretic Criteria
    •   Developed a linear filtering mechanism in independent component analysis for
        feature enhancement
    •   Analyzed canonical correlation analysis for multiple datasets
Outline
3




     Independent component analysis (ICA) and its
      application to sparse feature extraction from
      multivariate dataset


     Multi-set canonical correlation analysis and its
      application to joint pattern extraction from a group of
      datasets


     Order selection of principal component analysis (PCA)
      and its application to data dimension reduction
PCA vs ICA
4




                  PCA                               ICA
           Linear projection                 Linear projection
             (Orthogonal)
       Uncorrelated components           Independent components
             (non sparse)             (sparse, “long tail” distribution)
      Typically analytical solution     Typically iterative solution
                 (SVD)                    (Iterative optimization)
ICA detects independent factors with
    long tails in multivariate dataset
5
Long tail factors are sparse features in
    data samples
6



                                          Weights of
                                           features
                  Data points (N)

                                    ICA
    Sensors (M)         X            =       A         .         S




                                                           Sparse features



                                     X= AS
ICA model
7


           x1   a11 a12 ... a1M   s1 
          x  a       a 22   a 2M   s 2 
           2    21               
           ...        ...          ... 
                                   
           x M  a M1 a M2   a MM  s M 


                x : Observed variables
                A : Mixing matrix
                s : Latent factors



                  x= As -> s =A-1x
ICA by maximum likelihood estimation
8


    Transformation of multivariate random variable: x = As
                              p(s 1, s2 , ... , sM )
    p(x 1,x 2 , ... , x M )                                 (1)
                                    det(A)
    Statistical independence condition of s:

    p(s 1, s2 , ... , sM )  i 1 p(si )
                                          M
                                                             (2)

    Log likelihood function of x with parameter A:

    log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A)
                                                      -1

                                      i
ICA Application: Sparse feature extraction from
    multivariate dataset
9
Functional MRI experiment
10
Analyze functional MRI data of resting
     state brain
11




                              Sparse features




                    ICA
Feature 1. Primary visual network
12



 +
                                         A




 -
Feature 2. “Default mode network”
13
Feature 3. Attention control network
14
Hierarchical clustering shows link
15
     between features (brain regions)
Predicative modeling of brain trauma
16

                                                 Pattern weights
                                 N

       Healthy
                                 X           =      A        .                 S
       Patients

                                                                               Sparse
                                                                           spatial features


     Subject 1

                             …
                 Subject 2
          16                                                                       Pattern 2
                                                                   Feature 1                   …
                                 Subject M                                         Feature 2

                                                           Y.-O. Li, et al., HBM, 2011
ICA Pattern classification for predictive
     modeling of brain trauma
17




      • 29 healthy + 29 trauma, 10-fold cross-validation


             Classifier                9 patterns                14 patterns
                                   Classification error      Classification error
            Naïve Bayes                0.35+/-0.03               0.32+/-0.03

        K nearest neighbor             0.29+/-0.02               0.30+/-0.03

      Support vector classifier         0.36+/-0.02               0.30 +/-0.02
                                  (c=1, number of SV: 46)   (c=1, number of SV: 20)
Outline
18




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to dimension reduction
Joint pattern extraction requires coherency
     on extracted patterns across datasets
19


               Model:   x k =Aksk , k=1,2,...,M




                                     Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation analysis
20




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Multi-set canonical correlation
     analysis
21




          Correlation matrix of [S1,S2, … SM]

                                        Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Application: joint pattern extraction from a
     group of datasets
22




     •   Analyze group functional MRI data from
         simulated driving experiment
Simulated driving experiment
23

     •   Forty subjects, three repeated sessions (120 datasets)
     •   Experiment paradigm:




     •   Behavioral records:
          •   Average speed (AS)
          •   Differential of speed (DS)
          •   Average steering offset (AR)
          •   Differential steering offset (DR)
          •   Differential pedal offset (DP)
          •   Occurrence of yellow line crossing (YLC)
          •   Occurrence of white passenger-side line crossing (WPLC)

                                                    Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step I: M-CCA for joint feature extraction
24




                           Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Step II: PCA and behavioral association
25




                          Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Pattern 1: Primary visual function
26




                                                   D = 0:85
                                                   W = 0:42




                        95% CI of behavioral association
Pattern 2: “default mode network”
27




                                                  D = -0.63
                                                  W = -0.39




                      95% CI of behavioral association
Pattern 3: Motor coordination
28




                                                  D = 0.86
                                                  W = 0.15




                       95% CI of behavioral association
Pattern 4: Executive control network
29




                                                 D = 0.64
                                                 W = 0.61




                       95% CI of behavioral association
Cross correlation of Pattern 1
30




                       Y.-O. Li, et al., J. of Sig Proc Sys, 2011
Outline
31




      Independent component analysis (ICA) and its
       application to sparse feature extraction from
       multivariate dataset


      Multi-set canonical correlation analysis and its
       application to joint pattern extraction from a group of
       datasets


      Order selection of principal component analysis (PCA)
       and its application to data dimension reduction
Decreased reproducibility of independent
     component on high-dimensional dataset
32

          •    Functional MRI with 120 time points
          •    Twenty Monte Carlo trials of ICA algorithm
          •    Clustering the IC estimates
          •    Reproducible ICs: compact and separated clusters




        K=20                           K=40                             K=90

                                                     Y.-O. Li, et al., HBM, 2007
Dimension reduction of high-dimensional
     data by PCA
33


                                       ICA              N
               N


       M       X               =            A   .       S


                                       MxM

                       PCA dimension reduction + ICA


                           .       A    .           S
           X   =   E                                        +      N


                                       K-largest PCs            M-k PCs
Failure of information-theoretic criteria with
     uncorrected degrees of freedom
34



                        AIC, MDL
            ˆ
            k  arg min k {l ( x | k )  g (k )}

                                                                            ( M k)
                                                                    
                                                i k 1 
                                                     M       1/ ( M  k )

                          l(x |  k )  N ln  M             i
                                                                     
                                             
                                              i k 1 i / ( M  k) 
                                                                     

                                           AIC : k (2M  k )  1
                            g ( k )  
                                       MDL : 0.5  ln N (k (2M  k )  1)

                                           Y.-O. Li, et al., HBM, 2007
Estimation of degrees of freedom by
     entropy rate
35


                 Entropy rate of a Gaussian process
                                        1       
                     h( x)  ln 2 e 
                                       4        ln s()d
                                               


            h( x)  ln 2 e     iff x[n] is an i.i.d. random process




       h(x) = 0.40               h(x) = 1.28                  h(x) = 1.41
                                                     Y.-O. Li, et al., HBM, 2007
Application: Order selection of high-
     dimensional dataset
36
Corrected order selection criteria
     significantly improves order selection
37




            Original       With correction on degrees of freedom


                                    Y.-O. Li, et al., HBM, 2007
Summary
38




     •   ICA extracts useful patterns from high dimensional imaging data for
         predictive modeling

     •   M-CCA reveals patterns from several datasets in a coherent order

     •   Dimension reduction by PCA improves the reproducibility of ICA extracted
         patterns


         Exploratory multivariate analysis are promising tools for
         data mining applications

Más contenido relacionado

La actualidad más candente

7 - Model Assessment and Selection
7 - Model Assessment and Selection7 - Model Assessment and Selection
7 - Model Assessment and SelectionNikita Zhiltsov
 
Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Armando Vieira
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tenconDr. Jai Sakthi
 
Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Juan Luis Nieves
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix FactorizationTatsuya Yokota
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentJordan McBain
 
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...BigMC
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentationEnrico Scapin
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Tatsuya Yokota
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registrationzukun
 
IISc Internship Report
IISc Internship ReportIISc Internship Report
IISc Internship ReportHarshilJain26
 
08 linear classification_2
08 linear classification_208 linear classification_2
08 linear classification_2nep_test_account
 
Pres110811
Pres110811Pres110811
Pres110811shotlub
 
Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...ijma
 
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...Yusuf Bhujwalla
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filtersSandip Joardar
 

La actualidad más candente (17)

7 - Model Assessment and Selection
7 - Model Assessment and Selection7 - Model Assessment and Selection
7 - Model Assessment and Selection
 
Manifold learning for credit risk assessment
Manifold learning for credit risk assessment Manifold learning for credit risk assessment
Manifold learning for credit risk assessment
 
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
/.Amd mnt/lotus/host/home/jaishakthi/presentation/tencon/tencon
 
Eurogen v
Eurogen vEurogen v
Eurogen v
 
Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009Color Img at Prisma Network meeting 2009
Color Img at Prisma Network meeting 2009
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Condition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating EquipmentCondition Monitoring Of Unsteadily Operating Equipment
Condition Monitoring Of Unsteadily Operating Equipment
 
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
Anisotropic Metropolis Adjusted Langevin Algorithm: convergence and utility i...
 
Bytecode'13 presentation
Bytecode'13 presentationBytecode'13 presentation
Bytecode'13 presentation
 
Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)Linked CP Tensor Decomposition (presented by ICONIP2012)
Linked CP Tensor Decomposition (presented by ICONIP2012)
 
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, RegistrationCVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
CVPR2010: Advanced ITinCVPR in a Nutshell: part 4: Isocontours, Registration
 
IISc Internship Report
IISc Internship ReportIISc Internship Report
IISc Internship Report
 
08 linear classification_2
08 linear classification_208 linear classification_2
08 linear classification_2
 
Pres110811
Pres110811Pres110811
Pres110811
 
Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...Uniform and non uniform single image deblurring based on sparse representatio...
Uniform and non uniform single image deblurring based on sparse representatio...
 
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
An RKHS Approach to Systematic Kernel Selection in Nonlinear System Identific...
 
Adaptive polynomial filters
Adaptive polynomial filtersAdaptive polynomial filters
Adaptive polynomial filters
 

Destacado

Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsArtificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsSimon Harris
 
Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Miguel Cabrer
 
Machine Learning
Machine LearningMachine Learning
Machine Learningbutest
 
Explaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOExplaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOThomas Roth-Berghofer
 
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...Debdoot Sheet
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2Hossam Hassan
 
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?Debdoot Sheet
 
Artificial intelligence in medical image processing
Artificial intelligence in medical image processingArtificial intelligence in medical image processing
Artificial intelligence in medical image processingFarzad Jahedi
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzerbutest
 
Multivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaMultivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaAlexander Decker
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...NSEAkure
 
Multivariate adaptive regression splines
Multivariate adaptive regression splinesMultivariate adaptive regression splines
Multivariate adaptive regression splinesEklavya Gupta
 
What determines Sales of a Product?
What determines Sales of a Product?What determines Sales of a Product?
What determines Sales of a Product?PRIYAJNVCTC
 
Multivariate Data Analysis
Multivariate Data AnalysisMultivariate Data Analysis
Multivariate Data AnalysisMerul Romadhani
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysisDushyant Bheda
 
Reverse Logistics in Different Industries
Reverse Logistics in Different IndustriesReverse Logistics in Different Industries
Reverse Logistics in Different IndustriesPRIYAJNVCTC
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_financeStefan Duprey
 
Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionTe-Yen Liu
 

Destacado (20)

Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-upsArtificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
Artificial Intelligence in Medical Imaging: An Analysis of Funding for Start-ups
 
Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)Medical Exchange MEDTING (April 2010)
Medical Exchange MEDTING (April 2010)
 
Machine Learning
Machine LearningMachine Learning
Machine Learning
 
Explaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICOExplaining Semantic Search Results of Medical Images in MEDICO
Explaining Semantic Search Results of Medical Images in MEDICO
 
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...Whispers of Speckles (Part I: Building Computational Imaging Frameworks for ...
Whispers of Speckles ( Part I: Building Computational Imaging Frameworks for ...
 
Introduction to digital signal processing 2
Introduction to digital signal processing 2Introduction to digital signal processing 2
Introduction to digital signal processing 2
 
Machine Learning for Medical Image Analysis: What, where and how?
Machine Learning for Medical Image Analysis:What, where and how?Machine Learning for Medical Image Analysis:What, where and how?
Machine Learning for Medical Image Analysis: What, where and how?
 
Artificial intelligence in medical image processing
Artificial intelligence in medical image processingArtificial intelligence in medical image processing
Artificial intelligence in medical image processing
 
Jörg Stelzer
Jörg StelzerJörg Stelzer
Jörg Stelzer
 
Multivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeriaMultivariate regression techniques for analyzing auto crash variables in nigeria
Multivariate regression techniques for analyzing auto crash variables in nigeria
 
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
Multivariate Linear Regression Model for Simulaneous Estimation of Debutanise...
 
Multivariate adaptive regression splines
Multivariate adaptive regression splinesMultivariate adaptive regression splines
Multivariate adaptive regression splines
 
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLSMULTIVARIATE STATISTICAL MODELS’ SYMBOLS
MULTIVARIATE STATISTICAL MODELS’ SYMBOLS
 
What determines Sales of a Product?
What determines Sales of a Product?What determines Sales of a Product?
What determines Sales of a Product?
 
Multivariate Data Analysis
Multivariate Data AnalysisMultivariate Data Analysis
Multivariate Data Analysis
 
Multiple regression analysis
Multiple regression analysisMultiple regression analysis
Multiple regression analysis
 
Reverse Logistics in Different Industries
Reverse Logistics in Different IndustriesReverse Logistics in Different Industries
Reverse Logistics in Different Industries
 
Audio mining
Audio miningAudio mining
Audio mining
 
Machine learning for_finance
Machine learning for_financeMachine learning for_finance
Machine learning for_finance
 
Machine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis IntroductionMachine Learning, Deep Learning and Data Analysis Introduction
Machine Learning, Deep Learning and Data Analysis Introduction
 

Similar a Machine learning for medical imaging data

Independent Component Analysis
Independent Component Analysis Independent Component Analysis
Independent Component Analysis Ibrahim Amer
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Mostafa G. M. Mostafa
 
DimensionalityReduction.pptx
DimensionalityReduction.pptxDimensionalityReduction.pptx
DimensionalityReduction.pptx36rajneekant
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleHakka Labs
 
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationUnited States Air Force Academy
 
Principal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionPrincipal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionJordan McBain
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxSivam Chinna
 
Distributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedDistributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedPei-Che Chang
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdfNarenRajVivek
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...ICAC09
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)IJERD Editor
 

Similar a Machine learning for medical imaging data (20)

Independent Component Analysis
Independent Component Analysis Independent Component Analysis
Independent Component Analysis
 
Sp18_P1.pptx
Sp18_P1.pptxSp18_P1.pptx
Sp18_P1.pptx
 
Understandig PCA and LDA
Understandig PCA and LDAUnderstandig PCA and LDA
Understandig PCA and LDA
 
Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)Neural Networks: Principal Component Analysis (PCA)
Neural Networks: Principal Component Analysis (PCA)
 
Presentation on machine learning
Presentation on machine learningPresentation on machine learning
Presentation on machine learning
 
DimensionalityReduction.pptx
DimensionalityReduction.pptxDimensionalityReduction.pptx
DimensionalityReduction.pptx
 
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at GoogleDataEngConf: Feature Extraction: Modern Questions and Challenges at Google
DataEngConf: Feature Extraction: Modern Questions and Challenges at Google
 
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual IdentificationSubspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
 
Ph.D. Presentation
Ph.D. PresentationPh.D. Presentation
Ph.D. Presentation
 
Dbm630 lecture09
Dbm630 lecture09Dbm630 lecture09
Dbm630 lecture09
 
Principal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty DetectionPrincipal Component Analysis For Novelty Detection
Principal Component Analysis For Novelty Detection
 
Dimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptxDimensionality Reduction and feature extraction.pptx
Dimensionality Reduction and feature extraction.pptx
 
Distributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and RelatedDistributed Architecture of Subspace Clustering and Related
Distributed Architecture of Subspace Clustering and Related
 
Conference_paper.pdf
Conference_paper.pdfConference_paper.pdf
Conference_paper.pdf
 
tutorial.ppt
tutorial.ppttutorial.ppt
tutorial.ppt
 
Spatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understandingSpatio-temporal reasoning for traffic scene understanding
Spatio-temporal reasoning for traffic scene understanding
 
190 195
190 195190 195
190 195
 
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...Multiple Kernel Learning based Approach to Representation and Feature Selecti...
Multiple Kernel Learning based Approach to Representation and Feature Selecti...
 
Sunum
SunumSunum
Sunum
 
Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)Welcome to International Journal of Engineering Research and Development (IJERD)
Welcome to International Journal of Engineering Research and Development (IJERD)
 

Último

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 

Último (20)

TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 

Machine learning for medical imaging data

  • 1. 1 MACHINE LEARNING FOR MEDICAL IMAGING DATA Yiou (Leo) Li
  • 2. Background 2 Post-doctoral fellow, 07/2009-Present, Neural connectivity Laboratory, University of California San Francisco • Developed unsupervised learning method for feature extraction of brain imaging data • Applied supervised learning (Naïve Bayes, SVM, Random Forest) for predictive modeling of brain trauma • Designed batch data processing protocol to perform image registration, segmentation, band-pass filtering, smoothing, and linear model fitting Graduate Research Assistant, 08/2002-06/2009, Machine learning for signal processing Laboratory, University of Maryland Baltimore County • Developed the effective degrees of freedom of random process and applied it to the model order selection by Information Theoretic Criteria • Developed a linear filtering mechanism in independent component analysis for feature enhancement • Analyzed canonical correlation analysis for multiple datasets
  • 3. Outline 3  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 4. PCA vs ICA 4 PCA ICA Linear projection Linear projection (Orthogonal) Uncorrelated components Independent components (non sparse) (sparse, “long tail” distribution) Typically analytical solution Typically iterative solution (SVD) (Iterative optimization)
  • 5. ICA detects independent factors with long tails in multivariate dataset 5
  • 6. Long tail factors are sparse features in data samples 6 Weights of features Data points (N) ICA Sensors (M) X = A . S Sparse features X= AS
  • 7. ICA model 7  x1   a11 a12 ... a1M   s1  x  a a 22 a 2M   s 2   2    21    ...   ...   ...         x M  a M1 a M2 a MM  s M  x : Observed variables A : Mixing matrix s : Latent factors x= As -> s =A-1x
  • 8. ICA by maximum likelihood estimation 8 Transformation of multivariate random variable: x = As p(s 1, s2 , ... , sM ) p(x 1,x 2 , ... , x M )  (1) det(A) Statistical independence condition of s: p(s 1, s2 , ... , sM )  i 1 p(si ) M (2) Log likelihood function of x with parameter A: log p(x 1,x 2 ,...x M )   log p([A x] i )  log det(A) -1 i
  • 9. ICA Application: Sparse feature extraction from multivariate dataset 9
  • 11. Analyze functional MRI data of resting state brain 11 Sparse features ICA
  • 12. Feature 1. Primary visual network 12 + A -
  • 13. Feature 2. “Default mode network” 13
  • 14. Feature 3. Attention control network 14
  • 15. Hierarchical clustering shows link 15 between features (brain regions)
  • 16. Predicative modeling of brain trauma 16 Pattern weights N Healthy X = A . S Patients Sparse spatial features Subject 1 … Subject 2 16 Pattern 2 Feature 1 … Subject M Feature 2 Y.-O. Li, et al., HBM, 2011
  • 17. ICA Pattern classification for predictive modeling of brain trauma 17 • 29 healthy + 29 trauma, 10-fold cross-validation Classifier 9 patterns 14 patterns Classification error Classification error Naïve Bayes 0.35+/-0.03 0.32+/-0.03 K nearest neighbor 0.29+/-0.02 0.30+/-0.03 Support vector classifier 0.36+/-0.02 0.30 +/-0.02 (c=1, number of SV: 46) (c=1, number of SV: 20)
  • 18. Outline 18  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to dimension reduction
  • 19. Joint pattern extraction requires coherency on extracted patterns across datasets 19 Model: x k =Aksk , k=1,2,...,M Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 20. Multi-set canonical correlation analysis 20 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 21. Multi-set canonical correlation analysis 21 Correlation matrix of [S1,S2, … SM] Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 22. Application: joint pattern extraction from a group of datasets 22 • Analyze group functional MRI data from simulated driving experiment
  • 23. Simulated driving experiment 23 • Forty subjects, three repeated sessions (120 datasets) • Experiment paradigm: • Behavioral records: • Average speed (AS) • Differential of speed (DS) • Average steering offset (AR) • Differential steering offset (DR) • Differential pedal offset (DP) • Occurrence of yellow line crossing (YLC) • Occurrence of white passenger-side line crossing (WPLC) Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 24. Step I: M-CCA for joint feature extraction 24 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 25. Step II: PCA and behavioral association 25 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 26. Pattern 1: Primary visual function 26 D = 0:85 W = 0:42 95% CI of behavioral association
  • 27. Pattern 2: “default mode network” 27 D = -0.63 W = -0.39 95% CI of behavioral association
  • 28. Pattern 3: Motor coordination 28 D = 0.86 W = 0.15 95% CI of behavioral association
  • 29. Pattern 4: Executive control network 29 D = 0.64 W = 0.61 95% CI of behavioral association
  • 30. Cross correlation of Pattern 1 30 Y.-O. Li, et al., J. of Sig Proc Sys, 2011
  • 31. Outline 31  Independent component analysis (ICA) and its application to sparse feature extraction from multivariate dataset  Multi-set canonical correlation analysis and its application to joint pattern extraction from a group of datasets  Order selection of principal component analysis (PCA) and its application to data dimension reduction
  • 32. Decreased reproducibility of independent component on high-dimensional dataset 32 • Functional MRI with 120 time points • Twenty Monte Carlo trials of ICA algorithm • Clustering the IC estimates • Reproducible ICs: compact and separated clusters K=20 K=40 K=90 Y.-O. Li, et al., HBM, 2007
  • 33. Dimension reduction of high-dimensional data by PCA 33 ICA N N M X = A . S MxM PCA dimension reduction + ICA . A . S X = E + N K-largest PCs M-k PCs
  • 34. Failure of information-theoretic criteria with uncorrected degrees of freedom 34 AIC, MDL ˆ k  arg min k {l ( x | k )  g (k )} ( M k)    i k 1  M 1/ ( M  k ) l(x |  k )  N ln  M i    i k 1 i / ( M  k)    AIC : k (2M  k )  1 g ( k )   MDL : 0.5  ln N (k (2M  k )  1) Y.-O. Li, et al., HBM, 2007
  • 35. Estimation of degrees of freedom by entropy rate 35 Entropy rate of a Gaussian process 1  h( x)  ln 2 e  4   ln s()d  h( x)  ln 2 e iff x[n] is an i.i.d. random process h(x) = 0.40 h(x) = 1.28 h(x) = 1.41 Y.-O. Li, et al., HBM, 2007
  • 36. Application: Order selection of high- dimensional dataset 36
  • 37. Corrected order selection criteria significantly improves order selection 37 Original With correction on degrees of freedom Y.-O. Li, et al., HBM, 2007
  • 38. Summary 38 • ICA extracts useful patterns from high dimensional imaging data for predictive modeling • M-CCA reveals patterns from several datasets in a coherent order • Dimension reduction by PCA improves the reproducibility of ICA extracted patterns Exploratory multivariate analysis are promising tools for data mining applications