SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
NIPS-2010
                                       @



           • b-bit Minwise Hashing for Estimating Three-
                Way Similarities. P. Li et al.

                •
           • Functional Geometry Alignment and
                Localization of Brain Areas. Langs et al.

                •
2011   2   14
b-bit Minwise Hashing for
        Estimating Three-Way
               Similarities

                • Minwise Hashing (MinHash)   ?

                • b-bit Minwise Hasing    ?




2011   2   14
Motivation
       •
            •              ,
            •
            •                          Web
            •
            •
       •                       2   (               )
       •               Minwise Hasing (MinHash) [Broder 1997]
                sign random projections (simhash) Hamming
                Distance LSH
2011   2   14
Minwise Hashing
       •                 Jaccard
                                                            |A ∩ B|
                                                  J(A, B) =
                                                            |A ∪ B|
       •
       •        Random parmutation (or Hash        ) π(x)
       •             A             π(x)        Pr[min(π(A)) = min(π(B))]
                Pr[min(π(A)) = min(π(B))] = J(A, B)

       •             A = {1, 3, 5, 7}, B = {3, 4, 5}
                       ⇒ A ∩ B = {3, 5}, A ∪ B = {1, 3, 4, 5, 7}
           •      min(h(A)) = min(h(B))                     {1,3,4,5,7}
                                     3    5
                 •   Jaccard

2011   2   14
•




           •
                                Hash                       bit


                •   Altavista                             40bit Fetterly
                    WWW03          64bit

           •                           Hash   1 or 2bit

           •
2011   2   14
•                 2
                    •
                    •   Jaccard       (0.5   )




2011   2   14
• b-Bit Minwise Hashing for Estimating
                    Three-Way Similarities NIPS2010

                • b-Bit Minwise Hashing   3
                    Jaccard
                                    |A ∩ B ∩ C|
                       J(A, B, C) =
                                    |A ∪ B ∪ C|

                •
2011   2   14
Functional Geometry Alignment
             and Localization of Brain Areas
                                 Registration based on anatomical data   Registration based on the function




                                  brain 1    registration    brain 2       brain 1   embedding           re


                                 Figure 1: Standard anatomical registration and the proposed fun
mical data     Registration basedtional geometry geometry matches the diffusion maps of fMRI
                                  on the functional alignment




                            Integrating functional features into the registration process prom
brain 2        brain 1 embedding proposed methods match the centers of activated cortica
                            cently       registration       embedding brain 2
                            correspondences of cortical surfaces [18]. The fMRI signals at t
                            vector, and registration is performed by maximizing the inter-su
mical2 registration and the proposed functional geometry alignment. Func- warp to
  2011  14
                            points, while at the same time regularizing the surface
Motivation
       •
       •                    fMRI


       •
            •
                                                  ?


                      Above-threshold region in       Above-threshold region in
2011   2   14         source subject                  target subject
• fMRI
       • Voxel                               (           Kernel)


       • Diffusion Maps
       •              Voxel

                a. Maps of two subjects




                                  s0             Ψ0          Ψ1        s1
                                Subject 1        Map 1       Map 2   Subject 2



2011   2   14   b. Aligning the point sets
Diffusion Maps
       •    Coifman and Lafon. Applied and Comp. Harmonic Analysis. 2006
       •                    PCA      Isomap
       •    Spectral Clustering

                                  •
                                  •         i,j     t                  i   Markov
                                          chain random walk        t


                                      •    Normalized Graph Laplacian
                                  •
                                          Diffusion Distance
                                  •       Diffusion Distance
                                                     N   (N    )

2011   2   14
a. Maps of two subjects




                             s0                          Ψ0                        Ψ1             s1
                           Subject 1                     Map 1                    Map 2         Subject 2



           b. Aligning the point sets

                                                                 xk
                                                                  0
                                                                          xl
                                                                           1

                                                                                                                          A.



                                                                                                                    FGA
                                                                                                            0.2
                                                                                                              0.2



                                                                      ?
       Figure 2: Maps of two subjects in the process of registration: (a) Left and right: the0.15  axial and
                                                                                               0.15
       sagittal views of the points in the two brains. The two central columns show plots of the first
       three dimensions of the embedding in the functional geometry after coarse rotational alignment. (b)
       During alignment, a maps is represented as a Gaussian mixture model. The colors in both plots
       indicate clusters which are only region in visualization. Above-threshold region in
                          Above-threshold used for                                              0.1
                                                                                                  0.1


2011   2    14                          source subject                         target subject
2011   2   14

Más contenido relacionado

Más de sesejun

RNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A ReviewRNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A Reviewsesejun
 
バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析sesejun
 
次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習sesejun
 
20110602labseminar pub
20110602labseminar pub20110602labseminar pub
20110602labseminar pubsesejun
 
20110524zurichngs 2nd pub
20110524zurichngs 2nd pub20110524zurichngs 2nd pub
20110524zurichngs 2nd pubsesejun
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pubsesejun
 
Datamining 9th association_rule.key
Datamining 9th association_rule.keyDatamining 9th association_rule.key
Datamining 9th association_rule.keysesejun
 
Datamining 8th hclustering
Datamining 8th hclusteringDatamining 8th hclustering
Datamining 8th hclusteringsesejun
 
Datamining r 4th
Datamining r 4thDatamining r 4th
Datamining r 4thsesejun
 
Datamining r 3rd
Datamining r 3rdDatamining r 3rd
Datamining r 3rdsesejun
 
Datamining r 2nd
Datamining r 2ndDatamining r 2nd
Datamining r 2ndsesejun
 
Datamining r 1st
Datamining r 1stDatamining r 1st
Datamining r 1stsesejun
 
Datamining 6th svm
Datamining 6th svmDatamining 6th svm
Datamining 6th svmsesejun
 
Datamining 5th knn
Datamining 5th knnDatamining 5th knn
Datamining 5th knnsesejun
 
Datamining 4th adaboost
Datamining 4th adaboostDatamining 4th adaboost
Datamining 4th adaboostsesejun
 
Datamining 3rd naivebayes
Datamining 3rd naivebayesDatamining 3rd naivebayes
Datamining 3rd naivebayessesejun
 
Datamining 2nd decisiontree
Datamining 2nd decisiontreeDatamining 2nd decisiontree
Datamining 2nd decisiontreesesejun
 
Datamining 7th kmeans
Datamining 7th kmeansDatamining 7th kmeans
Datamining 7th kmeanssesejun
 
100401 Bioinfoinfra
100401 Bioinfoinfra100401 Bioinfoinfra
100401 Bioinfoinfrasesejun
 
Datamining 8th Hclustering
Datamining 8th HclusteringDatamining 8th Hclustering
Datamining 8th Hclusteringsesejun
 

Más de sesejun (20)

RNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A ReviewRNAseqによる変動遺伝子抽出の統計: A Review
RNAseqによる変動遺伝子抽出の統計: A Review
 
バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析バイオインフォマティクスによる遺伝子発現解析
バイオインフォマティクスによる遺伝子発現解析
 
次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習次世代シーケンサが求める機械学習
次世代シーケンサが求める機械学習
 
20110602labseminar pub
20110602labseminar pub20110602labseminar pub
20110602labseminar pub
 
20110524zurichngs 2nd pub
20110524zurichngs 2nd pub20110524zurichngs 2nd pub
20110524zurichngs 2nd pub
 
20110524zurichngs 1st pub
20110524zurichngs 1st pub20110524zurichngs 1st pub
20110524zurichngs 1st pub
 
Datamining 9th association_rule.key
Datamining 9th association_rule.keyDatamining 9th association_rule.key
Datamining 9th association_rule.key
 
Datamining 8th hclustering
Datamining 8th hclusteringDatamining 8th hclustering
Datamining 8th hclustering
 
Datamining r 4th
Datamining r 4thDatamining r 4th
Datamining r 4th
 
Datamining r 3rd
Datamining r 3rdDatamining r 3rd
Datamining r 3rd
 
Datamining r 2nd
Datamining r 2ndDatamining r 2nd
Datamining r 2nd
 
Datamining r 1st
Datamining r 1stDatamining r 1st
Datamining r 1st
 
Datamining 6th svm
Datamining 6th svmDatamining 6th svm
Datamining 6th svm
 
Datamining 5th knn
Datamining 5th knnDatamining 5th knn
Datamining 5th knn
 
Datamining 4th adaboost
Datamining 4th adaboostDatamining 4th adaboost
Datamining 4th adaboost
 
Datamining 3rd naivebayes
Datamining 3rd naivebayesDatamining 3rd naivebayes
Datamining 3rd naivebayes
 
Datamining 2nd decisiontree
Datamining 2nd decisiontreeDatamining 2nd decisiontree
Datamining 2nd decisiontree
 
Datamining 7th kmeans
Datamining 7th kmeansDatamining 7th kmeans
Datamining 7th kmeans
 
100401 Bioinfoinfra
100401 Bioinfoinfra100401 Bioinfoinfra
100401 Bioinfoinfra
 
Datamining 8th Hclustering
Datamining 8th HclusteringDatamining 8th Hclustering
Datamining 8th Hclustering
 

Último

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 

Último (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 

20110214nips2010 read

  • 1. NIPS-2010 @ • b-bit Minwise Hashing for Estimating Three- Way Similarities. P. Li et al. • • Functional Geometry Alignment and Localization of Brain Areas. Langs et al. • 2011 2 14
  • 2. b-bit Minwise Hashing for Estimating Three-Way Similarities • Minwise Hashing (MinHash) ? • b-bit Minwise Hasing ? 2011 2 14
  • 3. Motivation • • , • • Web • • • 2 ( ) • Minwise Hasing (MinHash) [Broder 1997] sign random projections (simhash) Hamming Distance LSH 2011 2 14
  • 4. Minwise Hashing • Jaccard |A ∩ B| J(A, B) = |A ∪ B| • • Random parmutation (or Hash ) π(x) • A π(x) Pr[min(π(A)) = min(π(B))] Pr[min(π(A)) = min(π(B))] = J(A, B) • A = {1, 3, 5, 7}, B = {3, 4, 5} ⇒ A ∩ B = {3, 5}, A ∪ B = {1, 3, 4, 5, 7} • min(h(A)) = min(h(B)) {1,3,4,5,7} 3 5 • Jaccard 2011 2 14
  • 5. • Hash bit • Altavista 40bit Fetterly WWW03 64bit • Hash 1 or 2bit • 2011 2 14
  • 6. 2 • • Jaccard (0.5 ) 2011 2 14
  • 7. • b-Bit Minwise Hashing for Estimating Three-Way Similarities NIPS2010 • b-Bit Minwise Hashing 3 Jaccard |A ∩ B ∩ C| J(A, B, C) = |A ∪ B ∪ C| • 2011 2 14
  • 8. Functional Geometry Alignment and Localization of Brain Areas Registration based on anatomical data Registration based on the function brain 1 registration brain 2 brain 1 embedding re Figure 1: Standard anatomical registration and the proposed fun mical data Registration basedtional geometry geometry matches the diffusion maps of fMRI on the functional alignment Integrating functional features into the registration process prom brain 2 brain 1 embedding proposed methods match the centers of activated cortica cently registration embedding brain 2 correspondences of cortical surfaces [18]. The fMRI signals at t vector, and registration is performed by maximizing the inter-su mical2 registration and the proposed functional geometry alignment. Func- warp to 2011 14 points, while at the same time regularizing the surface
  • 9. Motivation • • fMRI • • ? Above-threshold region in Above-threshold region in 2011 2 14 source subject target subject
  • 10. • fMRI • Voxel ( Kernel) • Diffusion Maps • Voxel a. Maps of two subjects s0 Ψ0 Ψ1 s1 Subject 1 Map 1 Map 2 Subject 2 2011 2 14 b. Aligning the point sets
  • 11. Diffusion Maps • Coifman and Lafon. Applied and Comp. Harmonic Analysis. 2006 • PCA Isomap • Spectral Clustering • • i,j t i Markov chain random walk t • Normalized Graph Laplacian • Diffusion Distance • Diffusion Distance N (N ) 2011 2 14
  • 12. a. Maps of two subjects s0 Ψ0 Ψ1 s1 Subject 1 Map 1 Map 2 Subject 2 b. Aligning the point sets xk 0 xl 1 A. FGA 0.2 0.2 ? Figure 2: Maps of two subjects in the process of registration: (a) Left and right: the0.15 axial and 0.15 sagittal views of the points in the two brains. The two central columns show plots of the first three dimensions of the embedding in the functional geometry after coarse rotational alignment. (b) During alignment, a maps is represented as a Gaussian mixture model. The colors in both plots indicate clusters which are only region in visualization. Above-threshold region in Above-threshold used for 0.1 0.1 2011 2 14 source subject target subject
  • 13. 2011 2 14