SlideShare a Scribd company logo
1 of 17
Download to read offline
When	
  Does	
  Label	
  Propaga1on	
  Fail?	
  
A	
  View	
  from	
  a	
  Network	
  Genera1ve	
  Model	
Yuto	
  Yamaguchi	
  and	
  Kohei	
  Hayashi	
17/08/22	
 IJCAI@Melbourne	
 1
Node Classification	
Given	
 Find	
Partially labeled
undirected graph	
Labels of all nodes	
17/08/22	
 IJCAI@Melbourne	
 2
Example:
User profile inference	
Friends	
Soccer	
 Soccer	
Soccer	
Tennis	
Baseball	
???	
What’s his hobby?
Node Classification	
17/08/22	
 IJCAI@Melbourne	
 3
Label Propagation (1/2)	
Propagate neighbors’ labels
Friends	
Soccer	
 Soccer	
Soccer	
Tennis	
Baseline	
???	
Soccer	
 Soccer	
Soccer	
Tennis	
Baseline	
Soccer	
[Zhu+, 03], [Zhou+, 03], etc.	
17/08/22	
 IJCAI@Melbourne	
 4
Label Propagation (2/2)	
Q F;X,Y,λ( )=
1
2
fi − yi 2
2
i=1
N
∑ +
λ
2
xij fi − fj 2
2
j=1
N
∑
i=1
N
∑
Given: adjacency matrix X and labels Y
Find: F = { fi } that minimizes Q
17/08/22	
 IJCAI@Melbourne	
 5	
F ∈ RN x K
Y ∈ {0, 1}N x K	
X ∈ {0, 1}N x N
N: # of nodes
K: # of labels
λ ∈ R+ : user parameter
[Zhu+, 03], [Zhou+, 03], etc.
Cases	
  when	
  LP	
  fails	
  (prac1cally	
  known)	
Different labels
are connected	
 Label ratio is not uniform	
Q. So, do we know why LP fails in these cases?

A. No. Since it’s not a probabilistic model, we
don’t know the assumptions behind the model.
17/08/22	
 IJCAI@Melbourne	
 6	
Edge probability is not uniform
What	
  we	
  do	
  in	
  this	
  work	
1.  Prove	
  a	
  theore1cal	
  rela1onship	
  between	
  LP	
  
and	
  Stochas(c	
  Block	
  Model,	
  which	
  is	
  a	
  well-­‐
studied	
  probabilis1c	
  genera1ve	
  model	
  
2.  Find	
  the	
  assump(ons	
  behind	
  LP	
  through	
  the	
  
assump1ons	
  behind	
  SBM	
  
3.  Show	
  when	
  and	
  why	
  LP	
  fails	
17/08/22	
 IJCAI@Melbourne	
 7
NETWORK	
  GENERATIVE	
  MODELS	
17/08/22	
 IJCAI@Melbourne	
 8
Stochastic Block Model	
Generative process	
 Multinomial	
Bernoulli	
①	
②	
①: Generate cluster assignment for each node

 
(which can be thought of labels)
②: Generate adjacency matrix	
17/08/22	
 IJCAI@Melbourne	
 9	
γ ∈ RK
Π ∈ RKxK
Parameters:
Proposed:
Partially Labeled SBM (PLSBM) 	
Generative process	
①	
②	
③	
②:Generate labels for “labeled nodes” 

 
(α large à yi is more likely to be the same as zi)
Depends on
parameter α	
17/08/22	
 IJCAI@Melbourne	
 10	
γ ∈ RK
Π ∈ RKxK
α ∈ 0,1[ ]
Parameters:
Rela1onships	
  between	
  models	
17/08/22	
 IJCAI@Melbourne	
 11	
SBM	
 PLSBM	
LP	
 Discre1zed	
  LP	
Main	
  result	
  
(next	
  slide)	
No	
  labels	
Con1nuous	
  
relaxa1on
Main Result	
Map estimator Z of PLSBM is identical to the solution of
(discretized) LP when the following conditions hold 
Condition 1: 
Condition 2:
Condition 3: 
Condition 4: (omitted)
17/08/22	
 IJCAI@Melbourne	
 12
Condition 1	
Implication (implicit assumption of LP)

•  Label ratio is uniform	
17/08/22	
 IJCAI@Melbourne	
 13	
Violates this assumption L
Condition 2	
Implication (Implicit assumptions of LP)	

•  Edge probs between the same labels are all the same (μ)
•  Edge probs between different labels are all the same (ν)	
17/08/22	
 IJCAI@Melbourne	
 14	
Violates this assumption L
Condition 3	
Implication (Implicit assumption of LP)

•  Assortative (same labels tend to be connected)
17/08/22	
 IJCAI@Melbourne	
 15	
Violates this assumption L
Experimental results	
17/08/22	
 IJCAI@Melbourne	
 16	
… Come see full results at the poster session J	
Better	
Setups:
1.  Generate datasets by PLSBM
2.  infer labels (Z) by PLSBM, SBM, and LP
3.  Report mean accuracy of 20 trials	
Assortative	
 Disassortative	
Agree with
theoretical results
Summary	
•  Proposed	
  Par1ally-­‐Labeled	
  SBM	
  (PLSBM)	
  
•  Proved	
  the	
  rela1onship	
  between	
  LP	
  and	
  SBM	
  via	
  
PLSBM	
  
•  Showed	
  cases	
  when	
  LP	
  fails	
  
•  Experimental	
  and	
  Theore1cal	
  results	
  agree	
17/08/22	
 IJCAI@Melbourne	
 17	
Github: yamaguchiyuto/plsbm

More Related Content

More from Yuto Yamaguchi

OMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationOMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationYuto Yamaguchi
 
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Yuto Yamaguchi
 
SIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social MediaSIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social MediaYuto Yamaguchi
 
Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...Yuto Yamaguchi
 
The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...Yuto Yamaguchi
 
WWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social NetworksWWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social NetworksYuto Yamaguchi
 
ICDE2012勉強会:Social Media
ICDE2012勉強会:Social MediaICDE2012勉強会:Social Media
ICDE2012勉強会:Social MediaYuto Yamaguchi
 

More from Yuto Yamaguchi (7)

OMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label CorrelationOMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
OMNI-Prop: Seamless Node Classification on Arbitrary Label Correlation
 
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
Online User Location Inference Exploiting Spatiotemporal Correlations in Soci...
 
SIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social MediaSIGMOD2013勉強会:Social Media
SIGMOD2013勉強会:Social Media
 
Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...Towards Social User Profiling: Unified and Discriminative Influence Model for...
Towards Social User Profiling: Unified and Discriminative Influence Model for...
 
The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...The Length of Bridge Ties: Structural and Geographic Properties of Online So...
The Length of Bridge Ties: Structural and Geographic Properties of Online So...
 
WWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social NetworksWWW2012勉強会:Information Diffusion in Social Networks
WWW2012勉強会:Information Diffusion in Social Networks
 
ICDE2012勉強会:Social Media
ICDE2012勉強会:Social MediaICDE2012勉強会:Social Media
ICDE2012勉強会:Social Media
 

Recently uploaded

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 

Recently uploaded (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

When Does Label Propagation Fail? A View from a Network Generative Model

  • 1. When  Does  Label  Propaga1on  Fail?   A  View  from  a  Network  Genera1ve  Model Yuto  Yamaguchi  and  Kohei  Hayashi 17/08/22 IJCAI@Melbourne 1
  • 2. Node Classification Given Find Partially labeled undirected graph Labels of all nodes 17/08/22 IJCAI@Melbourne 2
  • 3. Example: User profile inference Friends Soccer Soccer Soccer Tennis Baseball ??? What’s his hobby? Node Classification 17/08/22 IJCAI@Melbourne 3
  • 4. Label Propagation (1/2) Propagate neighbors’ labels Friends Soccer Soccer Soccer Tennis Baseline ??? Soccer Soccer Soccer Tennis Baseline Soccer [Zhu+, 03], [Zhou+, 03], etc. 17/08/22 IJCAI@Melbourne 4
  • 5. Label Propagation (2/2) Q F;X,Y,λ( )= 1 2 fi − yi 2 2 i=1 N ∑ + λ 2 xij fi − fj 2 2 j=1 N ∑ i=1 N ∑ Given: adjacency matrix X and labels Y Find: F = { fi } that minimizes Q 17/08/22 IJCAI@Melbourne 5 F ∈ RN x K Y ∈ {0, 1}N x K X ∈ {0, 1}N x N N: # of nodes K: # of labels λ ∈ R+ : user parameter [Zhu+, 03], [Zhou+, 03], etc.
  • 6. Cases  when  LP  fails  (prac1cally  known) Different labels are connected Label ratio is not uniform Q. So, do we know why LP fails in these cases? A. No. Since it’s not a probabilistic model, we don’t know the assumptions behind the model. 17/08/22 IJCAI@Melbourne 6 Edge probability is not uniform
  • 7. What  we  do  in  this  work 1.  Prove  a  theore1cal  rela1onship  between  LP   and  Stochas(c  Block  Model,  which  is  a  well-­‐ studied  probabilis1c  genera1ve  model   2.  Find  the  assump(ons  behind  LP  through  the   assump1ons  behind  SBM   3.  Show  when  and  why  LP  fails 17/08/22 IJCAI@Melbourne 7
  • 9. Stochastic Block Model Generative process Multinomial Bernoulli ① ② ①: Generate cluster assignment for each node (which can be thought of labels) ②: Generate adjacency matrix 17/08/22 IJCAI@Melbourne 9 γ ∈ RK Π ∈ RKxK Parameters:
  • 10. Proposed: Partially Labeled SBM (PLSBM) Generative process ① ② ③ ②:Generate labels for “labeled nodes” (α large à yi is more likely to be the same as zi) Depends on parameter α 17/08/22 IJCAI@Melbourne 10 γ ∈ RK Π ∈ RKxK α ∈ 0,1[ ] Parameters:
  • 11. Rela1onships  between  models 17/08/22 IJCAI@Melbourne 11 SBM PLSBM LP Discre1zed  LP Main  result   (next  slide) No  labels Con1nuous   relaxa1on
  • 12. Main Result Map estimator Z of PLSBM is identical to the solution of (discretized) LP when the following conditions hold Condition 1: Condition 2: Condition 3: Condition 4: (omitted) 17/08/22 IJCAI@Melbourne 12
  • 13. Condition 1 Implication (implicit assumption of LP) •  Label ratio is uniform 17/08/22 IJCAI@Melbourne 13 Violates this assumption L
  • 14. Condition 2 Implication (Implicit assumptions of LP) •  Edge probs between the same labels are all the same (μ) •  Edge probs between different labels are all the same (ν) 17/08/22 IJCAI@Melbourne 14 Violates this assumption L
  • 15. Condition 3 Implication (Implicit assumption of LP) •  Assortative (same labels tend to be connected) 17/08/22 IJCAI@Melbourne 15 Violates this assumption L
  • 16. Experimental results 17/08/22 IJCAI@Melbourne 16 … Come see full results at the poster session J Better Setups: 1.  Generate datasets by PLSBM 2.  infer labels (Z) by PLSBM, SBM, and LP 3.  Report mean accuracy of 20 trials Assortative Disassortative Agree with theoretical results
  • 17. Summary •  Proposed  Par1ally-­‐Labeled  SBM  (PLSBM)   •  Proved  the  rela1onship  between  LP  and  SBM  via   PLSBM   •  Showed  cases  when  LP  fails   •  Experimental  and  Theore1cal  results  agree 17/08/22 IJCAI@Melbourne 17 Github: yamaguchiyuto/plsbm