SlideShare una empresa de Scribd logo
1 de 36
Higher-order
clustering coefficients
Austin R. Benson
Cornell University
Purdue CSoI Seminar
October 4, 2017
Joint work with
Hao Yin &
Jure Leskovec
(Stanford)
2
Brains
nodes are neurons
edges are synapses
Social networks
nodes are people
edges are
friendships
Electrical grid
nodes are power plants
edges are transmission
linesTim Meko, Washington Post
Currency
nodes are accounts
edges are transactions
Background. Networks are sets of nodes and edges (graphs)
that model real-world systems.
3
Background. Networks are globally sparse but locally dense.
Co-author network
Networks for real-world systems have modules, clusters, communities.
[Watts-Strogatz 1998; Flake 2000; Newman 2004, 2006; many others…]
Brain network
Sporns and
Bullmore, Nature
Rev. Neuro., 2012
4
How do we measure
how much a network clusters?
5
? C(u) = fraction of length-2 paths
centered at node u that form a triangle.
average clustering coefficient
C = average C(u) over all nodes u.
• In real-world networks, C is larger than we would expect (there is clustering).
[Watts-Strogatz 1998] > 34k citations!
• Attributed to triadic closure in sociology – a common friend provides an
opportunity for more friendships. [Rapoport 1953; Granovetter 1973]
• Key property for generative models.
[Newman 2009; Seshadhri-Kolda-Pinar 2012; Robles-Moreno-Neville 2016]
• Common feature in role discovery, anomaly detection, etc.
[Henderson+ 2012; La Fond-Neville-Gallagher 2014, 2016]
• Predictor of mental health. [Bearman-Moody 2004]
-
-
Background. The clustering coefficient is the fundamental
measurement of network science.
6
The clustering coefficient measures the closure
probability of just one simple structure—the triangle.
… but there is lots of evidence that dense
“higher-order structure” between > 3 nodes are
also important for clustering.
• 4-cliques reveal community structure in word
association and PPI networks [Palla+ 2005]
• 4- and 5-cliques (+ other motifs/graphlets)
used to identify network type and dimension
[Yaveroğlu+ 2014, Bonato+ 2014]
• 4-node motifs identify community structure in
neural systems [Benson-Gleich-Leskovec 2016]
The clustering coefficient is inherently limited.
7
Triangles tell just one part of the
story.
How can we measure
higher-order (clique) closure
patterns?
1. Find a 2-clique 2. Attach adjacent edge 3. Check for (2 + 1)-
clique
1. Find a 3-clique 2. Attach adjacent edge 3. Check for (3+1)-
clique
1. Find a 4-clique 2. Attach adjacent edge 3. Check for (4+1)-clique
8
C2 = avg. fraction of (2-clique, adjacent edge) pairs that induce a (2+1)-clique
Increase clique size by 1 to get a higher-order clustering
coefficient.
C3 = avg. fraction of (3-clique, adjacent edge) pairs that induce a (3+1)-clique
C4 = avg. fraction of (4-clique, adjacent edge) pairs that induce a (4+1)-clique
-
-
-
Our higher-order view through clique expansion.
Alice
Bo
b
Charli
e
1. Start with a
group of 3 friends
2. One person in the
group befriends
someone new
3. The group might
increase in size
Dav
e
9
rollingstone.com
oprah.com
Intuition for higher-order closure in social networks.
10
We generalize clustering coefficients to account for clique closure.
This particular generalization has several advantages…
1. Theory. Analyze relationships between clustering at different orders.
• small-world and Gn,p random graph models
• combinatorics for general graphs
2. Data Insights. How do real-world networks cluster?
• old idea pretty much all real-world networks exhibit clustering
• new idea real-world networks may only cluster up to a certain order.
3. Applications. Finding “higher-order” communities.
• Large higher-order clustering coefficient → can find good “higher-order community”
Higher-order clustering coefficients.
11
Second-order
(classical) local
clustering coefficient at
node u.
Second-order (classical)
global clustering
coefficient.
Second-order (classical)
average clustering
coefficient.
Background. Local, average, and global clustering coefficients.
12
Third-order
local clustering
coefficient at node u.
Third-order
global clustering
coefficient.
Third-order
average clustering
coefficient.
Local, average, and global higher-order clustering coefficients.
13
• Small-world [Watts-Strogatz 1998]
• Start with n nodes and edges to 2k neighbors
and then rewire each edge with probability p.
n = 16
k = 3
p = 0
[Watts-Strogatz 1998]
[Yin-Benson-Leskovec 2017]
Small-world network analysis.
14
Proposition [Yin-Benson-Leskovec 2017]
Everything scales exponentially in the order of the cluster coefficient...
Even if a node’s neighborhood is dense, i.e., C2(u) is large,
higher-order clustering still decays exponentially in Gn,p.
Gn,p random graph network analysis.
15
General network combinatorial analysis.
Extremal relationships HOCCs of different orders.
Proposition [Yin-Benson-Leskovec 2017]
For any node u in the network,
(tight upper and lower bounds)
16
General network combinatorial analysis.
Clique density interpretation.
Proposition [Yin-Benson-Leskovec 2017]
The product of the first r - 1 local higher-order clustering coefficients is
the r-clique density between the neighbors of node u.
17
General network combinatorial analysis.
Clique participation and computation.
Observation
We can compute the rth-order HOCCs by enumerating r- and (r + 1)-
cliques.
is the number of a-
cliques containing u
18
We generalize clustering coefficients to account for clique closure.
This particular generalization has several advantages…
1. Theory. Analyze relationships between clustering at different orders.
• small-world and Gn,p random graph models
• combinatorics for general graphs
2. Data Insights. How do real-world networks cluster?
• old idea pretty much all real-world networks exhibit clustering
• new idea real-world networks may only cluster up to a certain order.
3. Applications. Finding “higher-order” communities.
• Large higher-order clustering coefficient → can find good “higher-order community”
Higher-order clustering coefficients.
19
Datasets.
Neural connections (C. elegans)
297 nodes
2.15k edges
Facebook friendships
(Stanford3)
11.6k nodes
568k edges
Co-authorships (arXiv ca-
AstroPh)
18.8k nodes
198k edges
http://www.wormatlas.org/hermaphro
dite/
neuronalsupport/mainframe.htm
20
Neural connections 0.18 0.08 0.06 decreases with
order
Facebook friendships 0.16 0.11 0.12 decreases and
increases
Co-authorships 0.32 0.33 0.36 increases with
order
Is this just due to cliques in co-authorships?
No. High-degree nodes in co-authorships exhibit
clique + star structure where C3(u) > C2(u).
Global higher-order clustering coefficients.
21
Neural connections 0.31 0.14 0.06
Random configurations 0.15 0.04 0.01
Random configurations (C2 fixed) 0.31 0.17 0.09
Facebook friendships 0.25 0.18 0.16
Random configurations 0.03 0.00 0.00
Random configurations (C2 fixed) 0.25 0.14 0.09
Co-authorships 0.68 0.61 0.56
Random configurations 0.01 0.00 0.00
Random configurations (C2 fixed) 0.68 0.60 0.52-
-
-
Average higher-order clustering coefficients
22
Random
configurations
[Bollobás 1980;
Milo 2003]
Random configurations
with C2 fixed
[Park-Newman 2004;
Colomer de Simón+ 2013]
Real network (C. elegans)
-
Concentration in random samples for neural connections data.
23
Neural connections findings not just due to cliques.
Original network Null model
# 4-cliques 2,010 440 ± 68
C3 0.14 0.17 ± 0.004
4-clique count decreases in the null model, but the
higher-order clustering coefficient increases.
-
Key reason. Clustering coefficients are
normalized by opportunities to cluster.
24
Neural connections
Gn,p baseline
Upper bound
Facebook friendships Co-authorships
Dense but nearly
random regions
Dense and
structured regions
• Real network
• Random configuration with C2 fixe-
Local HOCCs.
25
We generalize clustering coefficients to account for clique closure.
This particular generalization has several advantages…
1. Theory. Analyze relationships between clustering at different orders.
• small-world and Gn,p random graph models
• combinatorics for general graphs
2. Data Insights. How do real-world networks cluster?
• old idea pretty much all real-world networks exhibit clustering
• new idea real-world networks may only cluster up to a certain order.
3. Applications. Finding “higher-order” communities.
• Large higher-order clustering coefficient → can find good “higher-order community”
Higher-order clustering coefficients.
26
If a network has
a large higher-order clustering coefficient,
then it has communities.
then there exists at least one community
by one particular measure of “higher-order community structure”,
but we can find the community efficiently.
Conductance is one of the most important cluster quality scores [Schaeffer
2007]
used in Markov chain theory, spectral clustering, bioinformatics, vision, etc.
The conductance of a set of vertices S is the ratio of
edges leaving to edges in S.
small conductance  good cluster
(edges leaving S)
(edge end points in S)
27
S S
Background. Graph clustering and conductance.
28
Background. Motif conductance generalizes conductance to
higher-order structures like cliques [Benson-Gleich-Leskovec 2016]
Uses higher-order notions of cut and
volume.
M = triangle motif
29
Easy to see that if Cr = 1,
then the network is a union
of disjoint cliques…
… any of these cliques has
optimal motif conductance = 0
Theorem [Yin-Benson-Leskovec, in preparation]
There is some node u whose 1-hop
neighborhood N1(u) satisfies
where M is the r-clique motif
This generalizes and improves a similar r = 2 (edge) result [Gleich-Seshadhri
2012]
Higher-order clustering  higher-order communities.
30
Neural connections Facebook friendships Co-authorships
Neighborhood
Neighborhood with smallest conductance
Fiedler cut with motif normalized Laplacian
[Benson-Gleich-Leskovec 16]
Large C3 and several neighborhoods
with small triangle conductance
Higher-order clustering  higher-order communities.
31
Higher-order clustering  higher-order communities.
Theory. (pessimistic in practice)
Practice. If the higher-order clustering coefficient is non-trivial,
then there should be good local clusters.
32
Local higher-order graph clustering
Yin, Benson, Leskovec, & Gleich, KDD, 2017.
• Studies the general problem of
finding local clusters based on
motifs (cliques).
• Our method is a generalization
of the Andersen-Chung-Lang
personalized PageRank
algorithm that expands
clusters around a seed node.
• Theoretical guarantees on
cluster quality and
performance (in practice, < 2
sec / seed on 2B edge graph).
Seed
node
Local cluster
33
Local higher-order graph clustering
Yin, Benson, Leskovec, & Gleich, KDD, 2017.
• Clusters based on
triangles yield better
recovery results on
common synthetic graph
models.
Average F1 0.40 0.50
• Clusters based on triangles
can better recover a person’s
departmental affiliation in an
academic email network.
34
Related work
 Gleich and Seshadrhi, “Vertex neighborhoods, low conductance cuts, and good seeds for local
community methods”, KDD, 2012.
Motivation for relating higher-order clustering coefficients to 1-hop neighborhood communities.
Intellectually indebted for their proof techniques!
 Benson, Gleich, and Leskovec, “Higher-order organization of complex networks,” Science, 2016.
Introduced higher-order conductance and a spectral method for optimizing it.
 Fronczak et al., “Higher order clustering coefficients in Barabási–Albert networks.” Physica A, 2002.
Higher-order clustering by looking at shortest path lengths.
 Jiang and Claramunt, “Topological analysis of urban street networks,” Environ. and Planning B,
2004.
Higher-order clustering by looking for triangles in k-hop neighborhoods.
 Lambiotte et al., “Structural Transitions in Densifying Networks,” PRL, 2016.
 Bhat et al., “Densification and structural transitions in networks that grow by node copying,” PRE,
2016.
Generative models with similar clique closure ideas.
35
Papers
• “Higher-order clustering in networks.” Yin, Benson, and Leskovec. arXiv, 2017.
• “Local higher-order graph clustering.” Yin, Benson, Leskovec, and Gleich. KDD, 2017.
• “Higher-order organization of complex networks.” Benson, Gleich, and Leskovec. Science, 2016.
1. A generalization of the fundamental
measurement of network science through
“clique expansion” interpretation.
2. Able to analyze generally and in common
random graph models (small-world and Gn,p).
3. old idea all real-world graphs cluster.
new idea only cluster up to a certain order.
4. In data, helps distinguish between dense and
random (neural connections) and dense and
structured (FB friendships, co-authorship).
5. Higher-order clustering implies local (1-hop
neighborhood) higher-order communities.
Open questions / future work
• Is there a generative model
that reproduces the
observed higher-order
clustering coefficients (e.g.,
forest fire)?
• Tighter analysis for 1-hop
neighborhood conductance?
• Higher-order clustering
coefficients for other motifs
(i.e., not just cliques).
http://cs.cornell.edu/~arb
@austinbenson
arb@cs.cornell.edu
Thanks!
Austin Benson
36
Neural connections Facebook friendships Co-authorships
Decrease in average
clustering with order is
independent of
degree.
For large degrees,
Changes in higher-order clustering coefficients tend to be
independent of degree.

Más contenido relacionado

La actualidad más candente

3D 딥러닝 동향
3D 딥러닝 동향3D 딥러닝 동향
3D 딥러닝 동향
NAVER Engineering
 
Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...
Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...
Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...
Yandex
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
KamleshKumar394
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
Editor IJARCET
 

La actualidad más candente (20)

EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
 
Scalable community detection with the louvain algorithm
Scalable community detection with the louvain algorithmScalable community detection with the louvain algorithm
Scalable community detection with the louvain algorithm
 
Higher-order Link Prediction Syracuse
Higher-order Link Prediction SyracuseHigher-order Link Prediction Syracuse
Higher-order Link Prediction Syracuse
 
3D 딥러닝 동향
3D 딥러닝 동향3D 딥러닝 동향
3D 딥러닝 동향
 
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
Bat-Cluster: A Bat Algorithm-based Automated Graph Clustering Approach
 
Sequences of Sets KDD '18
Sequences of Sets KDD '18Sequences of Sets KDD '18
Sequences of Sets KDD '18
 
Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...
Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...
Limits of Local Algorithms for Randomly Generated Constraint Satisfaction Pro...
 
Quantum persistent k cores for community detection
Quantum persistent k cores for community detectionQuantum persistent k cores for community detection
Quantum persistent k cores for community detection
 
009_20150201_Structural Inference for Uncertain Networks
009_20150201_Structural Inference for Uncertain Networks009_20150201_Structural Inference for Uncertain Networks
009_20150201_Structural Inference for Uncertain Networks
 
An Introduction to Networks
An Introduction to NetworksAn Introduction to Networks
An Introduction to Networks
 
Link prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structureLink prediction in networks with core-fringe structure
Link prediction in networks with core-fringe structure
 
Greedy Incremental approach for unfolding of communities in massive networks
Greedy Incremental approach for unfolding of communities in massive networksGreedy Incremental approach for unfolding of communities in massive networks
Greedy Incremental approach for unfolding of communities in massive networks
 
Three hypergraph eigenvector centralities
Three hypergraph eigenvector centralitiesThree hypergraph eigenvector centralities
Three hypergraph eigenvector centralities
 
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on ClusteringAbility Study of Proximity Measure for Big Data Mining Context on Clustering
Ability Study of Proximity Measure for Big Data Mining Context on Clustering
 
TMPA-2017: Compositional Process Model Synthesis based on Interface Patterns
TMPA-2017: Compositional Process Model Synthesis based on Interface PatternsTMPA-2017: Compositional Process Model Synthesis based on Interface Patterns
TMPA-2017: Compositional Process Model Synthesis based on Interface Patterns
 
Centrality Prediction in Mobile Social Networks
Centrality Prediction in Mobile Social NetworksCentrality Prediction in Mobile Social Networks
Centrality Prediction in Mobile Social Networks
 
TMPA-2017: Layered Layouts for Software Systems Visualization
TMPA-2017: Layered Layouts for Software Systems VisualizationTMPA-2017: Layered Layouts for Software Systems Visualization
TMPA-2017: Layered Layouts for Software Systems Visualization
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
 
NetBioSIG2013-KEYNOTE Michael Schroeder
NetBioSIG2013-KEYNOTE Michael SchroederNetBioSIG2013-KEYNOTE Michael Schroeder
NetBioSIG2013-KEYNOTE Michael Schroeder
 
A technical paper presentation on Evaluation of Deep Learning techniques in S...
A technical paper presentation on Evaluation of Deep Learning techniques in S...A technical paper presentation on Evaluation of Deep Learning techniques in S...
A technical paper presentation on Evaluation of Deep Learning techniques in S...
 

Similar a Higher-order clustering coefficients at Purdue CSoI

ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...
Daniel Katz
 
Distribution of maximal clique size of the
Distribution of maximal clique size of theDistribution of maximal clique size of the
Distribution of maximal clique size of the
IJCNCJournal
 
Topology ppt
Topology pptTopology ppt
Topology ppt
boocse11
 

Similar a Higher-order clustering coefficients at Purdue CSoI (20)

Higher-order clustering coefficients
Higher-order clustering coefficientsHigher-order clustering coefficients
Higher-order clustering coefficients
 
Higher-order clustering coefficients
Higher-order clustering coefficientsHigher-order clustering coefficients
Higher-order clustering coefficients
 
Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)Socialnetworkanalysis (Tin180 Com)
Socialnetworkanalysis (Tin180 Com)
 
ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor...
 
Higher-order clustering in networks
Higher-order clustering in networksHigher-order clustering in networks
Higher-order clustering in networks
 
Community detection
Community detectionCommunity detection
Community detection
 
Spectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structuresSpectral clustering with motifs and higher-order structures
Spectral clustering with motifs and higher-order structures
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
Higher-order Link Prediction GraphEx
Higher-order Link Prediction GraphExHigher-order Link Prediction GraphEx
Higher-order Link Prediction GraphEx
 
Distribution of maximal clique size of the
Distribution of maximal clique size of theDistribution of maximal clique size of the
Distribution of maximal clique size of the
 
Node similarity
Node similarityNode similarity
Node similarity
 
Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]
 
Interpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approachInterpretation of the biological knowledge using networks approach
Interpretation of the biological knowledge using networks approach
 
Topology ppt
Topology pptTopology ppt
Topology ppt
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020
 
AI Class Topic 5: Social Network Graph
AI Class Topic 5:  Social Network GraphAI Class Topic 5:  Social Network Graph
AI Class Topic 5: Social Network Graph
 
Topology ppt
Topology pptTopology ppt
Topology ppt
 
Topology ppt
Topology pptTopology ppt
Topology ppt
 
D1803022335
D1803022335D1803022335
D1803022335
 
13047926.ppt
13047926.ppt13047926.ppt
13047926.ppt
 

Más de Austin Benson

Más de Austin Benson (20)

Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)Hypergraph Cuts with General Splitting Functions (JMM)
Hypergraph Cuts with General Splitting Functions (JMM)
 
Spectral embeddings and evolving networks
Spectral embeddings and evolving networksSpectral embeddings and evolving networks
Spectral embeddings and evolving networks
 
Computational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data AnalysisComputational Frameworks for Higher-order Network Data Analysis
Computational Frameworks for Higher-order Network Data Analysis
 
Higher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modelingHigher-order link prediction and other hypergraph modeling
Higher-order link prediction and other hypergraph modeling
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
 
Hypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting FunctionsHypergraph Cuts with General Splitting Functions
Hypergraph Cuts with General Splitting Functions
 
Higher-order link prediction
Higher-order link predictionHigher-order link prediction
Higher-order link prediction
 
Semi-supervised learning of edge flows
Semi-supervised learning of edge flowsSemi-supervised learning of edge flows
Semi-supervised learning of edge flows
 
Random spatial network models for core-periphery structure
Random spatial network models for core-periphery structureRandom spatial network models for core-periphery structure
Random spatial network models for core-periphery structure
 
Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.Random spatial network models for core-periphery structure.
Random spatial network models for core-periphery structure.
 
Simplicial closure & higher-order link prediction
Simplicial closure & higher-order link predictionSimplicial closure & higher-order link prediction
Simplicial closure & higher-order link prediction
 
Simplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusionsSimplicial closure and simplicial diffusions
Simplicial closure and simplicial diffusions
 
Sampling methods for counting temporal motifs
Sampling methods for counting temporal motifsSampling methods for counting temporal motifs
Sampling methods for counting temporal motifs
 
Set prediction three ways
Set prediction three waysSet prediction three ways
Set prediction three ways
 
Simplicial closure and higher-order link prediction --- SIAMNS18
Simplicial closure and higher-order link prediction --- SIAMNS18Simplicial closure and higher-order link prediction --- SIAMNS18
Simplicial closure and higher-order link prediction --- SIAMNS18
 
Simplicial closure and higher-order link prediction (SIAMNS18)
Simplicial closure and higher-order link prediction (SIAMNS18)Simplicial closure and higher-order link prediction (SIAMNS18)
Simplicial closure and higher-order link prediction (SIAMNS18)
 
Simplicial closure and higher-order link prediction
Simplicial closure and higher-order link predictionSimplicial closure and higher-order link prediction
Simplicial closure and higher-order link prediction
 
New perspectives on measuring network clustering
New perspectives on measuring network clusteringNew perspectives on measuring network clustering
New perspectives on measuring network clustering
 
Higher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifsHigher-order spectral graph clustering with motifs
Higher-order spectral graph clustering with motifs
 
Tensor Eigenvectors and Stochastic Processes
Tensor Eigenvectors and Stochastic ProcessesTensor Eigenvectors and Stochastic Processes
Tensor Eigenvectors and Stochastic Processes
 

Último

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 

Último (20)

(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 

Higher-order clustering coefficients at Purdue CSoI

  • 1. Higher-order clustering coefficients Austin R. Benson Cornell University Purdue CSoI Seminar October 4, 2017 Joint work with Hao Yin & Jure Leskovec (Stanford)
  • 2. 2 Brains nodes are neurons edges are synapses Social networks nodes are people edges are friendships Electrical grid nodes are power plants edges are transmission linesTim Meko, Washington Post Currency nodes are accounts edges are transactions Background. Networks are sets of nodes and edges (graphs) that model real-world systems.
  • 3. 3 Background. Networks are globally sparse but locally dense. Co-author network Networks for real-world systems have modules, clusters, communities. [Watts-Strogatz 1998; Flake 2000; Newman 2004, 2006; many others…] Brain network Sporns and Bullmore, Nature Rev. Neuro., 2012
  • 4. 4 How do we measure how much a network clusters?
  • 5. 5 ? C(u) = fraction of length-2 paths centered at node u that form a triangle. average clustering coefficient C = average C(u) over all nodes u. • In real-world networks, C is larger than we would expect (there is clustering). [Watts-Strogatz 1998] > 34k citations! • Attributed to triadic closure in sociology – a common friend provides an opportunity for more friendships. [Rapoport 1953; Granovetter 1973] • Key property for generative models. [Newman 2009; Seshadhri-Kolda-Pinar 2012; Robles-Moreno-Neville 2016] • Common feature in role discovery, anomaly detection, etc. [Henderson+ 2012; La Fond-Neville-Gallagher 2014, 2016] • Predictor of mental health. [Bearman-Moody 2004] - - Background. The clustering coefficient is the fundamental measurement of network science.
  • 6. 6 The clustering coefficient measures the closure probability of just one simple structure—the triangle. … but there is lots of evidence that dense “higher-order structure” between > 3 nodes are also important for clustering. • 4-cliques reveal community structure in word association and PPI networks [Palla+ 2005] • 4- and 5-cliques (+ other motifs/graphlets) used to identify network type and dimension [Yaveroğlu+ 2014, Bonato+ 2014] • 4-node motifs identify community structure in neural systems [Benson-Gleich-Leskovec 2016] The clustering coefficient is inherently limited.
  • 7. 7 Triangles tell just one part of the story. How can we measure higher-order (clique) closure patterns?
  • 8. 1. Find a 2-clique 2. Attach adjacent edge 3. Check for (2 + 1)- clique 1. Find a 3-clique 2. Attach adjacent edge 3. Check for (3+1)- clique 1. Find a 4-clique 2. Attach adjacent edge 3. Check for (4+1)-clique 8 C2 = avg. fraction of (2-clique, adjacent edge) pairs that induce a (2+1)-clique Increase clique size by 1 to get a higher-order clustering coefficient. C3 = avg. fraction of (3-clique, adjacent edge) pairs that induce a (3+1)-clique C4 = avg. fraction of (4-clique, adjacent edge) pairs that induce a (4+1)-clique - - - Our higher-order view through clique expansion.
  • 9. Alice Bo b Charli e 1. Start with a group of 3 friends 2. One person in the group befriends someone new 3. The group might increase in size Dav e 9 rollingstone.com oprah.com Intuition for higher-order closure in social networks.
  • 10. 10 We generalize clustering coefficients to account for clique closure. This particular generalization has several advantages… 1. Theory. Analyze relationships between clustering at different orders. • small-world and Gn,p random graph models • combinatorics for general graphs 2. Data Insights. How do real-world networks cluster? • old idea pretty much all real-world networks exhibit clustering • new idea real-world networks may only cluster up to a certain order. 3. Applications. Finding “higher-order” communities. • Large higher-order clustering coefficient → can find good “higher-order community” Higher-order clustering coefficients.
  • 11. 11 Second-order (classical) local clustering coefficient at node u. Second-order (classical) global clustering coefficient. Second-order (classical) average clustering coefficient. Background. Local, average, and global clustering coefficients.
  • 12. 12 Third-order local clustering coefficient at node u. Third-order global clustering coefficient. Third-order average clustering coefficient. Local, average, and global higher-order clustering coefficients.
  • 13. 13 • Small-world [Watts-Strogatz 1998] • Start with n nodes and edges to 2k neighbors and then rewire each edge with probability p. n = 16 k = 3 p = 0 [Watts-Strogatz 1998] [Yin-Benson-Leskovec 2017] Small-world network analysis.
  • 14. 14 Proposition [Yin-Benson-Leskovec 2017] Everything scales exponentially in the order of the cluster coefficient... Even if a node’s neighborhood is dense, i.e., C2(u) is large, higher-order clustering still decays exponentially in Gn,p. Gn,p random graph network analysis.
  • 15. 15 General network combinatorial analysis. Extremal relationships HOCCs of different orders. Proposition [Yin-Benson-Leskovec 2017] For any node u in the network, (tight upper and lower bounds)
  • 16. 16 General network combinatorial analysis. Clique density interpretation. Proposition [Yin-Benson-Leskovec 2017] The product of the first r - 1 local higher-order clustering coefficients is the r-clique density between the neighbors of node u.
  • 17. 17 General network combinatorial analysis. Clique participation and computation. Observation We can compute the rth-order HOCCs by enumerating r- and (r + 1)- cliques. is the number of a- cliques containing u
  • 18. 18 We generalize clustering coefficients to account for clique closure. This particular generalization has several advantages… 1. Theory. Analyze relationships between clustering at different orders. • small-world and Gn,p random graph models • combinatorics for general graphs 2. Data Insights. How do real-world networks cluster? • old idea pretty much all real-world networks exhibit clustering • new idea real-world networks may only cluster up to a certain order. 3. Applications. Finding “higher-order” communities. • Large higher-order clustering coefficient → can find good “higher-order community” Higher-order clustering coefficients.
  • 19. 19 Datasets. Neural connections (C. elegans) 297 nodes 2.15k edges Facebook friendships (Stanford3) 11.6k nodes 568k edges Co-authorships (arXiv ca- AstroPh) 18.8k nodes 198k edges http://www.wormatlas.org/hermaphro dite/ neuronalsupport/mainframe.htm
  • 20. 20 Neural connections 0.18 0.08 0.06 decreases with order Facebook friendships 0.16 0.11 0.12 decreases and increases Co-authorships 0.32 0.33 0.36 increases with order Is this just due to cliques in co-authorships? No. High-degree nodes in co-authorships exhibit clique + star structure where C3(u) > C2(u). Global higher-order clustering coefficients.
  • 21. 21 Neural connections 0.31 0.14 0.06 Random configurations 0.15 0.04 0.01 Random configurations (C2 fixed) 0.31 0.17 0.09 Facebook friendships 0.25 0.18 0.16 Random configurations 0.03 0.00 0.00 Random configurations (C2 fixed) 0.25 0.14 0.09 Co-authorships 0.68 0.61 0.56 Random configurations 0.01 0.00 0.00 Random configurations (C2 fixed) 0.68 0.60 0.52- - - Average higher-order clustering coefficients
  • 22. 22 Random configurations [Bollobás 1980; Milo 2003] Random configurations with C2 fixed [Park-Newman 2004; Colomer de Simón+ 2013] Real network (C. elegans) - Concentration in random samples for neural connections data.
  • 23. 23 Neural connections findings not just due to cliques. Original network Null model # 4-cliques 2,010 440 ± 68 C3 0.14 0.17 ± 0.004 4-clique count decreases in the null model, but the higher-order clustering coefficient increases. - Key reason. Clustering coefficients are normalized by opportunities to cluster.
  • 24. 24 Neural connections Gn,p baseline Upper bound Facebook friendships Co-authorships Dense but nearly random regions Dense and structured regions • Real network • Random configuration with C2 fixe- Local HOCCs.
  • 25. 25 We generalize clustering coefficients to account for clique closure. This particular generalization has several advantages… 1. Theory. Analyze relationships between clustering at different orders. • small-world and Gn,p random graph models • combinatorics for general graphs 2. Data Insights. How do real-world networks cluster? • old idea pretty much all real-world networks exhibit clustering • new idea real-world networks may only cluster up to a certain order. 3. Applications. Finding “higher-order” communities. • Large higher-order clustering coefficient → can find good “higher-order community” Higher-order clustering coefficients.
  • 26. 26 If a network has a large higher-order clustering coefficient, then it has communities. then there exists at least one community by one particular measure of “higher-order community structure”, but we can find the community efficiently.
  • 27. Conductance is one of the most important cluster quality scores [Schaeffer 2007] used in Markov chain theory, spectral clustering, bioinformatics, vision, etc. The conductance of a set of vertices S is the ratio of edges leaving to edges in S. small conductance  good cluster (edges leaving S) (edge end points in S) 27 S S Background. Graph clustering and conductance.
  • 28. 28 Background. Motif conductance generalizes conductance to higher-order structures like cliques [Benson-Gleich-Leskovec 2016] Uses higher-order notions of cut and volume. M = triangle motif
  • 29. 29 Easy to see that if Cr = 1, then the network is a union of disjoint cliques… … any of these cliques has optimal motif conductance = 0 Theorem [Yin-Benson-Leskovec, in preparation] There is some node u whose 1-hop neighborhood N1(u) satisfies where M is the r-clique motif This generalizes and improves a similar r = 2 (edge) result [Gleich-Seshadhri 2012] Higher-order clustering  higher-order communities.
  • 30. 30 Neural connections Facebook friendships Co-authorships Neighborhood Neighborhood with smallest conductance Fiedler cut with motif normalized Laplacian [Benson-Gleich-Leskovec 16] Large C3 and several neighborhoods with small triangle conductance Higher-order clustering  higher-order communities.
  • 31. 31 Higher-order clustering  higher-order communities. Theory. (pessimistic in practice) Practice. If the higher-order clustering coefficient is non-trivial, then there should be good local clusters.
  • 32. 32 Local higher-order graph clustering Yin, Benson, Leskovec, & Gleich, KDD, 2017. • Studies the general problem of finding local clusters based on motifs (cliques). • Our method is a generalization of the Andersen-Chung-Lang personalized PageRank algorithm that expands clusters around a seed node. • Theoretical guarantees on cluster quality and performance (in practice, < 2 sec / seed on 2B edge graph). Seed node Local cluster
  • 33. 33 Local higher-order graph clustering Yin, Benson, Leskovec, & Gleich, KDD, 2017. • Clusters based on triangles yield better recovery results on common synthetic graph models. Average F1 0.40 0.50 • Clusters based on triangles can better recover a person’s departmental affiliation in an academic email network.
  • 34. 34 Related work  Gleich and Seshadrhi, “Vertex neighborhoods, low conductance cuts, and good seeds for local community methods”, KDD, 2012. Motivation for relating higher-order clustering coefficients to 1-hop neighborhood communities. Intellectually indebted for their proof techniques!  Benson, Gleich, and Leskovec, “Higher-order organization of complex networks,” Science, 2016. Introduced higher-order conductance and a spectral method for optimizing it.  Fronczak et al., “Higher order clustering coefficients in Barabási–Albert networks.” Physica A, 2002. Higher-order clustering by looking at shortest path lengths.  Jiang and Claramunt, “Topological analysis of urban street networks,” Environ. and Planning B, 2004. Higher-order clustering by looking for triangles in k-hop neighborhoods.  Lambiotte et al., “Structural Transitions in Densifying Networks,” PRL, 2016.  Bhat et al., “Densification and structural transitions in networks that grow by node copying,” PRE, 2016. Generative models with similar clique closure ideas.
  • 35. 35 Papers • “Higher-order clustering in networks.” Yin, Benson, and Leskovec. arXiv, 2017. • “Local higher-order graph clustering.” Yin, Benson, Leskovec, and Gleich. KDD, 2017. • “Higher-order organization of complex networks.” Benson, Gleich, and Leskovec. Science, 2016. 1. A generalization of the fundamental measurement of network science through “clique expansion” interpretation. 2. Able to analyze generally and in common random graph models (small-world and Gn,p). 3. old idea all real-world graphs cluster. new idea only cluster up to a certain order. 4. In data, helps distinguish between dense and random (neural connections) and dense and structured (FB friendships, co-authorship). 5. Higher-order clustering implies local (1-hop neighborhood) higher-order communities. Open questions / future work • Is there a generative model that reproduces the observed higher-order clustering coefficients (e.g., forest fire)? • Tighter analysis for 1-hop neighborhood conductance? • Higher-order clustering coefficients for other motifs (i.e., not just cliques). http://cs.cornell.edu/~arb @austinbenson arb@cs.cornell.edu Thanks! Austin Benson
  • 36. 36 Neural connections Facebook friendships Co-authorships Decrease in average clustering with order is independent of degree. For large degrees, Changes in higher-order clustering coefficients tend to be independent of degree.