6. Social Networks can be represented in
graphs
Nodes correspond to individuals
Edges represent interaction among them
A community can be defined as a group
of entities that share similar properties
6
10. Compute the distance
between all vertices
and communities
Choose two communities
based on their similarity
Update the distance
between communities
Merge these two
communities into a new
community
Walk-Trap
11
12. 13
Modularity is based on the idea that a random graph is not expected to have a
community structure
𝑀𝑜𝑑𝑢𝑙𝑎𝑟𝑖𝑡𝑦 = 𝑄 =
1
2𝑚 𝑖𝑗
(𝐴𝑖𝑗 −
𝑑𝑖 𝑑𝑗
2𝑚
)𝛿(𝐶𝑖, 𝐶𝑗)
A: Adjacency Matrix m: the total number of edges in the network
𝑑𝑖: degree of node i
𝛿(𝐶𝑖, 𝐶𝑗) =
1, 𝐶𝑖 = 𝐶𝑗
0, 𝐶𝑖 ≠ 𝐶𝑗
The choice of null model is in principle arbitrary, and several possibilities exist
14. 15
Each node is assigned
to its own community
The algorithm repeatedly merges
pairs of communities together
Repeat the procedure until
only one community
remains
Choose the merger for which the
resulting modularity is the
largest.
FastQ
16. 17
A majority of community detection methods try
to optimize a global metric
Several of methods need initial parameters to
find out the problems
A centralized decision maker has been
proposed by most of the algorithms
17. A distributed framework has been proposed
to detect social networks communities
Each community acts as a selfish agent
to maximize its utility function
We use local utility maximization
Modularity has been chosen as the
community utility function
18
18. Each community just uses local information
to maximize its utility function
Each community has some pre-defined
actions
Each community chooses the best action in
order to have maximum utility
Our distributed framework can perform as
well as the existing centralized approaches
19
19. Local information is used to identify communities
Every community only utilizes the knowledge
obtained from its neighbors
Nodes belonging to a community fall into two
types:
1-Core Set(C): no node in C is linked to the
outside of the community
2-Boundary Set(B): every node in B has at
least one connection to the outside of the
community
20
23. 25
Our goal is to find a
division in which
modularity has been
maximized
𝐬𝐢 = −𝟏
C
C1
C2
𝐒𝐢 = +𝟏
𝐐 =
𝟏
𝟐𝐦 𝐢𝐣
(𝐀𝐢𝐣 −
𝐝𝐢 𝐝𝐣
𝟐𝐦
) 𝐬𝐢 𝐬𝐣
𝑸 =
𝟏
𝟐𝒎
𝒔 𝑻 𝑩𝒔
S is a vector whose
elements are 𝒔𝒊
𝑺 =
𝒊=𝟏
𝒏
𝜶𝒊 𝒖𝒊
𝒖𝒊 is ith Eigen vector
of B
B is a modularity
matrix whose
elements are:
𝑩𝒊𝒋 = (𝑨𝒊𝒋 −
𝒅𝒊 𝒅𝒋
𝟐𝒎
))
25. The proposed method may get stuck at a
local modularity
It may be possible that no community can improve itself
and also modularity is not maximized
27
26. 28
U1 U2
U
𝑼 𝟏
′
𝑼 𝟐
′
𝐔 < 𝐔 𝟏 + 𝐔 𝟐
Merging between C1 and
C2 is irrational
𝐔 𝟏 + 𝐔 𝟐 < 𝐔′ 𝟏 + 𝐔′ 𝟐
But splitting of C is rational
31. DataSet Number of Nodes Number of edges
Karate 34 77
Risk 42 83
Dolphin 62 159
Politics 105 441
AdjNoun 112 425
Football 115 613
Jazz 198 2742
USAir97 332 2126
Email 1133 5452
Power 4941 6594
Internet 22960 48436 33
32. 34
The community structures of the ground truth communities and those detected by 1st proposed
Method and 2nd proposed method on Zachary’s karate club network.
33. 35
The community structures of the ground truth communities and those detected by 1st proposed
Method and 2nd proposed method on Dolphin Network.
34. 36
The community structures of the ground truth communities and those detected by 1st proposed
Method and 2nd proposed method on NCCA Football Network.