SlideShare una empresa de Scribd logo
1 de 35
Group and Community Detection in Social Networks
Outline
1. Basics of Social Networks
2. Real-world problem
3. How to construct graph from real-world problem?
4. What graph theory problem getting from real-world problem?
5. Graph type of Social Networks
6. Special properties in social graph
7. How to find communities and groups in social networks?
(Algorithms)
8. How to interpret graph solution back to real-world problem?
1. Basics of Social Networks
Definitions
Social Network:
finite set or sets of actors and the relation or relations defined on them.
Actor = Node = Point = Agent:
social entities such as persons, organizations, cities, etc.
Tie = Link = Edge = Line = Arc:
represents relationships among actors.
Relation:
collection of ties of a specific kind among members of a group.
Attributes of Actor (nodes)
- People can be queried
about different
features, like
( age, gender, race,
socioeconomic status,
place of residence,
grade in school, etc. )
2. Real-world Problem
Definition : Social Groups and Communities
– “Two or more people , who interact with one another,
share similar characteristics and attributes and
collectively have a sense of unity”
– Actors who have all possible ties among themselves
The real-world problem is:
“Finding Groups And Communities In Social Networks”
• Social networks and the social network analysis:
– Is an interdisciplinary academic field
(social psychology, sociology, statistics, and graph theory)
– 1930; first sociograms in the to study interpersonal relationships, by
Jacob
– 1950; sociograms approaches mathematically formalized
– 1980; theories and methods of
social networks became popular
in the social and behavioral sciences
– Social network analysis is now one
of the major paradigms in
contemporary sociology
http://www.cmu.edu/joss/content/articles/volume1/Freeman.html
Why to find social groups and communities?
–behavior analysis
–location-based interaction analysis
–recommender systems development
–link prediction
–customer interaction and analysis & marketing
–media use
–Security
–Social studies
3. How to Construct Graph From
Real-world Problem?
- Shared Attributes: Actors are grouped based on the shared
attributes among them. i.e. Group of four people (Bob, Carol, Ted,
and Alice)
- Blue for males, red for females
http://faculty.ucr.edu/~hanneman/nettext/C3_Graphs.html
Bob Carol
TedAlice
- Attribute 1: "close friends”: who they regarded as close friends in
the group?
A directed graph of friendship ties
Bob Carol
TedAlice
Bob, Carol, and Ted form a "clique" (i.e. each is connected to each of the others)
Alice is a "pendant" (tied to the group by only one connection)
- Attribute 2: “Spouse”
A directed graph of spousal ties
Bob Carol
TedAlice
4. What Graph Theory Problem Getting
From Real-world Problem?
• Clique problem: refers to any problem to find
particular (complete) subgraphs ("cliques") in
a graph,
• i.e., sets of elements where each pair of
elements is connected.
http://sebastian.doc.gold.ac.uk/
• Note: the notion of clique here
dose not necessary refers to a
complete subgraph,
http://sebastian.doc.gold.ac.uk/
Complete Graph: there's an edge between any two node
Dense Graph: number of edges is close to the maximal number of edges
Sparse Graph: when it has only a few edges
Dense Graph Definition
• A graph G = (V, E) is said to be dense if for every v ∈ V ,
degree(v) > n/2, where n = |V|
• Density is the ratio between the number of edges |E|
and the number of vertices |V|.
• Density for undirected graphs:
• The maximal density is 1 = complete graphs
• Maximum number of edges ½ |V| (|V|−1)
http://www.cc.gatech.edu/~vigoda/MCMC_Course/Lec7.pdf
Complexity of the problem
• Clique problem is NP-Complete problem
– k-clique problem, the input is an undirected graph
and a number k, and the output is a clique of size
k if one exists (or, sometimes, all cliques of size k)
5. Graph Type of Social Networks
Small-World Graph = Scale-Free Graph
– most nodes are not neighbors of one another, but most
nodes can be reached from every other by a small number
of hops or steps.
– Specifically, a small-world network is defined to be a
network where the typical distance L between two
randomly chosen nodes (the number of steps required)
grows proportionally to the logarithm of the number of
nodes N in the network, that is:
http://www.lenddo.com/blog/2012/06/facebook-proves-it%E2%80%99s-a-small-world-after-all-we-are-all-connected-by-six-degrees-or-less/
Last time by: Reem
6. Special Properties
in Social Graphs
• Community Structure: Real-world social graphs are found to exhibit a
modular structure; with nodes forming groups, and possibly groups within
groups
– In a modular graph, the nodes form communities where groups of nodes in
the same community are tighter connected to each other than to those nodes
outside the community
• Heavy-tailed Degree Distribution:
– few “hubs”,
– most nodes have few neighbors
- The degree distribution has a power law (functional relationship)
- many low degree nodes - only a few high degree nodes in real graphs
• Small Diameter: also known as the ‘small-world phenomenon’ or the ‘six
degrees of separation’
M. E. J. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review E, 69:026113, 2004.
7. How to Find Communities nnd Groups
in Social Networks? (Algorithms)
Taxonomy of Community Criteria
- Community detection methods categories:
• Node-Centric Community Detection
– Each node in a group satisfies certain properties
• Group-Centric Community Detection
– Consider the connections within a group as a whole. The group has
to satisfy certain properties without zooming into node-level
• Network-Centric Community Detection
– Partition the whole network into several disjoint sets
• Hierarchy-Centric Community Detection
– Construct a hierarchical structure of communities
A classification of
community
detection and graph
clustering methods
A classification of
community
detection and graph
clustering methods
Clique Percolation Method (CPM)
• Clique is a very strict definition, unstable
• Normally use cliques as a core to find larger communities
• CPM is such a method to find overlapping communities
– Input
• A parameter k, and a network
– Procedure
1. Find out all cliques of size k in a given network
2. Construct a clique graph. Two cliques are adjacent if
they share k-1 nodes
3. Each connected components in the clique graph
form a community 27
Example: Clique Percolation Method
{1, 2, 3}, {1, 3, 4}, {4, 5, 6}, {5, 6, 7}, {5, 6, 8}, {5, 7, 8},
{6, 7, 8}
28
Step 1: Find all Cliques of size 3
29
Step 2: Construct Clique Graph
{1, 2, 3}, {1, 3, 4}, {4, 5, 6},
{5, 6, 7}, {5, 6, 8}, {5, 7, 8},
{6, 7, 8}
30
Step 3: Finding Communities
Two cliques are adjacent if
they share k-1 nodes (i.e. k-1=2)
Communities:
{1, 2, 3, 4}
{4, 5, 6, 7, 8}
{1, 2, 3}, {1, 3, 4}, {4, 5, 6},
{5, 6, 7}, {5, 6, 8}, {5, 7, 8},
{6, 7, 8}
8. How to Interpret Graph Solution
Back to Real-life Problem?
- Finding Cliques in the Social Graph of the Social
Network leads to the communities and groups
inside the Social Networks, based on the
attributes and characteristics of actors in the
communities
References
• Community detection in Social Media, (2012), Symeon Papadopoulos, Yiannis Kompatsiaris,
Athena Vakali, Ploutarchos Spyridonos, Data Mining and Knowledge Discovery May 2012,
Volume 24, Issue 3, pp 515-554
• Community Detection in Graphs, (2010), Santo Fortunato, Complex Networks and Systems
Lagrange Laboratory, ISI Foundation, Viale S. Severo 65, 10133, Torino,I-ITALY.
• A Comparison of Community Detection Algorithms on Artificial Networks, (2009), Günce
Keziban Orman1,2 and Vincent Labatut , Discovery Science Lecture Notes in Computer
Science Volume 5808, pp 242-256
• Social Network Analysis. Methods and Applications, (2008), Wasserman, Stanley, Faust,
Katherine, Cambridge, University Press
• Computing Communities in Large Networks Using Random Walks, (2005), Pascal Pons and
Matthieu Latapy, Computer and Information Sciences – ISCIS, Lecture Notes in Computer
Science Volume 3733, 2005, pp 284-293
• Introduction to social network methods, (2005) Robert A. Hanneman and Mark Riddle,
University of California,
Group and Community Detection in Social Networks
http://www.lifelearn.com/2013/11/5-easy-creative-ways-thank-social-media-followers/

Más contenido relacionado

La actualidad más candente

Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewSatyaki Sikdar
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networksFrancisco Restivo
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network AnalysisPremsankar Chakkingal
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
5.3 mining sequential patterns
5.3 mining sequential patterns5.3 mining sequential patterns
5.3 mining sequential patternsKrish_ver2
 
Classification decision tree
Classification  decision treeClassification  decision tree
Classification decision treeyazad dumasia
 
Social network analysis intro part I
Social network analysis intro part ISocial network analysis intro part I
Social network analysis intro part ITHomas Plotkowiak
 
DBSCAN (2014_11_25 06_21_12 UTC)
DBSCAN (2014_11_25 06_21_12 UTC)DBSCAN (2014_11_25 06_21_12 UTC)
DBSCAN (2014_11_25 06_21_12 UTC)Cory Cook
 
K MEANS CLUSTERING
K MEANS CLUSTERINGK MEANS CLUSTERING
K MEANS CLUSTERINGsingh7599
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisFred Stutzman
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisSujoy Bag
 
Network centrality measures and their effectiveness
Network centrality measures and their effectivenessNetwork centrality measures and their effectiveness
Network centrality measures and their effectivenessemapesce
 
5.5 graph mining
5.5 graph mining5.5 graph mining
5.5 graph miningKrish_ver2
 
Trust and Recommender Systems
Trust and  Recommender SystemsTrust and  Recommender Systems
Trust and Recommender Systemszhayefei
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision treeKrish_ver2
 

La actualidad más candente (20)

Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief Overview
 
Social Network Analysis (SNA)
Social Network Analysis (SNA)Social Network Analysis (SNA)
Social Network Analysis (SNA)
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networks
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Dbscan algorithom
Dbscan algorithomDbscan algorithom
Dbscan algorithom
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
5.3 mining sequential patterns
5.3 mining sequential patterns5.3 mining sequential patterns
5.3 mining sequential patterns
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Classification decision tree
Classification  decision treeClassification  decision tree
Classification decision tree
 
Social network analysis intro part I
Social network analysis intro part ISocial network analysis intro part I
Social network analysis intro part I
 
DBSCAN (2014_11_25 06_21_12 UTC)
DBSCAN (2014_11_25 06_21_12 UTC)DBSCAN (2014_11_25 06_21_12 UTC)
DBSCAN (2014_11_25 06_21_12 UTC)
 
K MEANS CLUSTERING
K MEANS CLUSTERINGK MEANS CLUSTERING
K MEANS CLUSTERING
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Network centrality measures and their effectiveness
Network centrality measures and their effectivenessNetwork centrality measures and their effectiveness
Network centrality measures and their effectiveness
 
Python networkx library quick start guide
Python networkx library quick start guidePython networkx library quick start guide
Python networkx library quick start guide
 
5.5 graph mining
5.5 graph mining5.5 graph mining
5.5 graph mining
 
Trust and Recommender Systems
Trust and  Recommender SystemsTrust and  Recommender Systems
Trust and Recommender Systems
 
2.2 decision tree
2.2 decision tree2.2 decision tree
2.2 decision tree
 
Fuzzy Clustering(C-means, K-means)
Fuzzy Clustering(C-means, K-means)Fuzzy Clustering(C-means, K-means)
Fuzzy Clustering(C-means, K-means)
 

Destacado

Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social MediaSymeon Papadopoulos
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphsNicola Barbieri
 
Community Detection with Networkx
Community Detection with NetworkxCommunity Detection with Networkx
Community Detection with NetworkxErika Fille Legara
 
Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]sdnumaygmailcom
 
Clique-based Network Clustering
Clique-based Network ClusteringClique-based Network Clustering
Clique-based Network ClusteringGuang Ouyang
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Mediarezahk
 
Community detection from a computational social science perspective
Community detection from a computational social science perspectiveCommunity detection from a computational social science perspective
Community detection from a computational social science perspectiveDavide Bennato
 
Social network analysis basics
Social network analysis basicsSocial network analysis basics
Social network analysis basicsPradeep Kumar
 
Community Detection in Brain Networks
Community Detection in Brain NetworksCommunity Detection in Brain Networks
Community Detection in Brain NetworksManas Gaur
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Advanced Methods in Network Science: Community Detection Algorithms
Advanced Methods in Network Science: Community Detection Algorithms Advanced Methods in Network Science: Community Detection Algorithms
Advanced Methods in Network Science: Community Detection Algorithms Daniel Katz
 
Paolo Rosso "On irony detection in social media"
Paolo Rosso "On irony detection in social media"Paolo Rosso "On irony detection in social media"
Paolo Rosso "On irony detection in social media"AINL Conferences
 
Twitter SmartList (第5回若手webエンジニア交流会)
Twitter SmartList (第5回若手webエンジニア交流会)Twitter SmartList (第5回若手webエンジニア交流会)
Twitter SmartList (第5回若手webエンジニア交流会)mosa siru
 
This isn't what I thought it was: community in the network age
This isn't what I thought it was: community in the network ageThis isn't what I thought it was: community in the network age
This isn't what I thought it was: community in the network ageNancy Wright White
 
The Network, the Community and the Self-Creativity
The Network, the Community and the Self-CreativityThe Network, the Community and the Self-Creativity
The Network, the Community and the Self-CreativityVince Cammarata
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Detecting Community Structures in Social Networks by Graph Sparsification
Detecting Community Structures in Social Networks by Graph SparsificationDetecting Community Structures in Social Networks by Graph Sparsification
Detecting Community Structures in Social Networks by Graph SparsificationSatyaki Sikdar
 
Network sampling, community detection
Network sampling, community detectionNetwork sampling, community detection
Network sampling, community detectionroberval mariano
 

Destacado (20)

Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphs
 
Community Detection with Networkx
Community Detection with NetworkxCommunity Detection with Networkx
Community Detection with Networkx
 
Community detection in social networks[1]
Community detection in social networks[1]Community detection in social networks[1]
Community detection in social networks[1]
 
Clique-based Network Clustering
Clique-based Network ClusteringClique-based Network Clustering
Clique-based Network Clustering
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Community detection from a computational social science perspective
Community detection from a computational social science perspectiveCommunity detection from a computational social science perspective
Community detection from a computational social science perspective
 
Social network analysis basics
Social network analysis basicsSocial network analysis basics
Social network analysis basics
 
Kernighan lin
Kernighan linKernighan lin
Kernighan lin
 
Community Detection in Brain Networks
Community Detection in Brain NetworksCommunity Detection in Brain Networks
Community Detection in Brain Networks
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Advanced Methods in Network Science: Community Detection Algorithms
Advanced Methods in Network Science: Community Detection Algorithms Advanced Methods in Network Science: Community Detection Algorithms
Advanced Methods in Network Science: Community Detection Algorithms
 
Paolo Rosso "On irony detection in social media"
Paolo Rosso "On irony detection in social media"Paolo Rosso "On irony detection in social media"
Paolo Rosso "On irony detection in social media"
 
Twitter SmartList (第5回若手webエンジニア交流会)
Twitter SmartList (第5回若手webエンジニア交流会)Twitter SmartList (第5回若手webエンジニア交流会)
Twitter SmartList (第5回若手webエンジニア交流会)
 
This isn't what I thought it was: community in the network age
This isn't what I thought it was: community in the network ageThis isn't what I thought it was: community in the network age
This isn't what I thought it was: community in the network age
 
The Network, the Community and the Self-Creativity
The Network, the Community and the Self-CreativityThe Network, the Community and the Self-Creativity
The Network, the Community and the Self-Creativity
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Detecting Community Structures in Social Networks by Graph Sparsification
Detecting Community Structures in Social Networks by Graph SparsificationDetecting Community Structures in Social Networks by Graph Sparsification
Detecting Community Structures in Social Networks by Graph Sparsification
 
Network sampling, community detection
Network sampling, community detectionNetwork sampling, community detection
Network sampling, community detection
 
Staying Alfoat in Social Media
Staying Alfoat in Social MediaStaying Alfoat in Social Media
Staying Alfoat in Social Media
 

Similar a Group and Community Detection in Social Networks

01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)Duke Network Analysis Center
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measuresdnac
 
LSS'11: Charting Collections Of Connections In Social Media
LSS'11: Charting Collections Of Connections In Social MediaLSS'11: Charting Collections Of Connections In Social Media
LSS'11: Charting Collections Of Connections In Social MediaLocal Social Summit
 
20111103 con tech2011-marc smith
20111103 con tech2011-marc smith20111103 con tech2011-marc smith
20111103 con tech2011-marc smithMarc Smith
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc SmithMarc Smith
 
20111123 mwa2011-marc smith
20111123 mwa2011-marc smith20111123 mwa2011-marc smith
20111123 mwa2011-marc smithMarc Smith
 
community Detection.pptx
community Detection.pptxcommunity Detection.pptx
community Detection.pptxBhuvana97
 
16 zaman nips10_workshop_v2
16 zaman nips10_workshop_v216 zaman nips10_workshop_v2
16 zaman nips10_workshop_v2talktoharry
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network AnalysisMarc Smith
 
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...Marc Smith
 
20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...Marc Smith
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Michael Mathioudakis
 
20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...Marc Smith
 
02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and OverviewDuke Network Analysis Center
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012CameliaN
 
Social Networks and Computer Science
Social Networks and Computer ScienceSocial Networks and Computer Science
Social Networks and Computer Sciencedragonmeteor
 
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKSSCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKSIJDKP
 

Similar a Group and Community Detection in Social Networks (20)

SSRI_pt1.ppt
SSRI_pt1.pptSSRI_pt1.ppt
SSRI_pt1.ppt
 
01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)01 Introduction to Networks Methods and Measures (2016)
01 Introduction to Networks Methods and Measures (2016)
 
01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures01 Introduction to Networks Methods and Measures
01 Introduction to Networks Methods and Measures
 
LSS'11: Charting Collections Of Connections In Social Media
LSS'11: Charting Collections Of Connections In Social MediaLSS'11: Charting Collections Of Connections In Social Media
LSS'11: Charting Collections Of Connections In Social Media
 
20111103 con tech2011-marc smith
20111103 con tech2011-marc smith20111103 con tech2011-marc smith
20111103 con tech2011-marc smith
 
AI Class Topic 5: Social Network Graph
AI Class Topic 5:  Social Network GraphAI Class Topic 5:  Social Network Graph
AI Class Topic 5: Social Network Graph
 
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
 
20111123 mwa2011-marc smith
20111123 mwa2011-marc smith20111123 mwa2011-marc smith
20111123 mwa2011-marc smith
 
community Detection.pptx
community Detection.pptxcommunity Detection.pptx
community Detection.pptx
 
16 zaman nips10_workshop_v2
16 zaman nips10_workshop_v216 zaman nips10_workshop_v2
16 zaman nips10_workshop_v2
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis
 
01 Network Data Collection (2017)
01 Network Data Collection (2017)01 Network Data Collection (2017)
01 Network Data Collection (2017)
 
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
 
20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...20120301 strata-marc smith-mapping social media networks with no coding using...
20120301 strata-marc smith-mapping social media networks with no coding using...
 
Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020Mining the Social Web - Lecture 2 - T61.6020
Mining the Social Web - Lecture 2 - T61.6020
 
20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...20121010 marc smith - mapping collections of connections in social media with...
20121010 marc smith - mapping collections of connections in social media with...
 
02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview02 Introduction to Social Networks and Health: Key Concepts and Overview
02 Introduction to Social Networks and Health: Key Concepts and Overview
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012
 
Social Networks and Computer Science
Social Networks and Computer ScienceSocial Networks and Computer Science
Social Networks and Computer Science
 
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKSSCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
SCALABLE LOCAL COMMUNITY DETECTION WITH MAPREDUCE FOR LARGE NETWORKS
 

Último

CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionajayrajaganeshkayala
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best PracticesDataArchiva
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...PrithaVashisht1
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructuresonikadigital1
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?sonikadigital1
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Vladislav Solodkiy
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerPavel Šabatka
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxDwiAyuSitiHartinah
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)Data & Analytics Magazin
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Guido X Jansen
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introductionsanjaymuralee1
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 

Último (17)

CI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual interventionCI, CD -Tools to integrate without manual intervention
CI, CD -Tools to integrate without manual intervention
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...Elements of language learning - an analysis of how different elements of lang...
Elements of language learning - an analysis of how different elements of lang...
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 
How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?How is Real-Time Analytics Different from Traditional OLAP?
How is Real-Time Analytics Different from Traditional OLAP?
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptxTINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
TINJUAN PEMROSESAN TRANSAKSI DAN ERP.pptx
 
AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)AI for Sustainable Development Goals (SDGs)
AI for Sustainable Development Goals (SDGs)
 
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
Persuasive E-commerce, Our Biased Brain @ Bikkeldag 2024
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 
Virtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product IntroductionVirtuosoft SmartSync Product Introduction
Virtuosoft SmartSync Product Introduction
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 

Group and Community Detection in Social Networks

  • 2. Outline 1. Basics of Social Networks 2. Real-world problem 3. How to construct graph from real-world problem? 4. What graph theory problem getting from real-world problem? 5. Graph type of Social Networks 6. Special properties in social graph 7. How to find communities and groups in social networks? (Algorithms) 8. How to interpret graph solution back to real-world problem?
  • 3. 1. Basics of Social Networks
  • 4. Definitions Social Network: finite set or sets of actors and the relation or relations defined on them. Actor = Node = Point = Agent: social entities such as persons, organizations, cities, etc. Tie = Link = Edge = Line = Arc: represents relationships among actors. Relation: collection of ties of a specific kind among members of a group.
  • 5. Attributes of Actor (nodes) - People can be queried about different features, like ( age, gender, race, socioeconomic status, place of residence, grade in school, etc. )
  • 7. Definition : Social Groups and Communities – “Two or more people , who interact with one another, share similar characteristics and attributes and collectively have a sense of unity” – Actors who have all possible ties among themselves The real-world problem is: “Finding Groups And Communities In Social Networks”
  • 8. • Social networks and the social network analysis: – Is an interdisciplinary academic field (social psychology, sociology, statistics, and graph theory) – 1930; first sociograms in the to study interpersonal relationships, by Jacob – 1950; sociograms approaches mathematically formalized – 1980; theories and methods of social networks became popular in the social and behavioral sciences – Social network analysis is now one of the major paradigms in contemporary sociology http://www.cmu.edu/joss/content/articles/volume1/Freeman.html
  • 9. Why to find social groups and communities? –behavior analysis –location-based interaction analysis –recommender systems development –link prediction –customer interaction and analysis & marketing –media use –Security –Social studies
  • 10. 3. How to Construct Graph From Real-world Problem?
  • 11. - Shared Attributes: Actors are grouped based on the shared attributes among them. i.e. Group of four people (Bob, Carol, Ted, and Alice) - Blue for males, red for females http://faculty.ucr.edu/~hanneman/nettext/C3_Graphs.html Bob Carol TedAlice
  • 12. - Attribute 1: "close friends”: who they regarded as close friends in the group? A directed graph of friendship ties Bob Carol TedAlice Bob, Carol, and Ted form a "clique" (i.e. each is connected to each of the others) Alice is a "pendant" (tied to the group by only one connection)
  • 13. - Attribute 2: “Spouse” A directed graph of spousal ties Bob Carol TedAlice
  • 14. 4. What Graph Theory Problem Getting From Real-world Problem?
  • 15. • Clique problem: refers to any problem to find particular (complete) subgraphs ("cliques") in a graph, • i.e., sets of elements where each pair of elements is connected. http://sebastian.doc.gold.ac.uk/
  • 16. • Note: the notion of clique here dose not necessary refers to a complete subgraph, http://sebastian.doc.gold.ac.uk/ Complete Graph: there's an edge between any two node Dense Graph: number of edges is close to the maximal number of edges Sparse Graph: when it has only a few edges
  • 17. Dense Graph Definition • A graph G = (V, E) is said to be dense if for every v ∈ V , degree(v) > n/2, where n = |V| • Density is the ratio between the number of edges |E| and the number of vertices |V|. • Density for undirected graphs: • The maximal density is 1 = complete graphs • Maximum number of edges ½ |V| (|V|−1) http://www.cc.gatech.edu/~vigoda/MCMC_Course/Lec7.pdf
  • 18. Complexity of the problem • Clique problem is NP-Complete problem – k-clique problem, the input is an undirected graph and a number k, and the output is a clique of size k if one exists (or, sometimes, all cliques of size k)
  • 19. 5. Graph Type of Social Networks
  • 20. Small-World Graph = Scale-Free Graph – most nodes are not neighbors of one another, but most nodes can be reached from every other by a small number of hops or steps. – Specifically, a small-world network is defined to be a network where the typical distance L between two randomly chosen nodes (the number of steps required) grows proportionally to the logarithm of the number of nodes N in the network, that is: http://www.lenddo.com/blog/2012/06/facebook-proves-it%E2%80%99s-a-small-world-after-all-we-are-all-connected-by-six-degrees-or-less/ Last time by: Reem
  • 21. 6. Special Properties in Social Graphs
  • 22. • Community Structure: Real-world social graphs are found to exhibit a modular structure; with nodes forming groups, and possibly groups within groups – In a modular graph, the nodes form communities where groups of nodes in the same community are tighter connected to each other than to those nodes outside the community • Heavy-tailed Degree Distribution: – few “hubs”, – most nodes have few neighbors - The degree distribution has a power law (functional relationship) - many low degree nodes - only a few high degree nodes in real graphs • Small Diameter: also known as the ‘small-world phenomenon’ or the ‘six degrees of separation’ M. E. J. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review E, 69:026113, 2004.
  • 23. 7. How to Find Communities nnd Groups in Social Networks? (Algorithms)
  • 24. Taxonomy of Community Criteria - Community detection methods categories: • Node-Centric Community Detection – Each node in a group satisfies certain properties • Group-Centric Community Detection – Consider the connections within a group as a whole. The group has to satisfy certain properties without zooming into node-level • Network-Centric Community Detection – Partition the whole network into several disjoint sets • Hierarchy-Centric Community Detection – Construct a hierarchical structure of communities
  • 25. A classification of community detection and graph clustering methods
  • 26. A classification of community detection and graph clustering methods
  • 27. Clique Percolation Method (CPM) • Clique is a very strict definition, unstable • Normally use cliques as a core to find larger communities • CPM is such a method to find overlapping communities – Input • A parameter k, and a network – Procedure 1. Find out all cliques of size k in a given network 2. Construct a clique graph. Two cliques are adjacent if they share k-1 nodes 3. Each connected components in the clique graph form a community 27
  • 28. Example: Clique Percolation Method {1, 2, 3}, {1, 3, 4}, {4, 5, 6}, {5, 6, 7}, {5, 6, 8}, {5, 7, 8}, {6, 7, 8} 28 Step 1: Find all Cliques of size 3
  • 29. 29 Step 2: Construct Clique Graph {1, 2, 3}, {1, 3, 4}, {4, 5, 6}, {5, 6, 7}, {5, 6, 8}, {5, 7, 8}, {6, 7, 8}
  • 30. 30 Step 3: Finding Communities Two cliques are adjacent if they share k-1 nodes (i.e. k-1=2) Communities: {1, 2, 3, 4} {4, 5, 6, 7, 8} {1, 2, 3}, {1, 3, 4}, {4, 5, 6}, {5, 6, 7}, {5, 6, 8}, {5, 7, 8}, {6, 7, 8}
  • 31. 8. How to Interpret Graph Solution Back to Real-life Problem?
  • 32. - Finding Cliques in the Social Graph of the Social Network leads to the communities and groups inside the Social Networks, based on the attributes and characteristics of actors in the communities
  • 33. References • Community detection in Social Media, (2012), Symeon Papadopoulos, Yiannis Kompatsiaris, Athena Vakali, Ploutarchos Spyridonos, Data Mining and Knowledge Discovery May 2012, Volume 24, Issue 3, pp 515-554 • Community Detection in Graphs, (2010), Santo Fortunato, Complex Networks and Systems Lagrange Laboratory, ISI Foundation, Viale S. Severo 65, 10133, Torino,I-ITALY. • A Comparison of Community Detection Algorithms on Artificial Networks, (2009), Günce Keziban Orman1,2 and Vincent Labatut , Discovery Science Lecture Notes in Computer Science Volume 5808, pp 242-256 • Social Network Analysis. Methods and Applications, (2008), Wasserman, Stanley, Faust, Katherine, Cambridge, University Press • Computing Communities in Large Networks Using Random Walks, (2005), Pascal Pons and Matthieu Latapy, Computer and Information Sciences – ISCIS, Lecture Notes in Computer Science Volume 3733, 2005, pp 284-293 • Introduction to social network methods, (2005) Robert A. Hanneman and Mark Riddle, University of California,