Organizational Overlap on Social Networks and its Applications

•

1 recomendación•537 vistas

Mitul Tiwari

WWW 2013 paper presentation slides. Paper can be found here: http://mitultiwari.net/docs/papers/www13_overlap.pdf

Internet

Organizational Overlap on
Social Networks and its
Applications
Mitul Tiwari
Joint work with Cho-Jui Hsieh, Deepak Agarwal,
Xinyi (Lisa) Huang, and Sam Shah
LinkedIn

3
Outline
• Motivation
• Organizational Overlap Model
• Problem Definition
• Data Analysis
• Mathematical Formulation
• Experimental Validation
• Applications
• Link Prediction
• Community Detection

4
Motivation
• Social Networks : important for
• Sharing and Discovery
• Communication
• Networking
• Online Social Networks are partially observed
• Link Prediction and Recommending entities are important

10
Motivation: Recommender Ecosystem
Similar Profiles
Connections
News
Skill Endorsements

12
Outline
• Motivation
• Organizational Overlap Model
• Problem Definition
• Data Analysis
• Mathematical Formulation
• Experimental Validation
• Applications
• Link Prediction
• Community Detection

13
Organizational Overlap Problem
• Goal: compute the probability of connection based on the
organizational time overlap
• Organizational time overlap between two members A and B,
who belonged to the same organization O : T(A, B, O)
• Probability that A and B are connected: P(A, B)
• P(A, B) = f(T(A, B, O), O), over all organizations O
• A function of time overlapped in the organization O
• Properties of the organization O

14
Organizational Overlap Data
Analysis
• Insight 1: Connection density increases with organizational
time overlap

15
Organizational Overlap Data
Analysis
• Insight 2: Connection density decreases with the size of
the organizational

18
Organizational Overlap Model
Validation
• Empirical connection
density fits our model

19
Organizational Overlap Model:
Estimating λ
• λ: organization dependent
parameter
• Members of smaller
organization is more likely to
know each other
• Empirical and MLE estimates
for log(λ) ~ -0.8 log(|S|)

20
Outline
• Motivation
• Organizational Overlap Model
• Problem Definition
• Data Analysis
• Mathematical Formulation
• Experimental Validation
• Applications
• Link Prediction
• Community Detection

21
Application: Link Prediction
• Warm start: existing edges
• 2 features: org. overlap
time and size
• Common Neighbors (CN)
• Adamic-Adar (AA)
• Data Sets: LinkedIn, Enron
emails, Wiki talk

22
Application: Link Prediction
• Cold start: no or sparse
edges
• All features:
• time overlap, company size,
company propensity, node
propensity, ...

23
Application: Community Detection
• Good for candidate generation for an entity recommendation
systems, such as, companies to follow
• Graph Clustering algorithm (Graclus)
• Members as nodes and an edge between any pair of nodes with overlap
• Organizational overlap model for computing edge weight
• Graclus: minimizes the total weight of the cuts
• Evaluation using
• Virality of company follow within communities
• Virality of article updates

24
Community Detection Evaluation
• Using Spread of company follow
• Compared 3 methods
• Organizational overlap based
• Using social connections graph
• Random: partition the nodes in the
same company
• Spread: avg # of companies
followed within d days of the
first follow event
• Propagation rate: norm. spread

25
Community Detection Evaluation
• Virality of article updates within communities
Avg degree: 4-6 Avg degree: 12-14

27
Summary
• Motivation
• Organizational Overlap Model
• Problem Definition
• Data Analysis
• Mathematical Formulation
• Experimental Validation
• Applications
• Link Prediction
• Community Detection

28
Acknowledgement
• http://data.linkedin.com
• We are hiring!
• Contact: mtiwari[at]linkedin.com
• Follow: @mitultiwari on Twitter

Más contenido relacionado

Destacado

Hope-The Melanoma journey Nathan Jones

Melbourne FC presentation 2014 Nathan Jones

Resume ppgonzalansing

iPad APPetiser VITTA PresentationNathan Jones

The iPad Comes to Life with Augmented RealityNathan Jones

Plant hormonewarinda_lorsawat

ASK_GeneralPresentation_11.12.14Tracy White

Destacado (7)

Hope-The Melanoma journey

Melbourne FC presentation 2014

Resume pp

iPad APPetiser VITTA Presentation

The iPad Comes to Life with Augmented Reality

Plant hormone

ASK_GeneralPresentation_11.12.14

Similar a Organizational Overlap on Social Networks and its Applications

Large scale social recommender systems and their evaluationMitul Tiwari

Large scale social recommender systems at LinkedInMitul Tiwari

Exploring Generative Models of Tripartite Graphs for Recommendation in Social...Charalampos Chelmis

Social CI: A Work method and a tool for Competitive Intelligence NetworkingComintelli

Multilevel Collaboration between Software Developers and the Impact of Proxim...Dawn Foster

GraphTour London 2020 - Graphs for AI, Amy HodlerNeo4j

ONA and the tools landscapePatti Anklam

Relationships Matter: Using Connected Data for Better Machine LearningNeo4j

Browsemap: Collaborative Filtering at LinkedInLili Wu

DIY ERM (Do-It-Yourself Electronic Resources Management) for the Small LibraryNASIG

Data Mining In Social Networks Using K-Means Clustering Algorithmnishant24894

Seams2016 presentation calikli_et_alGul Calikli

TruSIS: Trust Accross Social NetworkLora Aroyo

Using Social Network Analysis to Assess Organizational Development InitiativesStephanie Richter

01-introduction.ppt the paper that you can unless you want to join me because...teodroscampaus

CC TEL- Simulation-based co-design of algorithmsSebastian Dennerlein

Social Network Analysis (Part 1)Vala Ali Rohani

Mathematicians, Social Scientists, or Engineers? The Split Minds of Software ...Lionel Briand

Visualizing Community through Social Network AnalysisStephanie Richter

Mingle spot projectsaikrishnabachuwar

Similar a Organizational Overlap on Social Networks and its Applications (20)

Large scale social recommender systems and their evaluation

Large scale social recommender systems at LinkedIn

Exploring Generative Models of Tripartite Graphs for Recommendation in Social...

Social CI: A Work method and a tool for Competitive Intelligence Networking

Multilevel Collaboration between Software Developers and the Impact of Proxim...

GraphTour London 2020 - Graphs for AI, Amy Hodler

ONA and the tools landscape

Relationships Matter: Using Connected Data for Better Machine Learning

Browsemap: Collaborative Filtering at LinkedIn

DIY ERM (Do-It-Yourself Electronic Resources Management) for the Small Library

Data Mining In Social Networks Using K-Means Clustering Algorithm

Seams2016 presentation calikli_et_al

TruSIS: Trust Accross Social Network

Using Social Network Analysis to Assess Organizational Development Initiatives

01-introduction.ppt the paper that you can unless you want to join me because...

CC TEL- Simulation-based co-design of algorithms

Social Network Analysis (Part 1)

Mathematicians, Social Scientists, or Engineers? The Split Minds of Software ...

Visualizing Community through Social Network Analysis

Mingle spot project

Más de Mitul Tiwari

Big Data Ecosystem at LinkedIn. Keynote talk at Big Data Innovators Gathering...Mitul Tiwari

Modeling Impression discounting in large-scale recommender systemsMitul Tiwari

Metaphor: A system for related searches recommendationsMitul Tiwari

Related searches at LinkedInMitul Tiwari

Structural Diversity in Social Recommender SystemsMitul Tiwari

Large-scale Social Recommendation Systems: Challenges and OpportunityMitul Tiwari

Building Data Driven Products at LinkedinMitul Tiwari

Social Network Analysis at LinkedInMitul Tiwari

Más de Mitul Tiwari (8)

Big Data Ecosystem at LinkedIn. Keynote talk at Big Data Innovators Gathering...

Modeling Impression discounting in large-scale recommender systems

Metaphor: A system for related searches recommendations

Último

Call Girls in Prashant Vihar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service9953056974 Low Rate Call Girls In Saket, Delhi NCR

Trump Diapers Over Dems t shirts Sweatshirtrahman018755

2nd Solid Symposium: Solid Pods vs Personal Knowledge GraphsEleniIlkou

(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7Call Girls in Nagpur High Profile Call Girls

All Time Service Available Call Girls Mg Road 👌 ⏭️ 6378878445ruhi

Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableSeo

( Pune ) VIP Baner Call Girls 🎗️ 9352988975 Sizzling | Escorts | Girls Are Re...nilamkumrai

Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...Delhi Call girls

VIP Call Girls Pollachi 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698

VIP Call Girls Himatnagar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698

Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...tanu pandey

📱Dehradun Call Girls Service 📱☎️ +91'905,3900,678 ☎️📱 Call Girls In Dehradun 📱@Chandigarh #call #Girls 9053900678 @Call #Girls in @Punjab 9053900678

Sarola * Female Escorts Service in Pune | 8005736733 Independent Escorts & Da...SUHANI PANDEY

💚😋 Bilaspur Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋nirzagarg

💚😋 Salem Escort Service Call Girls, 9352852248 ₹5000 To 25K With AC💚😋nirzagarg

20240507 QFM013 Machine Intelligence Reading List April 2024.pdfMatthew Sinclair

pdfcoffee.com_business-ethics-q3m7-pdf-free.pdfJOHNBEBONYAP1

Pune Airport ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready...tanu pandey

➥🔝 7737669865 🔝▻ mehsana Call-girls in Women Seeking Men 🔝mehsana🔝 Escorts...nirzagarg

Microsoft Azure Arc Customer Deck MicrosoftAanSulistiyo

Organizational Overlap on Social Networks and its Applications

1. Organizational Overlap on Social Networks and its Applications Mitul Tiwari Joint work with Cho-Jui Hsieh, Deepak Agarwal, Xinyi (Lisa) Huang, and Sam Shah LinkedIn

2. 2 Who am I

3. 3 Outline • Motivation • Organizational Overlap Model • Problem Definition • Data Analysis • Mathematical Formulation • Experimental Validation • Applications • Link Prediction • Community Detection

4. 4 Motivation • Social Networks : important for • Sharing and Discovery • Communication • Networking • Online Social Networks are partially observed • Link Prediction and Recommending entities are important

5. 5 Motivation: Rich Member Profile

6. 6 Motivation: Network is Important

7. 7 Motivation: People You May Know

8. 8 Motivation: Other Entities

9. 10 Motivation: Recommender Ecosystem Similar Profiles Connections News Skill Endorsements

10. 11 Motivation • Member profile contains various types of organizations • Company, Schools, Groups, ... • Can we compute edge affinity based on these organization information? • Useful for many applications: • Recommending members to connect (link prediction) • Recommending other entities from the same community (community detection)

11. 12 Outline • Motivation • Organizational Overlap Model • Problem Definition • Data Analysis • Mathematical Formulation • Experimental Validation • Applications • Link Prediction • Community Detection

12. 13 Organizational Overlap Problem • Goal: compute the probability of connection based on the organizational time overlap • Organizational time overlap between two members A and B, who belonged to the same organization O : T(A, B, O) • Probability that A and B are connected: P(A, B) • P(A, B) = f(T(A, B, O), O), over all organizations O • A function of time overlapped in the organization O • Properties of the organization O

13. 14 Organizational Overlap Data Analysis • Insight 1: Connection density increases with organizational time overlap

14. 15 Organizational Overlap Data Analysis • Insight 2: Connection density decreases with the size of the organizational

15. 16 Organizational Overlap Model

16. 17 Organizational Overlap Model

17. 18 Organizational Overlap Model Validation • Empirical connection density fits our model

18. 19 Organizational Overlap Model: Estimating λ • λ: organization dependent parameter • Members of smaller organization is more likely to know each other • Empirical and MLE estimates for log(λ) ~ -0.8 log(|S|)

19. 20 Outline • Motivation • Organizational Overlap Model • Problem Definition • Data Analysis • Mathematical Formulation • Experimental Validation • Applications • Link Prediction • Community Detection

20. 21 Application: Link Prediction • Warm start: existing edges • 2 features: org. overlap time and size • Common Neighbors (CN) • Adamic-Adar (AA) • Data Sets: LinkedIn, Enron emails, Wiki talk

21. 22 Application: Link Prediction • Cold start: no or sparse edges • All features: • time overlap, company size, company propensity, node propensity, ...

22. 23 Application: Community Detection • Good for candidate generation for an entity recommendation systems, such as, companies to follow • Graph Clustering algorithm (Graclus) • Members as nodes and an edge between any pair of nodes with overlap • Organizational overlap model for computing edge weight • Graclus: minimizes the total weight of the cuts • Evaluation using • Virality of company follow within communities • Virality of article updates

23. 24 Community Detection Evaluation • Using Spread of company follow • Compared 3 methods • Organizational overlap based • Using social connections graph • Random: partition the nodes in the same company • Spread: avg # of companies followed within d days of the first follow event • Propagation rate: norm. spread

24. 25 Community Detection Evaluation • Virality of article updates within communities Avg degree: 4-6 Avg degree: 12-14

25. 26 Related Work

26. 27 Summary • Motivation • Organizational Overlap Model • Problem Definition • Data Analysis • Mathematical Formulation • Experimental Validation • Applications • Link Prediction • Community Detection

27. 28 Acknowledgement • http://data.linkedin.com • We are hiring! • Contact: mtiwari[at]linkedin.com • Follow: @mitultiwari on Twitter

28. 29 Questions?

Notas del editor

Hi, I am Mitul Tiwari. Today I am going to present our paper on “Organizational Overlap on Social Networks and its Applications”. This is joint work with Cho-Jui, Deepak, Lisa, and Sam
Here is the outline of the rest of my talk.
LinkedIn is the second largest social network for professionals with more than 225 million members.
Members can create profiles with their education and employment details
Members can connect with each other and maintain their professional network on linkedin. TODO: replace screenshot
PYMK is a large scale recommendation system that helps you connect with others. Basically, PYMK is a link prediction problem, where we analyze billions of edges to recommend possible connections to you. A big big-data problem!
Companies can create pages and members can follow companies. TODO: replace screenshot
LinkedIn’s homepage is powered by recommendation engine: News, Connections, Jobs, Groups, Companies Also, ADs, Releavant Updates
A rich recommender systems ecosystem at linkedin: from connections, news, skills, Jobs, companies, groups, search queries, talent, similar profiles, ...
Here is the outline of the rest of my talk.
For a company A, this graph shows connection density, that is, the ratio of the # of connection with certain time overlap t within Company A and the total number of pairs with time overlap t within Company A We observe that connection density increases with time overlap t We see similar behavior with many companies, groups, and schools We came to this insight that connection density increases with organizational time overlap
we sampled companies of different sizes we calculated connection density with respect to company size we observed that connection density decreases as the size of the organization increases it makes sense since in a smaller organization people know each other
1. Community-Affiliation Graph Model (AGM) proposes P(O1, O2) = 1 - (1-P(O1))(1-P(O2)) Using that we can come to assumption 1 2. P(t) is probability, so we can safely assume that it is between 0 and 1. And P(t) is 0 iff t=0, that is, there is no overlap
1. Assumption 1 can be used to further decompose a time interval t into m smaller intervals to get Lemma1 2. P(δt) = 0 from Assumption 2. Using Assumption 1: P(t-delta t) = p(t) = p(t+delta t) 3. From Lemma 1 and Lemma 2 we can derive: 1-P(t) = Q(1)^t
Empirical connection density value fits our model well. In large companies it is not possible to have P(t) to be 1 for large t. We observe an upper bound mu for the probability
MLE: maximize log likelihood that is : Sum ( X_i log(P(t_i) + (1-X_i)log(1-P(t_i)) )
Here is the outline of the rest of my talk.
warm start setting where we have existing edges Enron emails: Wiki talk: conversation, discussion between editors. Edits on the same page implies conversation
Here is the outline of the rest of my talk.
questions, details, hiring

Organizational Overlap on Social Networks and its Applications

Recomendados

Recomendados

Más contenido relacionado

Destacado

Destacado (7)

Similar a Organizational Overlap on Social Networks and its Applications

Similar a Organizational Overlap on Social Networks and its Applications (20)

Más de Mitul Tiwari

Más de Mitul Tiwari (8)

Último

Último (20)

Organizational Overlap on Social Networks and its Applications

Notas del editor