Mining Email Social Networks in OSS
Christian Bird, Prem Devanbu, Alex Gourley, and Michael Gertz, Department of Computer Science
Anand Swaminathan, Graduate School of Management
University of California, Davis
Motivation
Apache Communication and Development (since 1996)
It is widely believed that OSS communities form a hierarchy. Can we use social network analysis to examine these OSS communities? Image from "Socialization in an Open Source Community", Nicolas Ducheneaut.
Social Networks. Figure: network of who dated whom in high school, courtesy of Mark Newman.
Related Work. Figure: four ways of linking two participants, Alice and Bob. Project (Python): both contribute, giving an undirected link. Bug report: one submits and the other resolves, giving an undirected link. Source file (foo.c): both commit, giving an undirected link. Mailing list: one posts and the other responds, giving a directed link.
Issues with Mailing List Analysis
Email Aliases. Example: five addresses, each redacted as [email_address] in this transcript.
Email Alias Analysis. Email addresses contain a <name, address> tuple. Often the name is empty.
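As a minimal sketch of the <name, address> tuple mentioned above, Python's standard `email.utils.parseaddr` splits a From header into exactly those two parts. The helper name `split_sender` and the sample headers are hypothetical and are not taken from the Apache archive; lowercasing the address is an assumption made here for comparison purposes, not a step the slides describe.

```python
from email.utils import parseaddr

def split_sender(from_header):
    """Return a (name, address) tuple with the address lowercased."""
    name, address = parseaddr(from_header)
    return name.strip(), address.lower()

# Hypothetical headers, invented for illustration.
headers = [
    "Alice Example <Alice@Example.org>",
    "alice@example.org",  # no display name, so the name part is empty
]
senders = [split_sender(h) for h in headers]
```

Both headers yield the same address, but only the first carries a name, which is the situation that makes alias resolution necessary.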
Alias Results
Creating the Email Social Network
Intro to Social Network Metrics
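The post/respond relationship described under Related Work can be sketched roughly as follows. This is an illustration under stated assumptions, not the authors' implementation: the message tuples, the function names `build_links` and `filter_links`, and the direction of the link (from responder to original poster) are all assumptions, since the transcript does not say which way the arrow points.

```python
from collections import Counter

# Hypothetical messages: (message id, sender, id of the message replied to or None).
messages = [
    ("m1", "alice", None),
    ("m2", "bob", "m1"),
    ("m3", "carol", "m1"),
    ("m4", "alice", "m2"),
    ("m5", "bob", "m4"),
    ("m6", "bob", "m5"),  # a reply to one's own message, ignored below
]

def build_links(messages):
    """Count directed (responder, poster) links from reply relationships."""
    author = {msg_id: sender for msg_id, sender, _ in messages}
    links = Counter()
    for _, responder, parent in messages:
        poster = author.get(parent)  # None for thread starters or unknown parents
        if poster is not None and poster != responder:
            links[(responder, poster)] += 1
    return links

def filter_links(links, min_messages):
    """Keep only links backed by at least min_messages replies."""
    return {pair: n for pair, n in links.items() if n >= min_messages}

links = build_links(messages)
strong_links = filter_links(links, 2)
```

A threshold such as `min_messages` is what a visualization needs when it only draws links above a given message count.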
Example. Figure: a network of 12 numbered nodes, with one node labeled high out-degree, one high betweenness, and one high in-degree.
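To make the three labels concrete, here is a small sketch that computes in-degree, out-degree, and betweenness on a toy directed graph. The graph is hypothetical and is not the 12-node network from the slide. Betweenness is computed with Brandes' algorithm on an unweighted graph and left unnormalized; the deck's own normalization is not preserved in the transcript.

```python
from collections import deque

def degrees(edges):
    """Return (in_degree, out_degree) dicts counting distinct neighbours."""
    nodes = {n for e in edges for n in e}
    in_deg = {n: 0 for n in nodes}
    out_deg = {n: 0 for n in nodes}
    for src, dst in set(edges):
        out_deg[src] += 1
        in_deg[dst] += 1
    return in_deg, out_deg

def betweenness(edges):
    """Unnormalized betweenness on a directed, unweighted graph (Brandes)."""
    nodes = sorted({n for e in edges for n in e})
    succ = {n: [] for n in nodes}
    for src, dst in set(edges):
        succ[src].append(dst)
    score = {n: 0.0 for n in nodes}
    for s in nodes:
        # Breadth-first search from s, counting shortest paths in sigma.
        sigma = {n: 0 for n in nodes}
        dist = {n: -1 for n in nodes}
        preds = {n: [] for n in nodes}
        sigma[s], dist[s] = 1, 0
        order, queue = [], deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in succ[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Walk back from the farthest vertices, accumulating dependencies.
        delta = {n: 0.0 for n in nodes}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    return score

# Hypothetical graph: a and b point to c, and c points to d and e.
edges = [("a", "c"), ("b", "c"), ("c", "d"), ("c", "e")]
in_deg, out_deg = degrees(edges)
between = betweenness(edges)
```

In this toy graph every path from `a` or `b` to `d` or `e` runs through `c`, so `c` is the only node with nonzero betweenness.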
Betweenness more formally (for a given vertex i)
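The slide's own formula is not preserved in this transcript. For reference, the standard definition of betweenness centrality for a vertex $i$ is given below; the symbols $\sigma_{st}$ and $\sigma_{st}(i)$ are the usual textbook notation and may differ from what the authors used, and any normalization they applied is unknown.

```latex
B(i) = \sum_{\substack{s \neq i \neq t \\ s \neq t}} \frac{\sigma_{st}(i)}{\sigma_{st}}
```

Here $\sigma_{st}$ is the number of shortest paths from $s$ to $t$, and $\sigma_{st}(i)$ is the number of those paths that pass through $i$. Pairs with no path from $s$ to $t$ contribute nothing to the sum.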
Everybody likes a pretty picture! This is the social network of some of the most active participants on the Apache developer mailing list.  Each link indicates at least 150 messages between participants. Ryan Bloom has high betweenness in this network.  Of the participants shown, he has the highest number of source file commits.
The distributions of in-degree and out-degree both exhibit a power-law character.
Status of Developers vs. Non-Developers (largest difference is in betweenness)

              Non-Developer  Developer
In-Degree     0.000367       0.00794
Out-Degree    0.000451       0.00666
Betweenness   0.000140       0.0114
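The claim that betweenness shows the largest difference can be checked with simple arithmetic on the slide's six numbers. The sketch below assumes that the smaller value in each pair belongs to non-developers and the larger to developers, and it compares the groups by ratio; the slides do not say whether "difference" means a ratio or an absolute gap.

```python
# Values copied from the slide: metric -> (non-developer, developer).
# The column assignment is an assumption based on the flattened table.
status = {
    "in-degree": (0.000367, 0.00794),
    "out-degree": (0.000451, 0.00666),
    "betweenness": (0.000140, 0.0114),
}

# Ratio of the developer value to the non-developer value for each metric.
ratios = {metric: dev / non_dev for metric, (non_dev, dev) in status.items()}
largest = max(ratios, key=ratios.get)

for metric, ratio in ratios.items():
    print(f"{metric}: developers are about {ratio:.1f}x non-developers")
```

Betweenness also has the largest absolute gap among the three metrics, so the conclusion does not depend on which comparison is used.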
Correlation between communication and development

              Betweenness  In-degree  Out-degree  Doc Changes  Src Changes  Changes
Betweenness   1            0.917      0.955       0.327        0.757        0.553
In-degree                  1          0.971       0.263        0.679        0.474
Out-degree                            1           0.308        0.712        0.520
Doc Changes                                       1            0.514        0.932
Src Changes                                                    1            0.789
Changes                                                                     1
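The transcript does not state which correlation coefficient produced these values. As a rough sketch of how such a number could be computed for one pair of measures, the code below implements both Pearson correlation and Spearman rank correlation in plain Python. The per-person data are hypothetical and are not the Apache figures.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

def ranks(values):
    """Average ranks (1-based), so tied values share the same rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        avg = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            result[order[k]] = avg
        pos = end + 1
    return result

def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))

# Hypothetical per-person data, invented for illustration.
out_degree = [1, 2, 3, 4, 5]
src_changes = [2, 3, 10, 40, 200]

r_pearson = pearson(out_degree, src_changes)
r_spearman = spearman(out_degree, src_changes)
```

With heavy-tailed data such as commit counts, a rank-based coefficient is less sensitive to a few very active people than Pearson correlation, which is one reason to compute both.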
Observations from the network
Topics of future research
Average In-Degree. Figure: plot of average in-degree (y-axis) over months (x-axis).

Editor's notes

  1. Get names right Me – project – joint work with