  data  and
The English Wikipedia: 10 years of data (as of September 2011)
User Funnel: English Wikipedia per month
91% male
College educated
Average age: 32
Predominantly from North America and Western Europe
Most Edited Wikipedia Article? George W. Bush
Most Edited Pages

Total Edits   Total Unique Editors   Article
43,648        13,783                 George W. Bush
33,534        4,306                  Barack Obama (discussion)
30,567        3,817                  List of World Wrestling Entertainment employees
27,433        8,242                  United States
25,308        2,609                  Global warming (discussion)
25,224        1,821                  Sarah Palin (discussion)
23,241        5,672                  Michael Jackson
21,768        5,933                  Jesus
21,501        4,647                  George W. Bush (discussion)
21,343        753                    Gaza War (discussion)
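One way to read the table above is to divide total edits by unique editors, which gives a rough sense of how concentrated the editing on each page is. The short Python sketch below does only that arithmetic on the figures from the slide. The names `most_edited` and `edits_per_editor` are illustrative and not part of the original presentation, and the ratio is an assumption about what might be interesting, not a metric the speaker reported.

```python
# Rows copied from the "Most Edited Pages" slide:
# (total edits, total unique editors, page title)
most_edited = [
    (43648, 13783, "George W. Bush"),
    (33534, 4306, "Barack Obama (discussion)"),
    (30567, 3817, "List of World Wrestling Entertainment employees"),
    (27433, 8242, "United States"),
    (25308, 2609, "Global warming (discussion)"),
    (25224, 1821, "Sarah Palin (discussion)"),
    (23241, 5672, "Michael Jackson"),
    (21768, 5933, "Jesus"),
    (21501, 4647, "George W. Bush (discussion)"),
    (21343, 753, "Gaza War (discussion)"),
]


def edits_per_editor(rows):
    """Return (page, edits / unique editors) pairs, highest ratio first."""
    ratios = [(page, edits / editors) for edits, editors, page in rows]
    return sorted(ratios, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for page, ratio in edits_per_editor(most_edited):
        print(f"{page}: {ratio:.1f} edits per unique editor")
```

On these figures the discussion pages tend to show the highest ratios, meaning a comparatively small group of editors accounts for many revisions, while the George W. Bush article has the lowest ratio of the ten.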
Why do editors leave Wikipedia?
70% of new users receive their first message from a bot
How we use data
Get Involved!

Wikimedia presentation data mining meetup pub

Editor's Notes

  1. Intro: who I am. Supposed to talk about how data is used at the Foundation to support our cause.
  2. What’s our cause? Not just an encyclopedia
  3. The way I’m going to talk about it is using this dichotomy. 2011 fundraising: 29.5m. 2011 budget: 24m, 17m of which came from the online fundraiser.
  4. 40M words, 160k pages. 50-60x larger than Britannica.
  5. http://commons.wikimedia.org/wiki/File:Editor_Survey_Report_-_April_2011.pdf http://stats.wikimedia.org/EN/TablesWikipediaEN.htm About half of all edits are made by 1,400 people.
  6. http://toolserver.org/~daniel/WikiSense/Contributors.php?wikilang=en&wikifam=.wikipedia.org&grouped=on&page=Global_warming http://en.wikipedia.org/wiki/Wikipedia:Database_reports/Pages_with_the_most_revisions
  7. Removed article, wikipedia, information, page
  8. What are we not doing with data that could support your research/work?