SlideShare una empresa de Scribd logo
1 de 21
Descargar para leer sin conexión
Metrics standardization
Dario Taraborelli • Aaron Halfaker
Wikimedia Research and Data showcase
March 2014
summer 2013
cohort-level metrics
cohort-level metrics project-level metrics
project-level metrics
project-level metrics
ENWIKI New Editors / day 1D: 21% 30D: 18% YTD:
20%
Editor engagement vital signs
key performance indicators for user engagement, community and content growth
aggregated daily / weekly / monthly
for every single Wikimedia project
https://www.mediawiki.org/wiki/Analytics/Epics/Editor_Engagement_Vital_Signs
02/01: 1240
•
summer 2014
Key metrics
New users Community Content Curation
Newly registered users
New editors
Productive new editors
Surviving new editors
...
Editors
Active editors
Very active editors
IP editors
Bots
Page creators
...
Edits
Bot edits
Uploads
Pages
...
Page deletions
Reverts
...
https://meta.wikimedia.org/wiki/Research:Metrics_standardization
Relevant
Measure quantities that describe important phenomena
Replicable
Make research easily reproducible and verifiable
Transparent
Provide formal specifications, remove ambiguity
Consistent
Replace proprietary, ad-hoc metric definitions; compare apples to apples
Robust
Make metrics replicable via multiple data sources at any point in time
Granular
Computable at different time scales
Rationale
Anatomy of a metric 1. specification
Anatomy of a metric 2. visualizations
registration
time
Activation Trial Survival
Anatomy of a metric 2. visualizations
New editor
Productive new editor
Surviving new editor
Anatomy of a metric 3. discussion
Anatomy of a metric 4. sensitivity analysis
Sensitivity analysis
https://meta.wikimedia.org/wiki/Research:Productive_new_editor
Does new editor productivity vary when we measure it over the first day or the first week?
Sensitivity analysis
https://meta.wikimedia.org/wiki/Research:New_editor
Does it really matter to limit new editor activation to main namespace edits only?
Sensitivity analysis
https://meta.wikimedia.org/wiki/Research:Surviving_new_editor
Does the length of the trial and survival period affect the measurement of new editor survival?
Why does this matter at all?
1. Data exploration
“Newly registered users on German and Dutch Wikipedia have a higher activation
rate than newbies on English Wikipedia”
“Spanish Wikipedia adds every day twice as many new editors than German
Wikipedia, despite having only half its new user activation rate”
2. Natural experiments
“A change in abuse filter rules on the Italian Wikipedia significantly increased new
editor survival”
Metric specification
Sensitivity analysis
Parameter recommendation
Release
Evaluation
Evaluation
Metric specification
Sensitivity analysis
Parameter recommendation
Release
Evaluation
Evaluation
feedback
Questions?
dario@wikimedia.org
ahalfaker@wikimedia.org
Read more
https://meta.wikimedia.org/wiki/Research:Metrics_standardization
Image credits
W.Wood (1839) Index Entomologicus
http://dx.doi.org/10.5962/bhl.title.12503

Más contenido relacionado

Similar a Metrics standardization. Wikimedia Research & Data Showcase, March 2014

Water Wiki, a global platform for water community
Water Wiki, a global platform for water communityWater Wiki, a global platform for water community
Water Wiki, a global platform for water community
XWiki
 
Introduction to Implementing the Balanced Value Impact Model - Workshop for N...
Introduction to Implementing the Balanced Value Impact Model - Workshop for N...Introduction to Implementing the Balanced Value Impact Model - Workshop for N...
Introduction to Implementing the Balanced Value Impact Model - Workshop for N...
Simon Tanner
 

Similar a Metrics standardization. Wikimedia Research & Data Showcase, March 2014 (20)

Measuring community health: Vital Signs for Wikimedia projects (Wikimania 2014)
Measuring community health: Vital Signs for Wikimedia projects (Wikimania 2014)Measuring community health: Vital Signs for Wikimedia projects (Wikimania 2014)
Measuring community health: Vital Signs for Wikimedia projects (Wikimania 2014)
 
Building Real Time, Open-Source Tools for Wikipedia
Building Real Time, Open-Source Tools for WikipediaBuilding Real Time, Open-Source Tools for Wikipedia
Building Real Time, Open-Source Tools for Wikipedia
 
Water Wiki, a global platform for water community
Water Wiki, a global platform for water communityWater Wiki, a global platform for water community
Water Wiki, a global platform for water community
 
New from BookNet Canada: BNC BiblioShare
New from BookNet Canada: BNC BiblioShareNew from BookNet Canada: BNC BiblioShare
New from BookNet Canada: BNC BiblioShare
 
Demonstrating the value of communications
Demonstrating the value of communicationsDemonstrating the value of communications
Demonstrating the value of communications
 
A bird's eye view of Wikipedia's new editor activation
A bird's eye view of Wikipedia's new editor activationA bird's eye view of Wikipedia's new editor activation
A bird's eye view of Wikipedia's new editor activation
 
Crowd control - how to build effective community monitoring for crowdsourced ...
Crowd control - how to build effective community monitoring for crowdsourced ...Crowd control - how to build effective community monitoring for crowdsourced ...
Crowd control - how to build effective community monitoring for crowdsourced ...
 
Deep Dive Microsoft Viva Insights - Collabdays Bletchley Park 2023
Deep Dive Microsoft Viva Insights - Collabdays Bletchley Park 2023Deep Dive Microsoft Viva Insights - Collabdays Bletchley Park 2023
Deep Dive Microsoft Viva Insights - Collabdays Bletchley Park 2023
 
Big Data World Africa 2012 - Mike Wronski Presentation
Big Data World Africa 2012 - Mike Wronski PresentationBig Data World Africa 2012 - Mike Wronski Presentation
Big Data World Africa 2012 - Mike Wronski Presentation
 
A tool for librarians to select metrics across the research lifecycle
A tool for librarians to select metrics across the research lifecycleA tool for librarians to select metrics across the research lifecycle
A tool for librarians to select metrics across the research lifecycle
 
Introduction to Implementing the Balanced Value Impact Model - Workshop for N...
Introduction to Implementing the Balanced Value Impact Model - Workshop for N...Introduction to Implementing the Balanced Value Impact Model - Workshop for N...
Introduction to Implementing the Balanced Value Impact Model - Workshop for N...
 
The UserMetrics API. Measuring participation in Wikimedia projects
The UserMetrics API. Measuring participation in Wikimedia projectsThe UserMetrics API. Measuring participation in Wikimedia projects
The UserMetrics API. Measuring participation in Wikimedia projects
 
Redbooks Wiki
Redbooks WikiRedbooks Wiki
Redbooks Wiki
 
LinkedIn Content Story
LinkedIn Content StoryLinkedIn Content Story
LinkedIn Content Story
 
GFAR webinar on "innovative annual reports"
GFAR webinar on "innovative annual reports"GFAR webinar on "innovative annual reports"
GFAR webinar on "innovative annual reports"
 
JahiaOne - Jahia, the global website factory and "Ville de Nantes" case study...
JahiaOne - Jahia, the global website factory and "Ville de Nantes" case study...JahiaOne - Jahia, the global website factory and "Ville de Nantes" case study...
JahiaOne - Jahia, the global website factory and "Ville de Nantes" case study...
 
SharePoint Wiki Feasibility Report (Draft) - Travis Barker.pdf
SharePoint Wiki Feasibility Report (Draft) - Travis Barker.pdfSharePoint Wiki Feasibility Report (Draft) - Travis Barker.pdf
SharePoint Wiki Feasibility Report (Draft) - Travis Barker.pdf
 
Public Relations (PR) Measurement
Public Relations (PR) MeasurementPublic Relations (PR) Measurement
Public Relations (PR) Measurement
 
The Convergence of Search Marketing and Social Media – What You Need to Know
The Convergence of Search Marketing and Social Media – What You Need to KnowThe Convergence of Search Marketing and Social Media – What You Need to Know
The Convergence of Search Marketing and Social Media – What You Need to Know
 
UKSG Conference 2017 Breakout - Crossref Event Data: tools for DIY analyses o...
UKSG Conference 2017 Breakout - Crossref Event Data: tools for DIY analyses o...UKSG Conference 2017 Breakout - Crossref Event Data: tools for DIY analyses o...
UKSG Conference 2017 Breakout - Crossref Event Data: tools for DIY analyses o...
 

Más de Dario Taraborelli

Transparency in measures of scientific impact
Transparency in measures of scientific impactTransparency in measures of scientific impact
Transparency in measures of scientific impact
Dario Taraborelli
 

Más de Dario Taraborelli (16)

Verifiable, linked open knowledge that anyone can edit
Verifiable, linked open knowledge that anyone can editVerifiable, linked open knowledge that anyone can edit
Verifiable, linked open knowledge that anyone can edit
 
Citing as a public service. Building the sum of all human citations
Citing as a public service. Building the sum of all human citationsCiting as a public service. Building the sum of all human citations
Citing as a public service. Building the sum of all human citations
 
A new research agenda for Wikimedia – Big Dive 2015
A new research agenda for Wikimedia – Big Dive 2015A new research agenda for Wikimedia – Big Dive 2015
A new research agenda for Wikimedia – Big Dive 2015
 
The sum of all human knowledge in the age of machines: A new research agenda ...
The sum of all human knowledge in the age of machines: A new research agenda ...The sum of all human knowledge in the age of machines: A new research agenda ...
The sum of all human knowledge in the age of machines: A new research agenda ...
 
Crossing the streams: Social and technical interfaces between Wikimedia and O...
Crossing the streams: Social and technical interfaces between Wikimedia and O...Crossing the streams: Social and technical interfaces between Wikimedia and O...
Crossing the streams: Social and technical interfaces between Wikimedia and O...
 
The Missing Wikipedia ads (Wikimania 2014)
The Missing Wikipedia ads (Wikimania 2014)The Missing Wikipedia ads (Wikimania 2014)
The Missing Wikipedia ads (Wikimania 2014)
 
Wikimedia 2013 traffic trends
Wikimedia 2013 traffic trendsWikimedia 2013 traffic trends
Wikimedia 2013 traffic trends
 
Descending Mount Everest. Steps towards applied Wikipedia research
Descending Mount Everest. Steps towards applied Wikipedia researchDescending Mount Everest. Steps towards applied Wikipedia research
Descending Mount Everest. Steps towards applied Wikipedia research
 
Everything You Always Wanted to Know About Cohorts (But Were Afraid to Ask)
Everything You Always Wanted to Know About Cohorts (But Were Afraid to Ask)Everything You Always Wanted to Know About Cohorts (But Were Afraid to Ask)
Everything You Always Wanted to Know About Cohorts (But Were Afraid to Ask)
 
EventLogging Workshop
EventLogging WorkshopEventLogging Workshop
EventLogging Workshop
 
Experts as contributors, contributors as experts. Bridging the gap between Wi...
Experts as contributors, contributors as experts. Bridging the gap between Wi...Experts as contributors, contributors as experts. Bridging the gap between Wi...
Experts as contributors, contributors as experts. Bridging the gap between Wi...
 
Microtasks and new editor engagement
Microtasks and new editor engagementMicrotasks and new editor engagement
Microtasks and new editor engagement
 
Venues for expert participation in Wikipedia
Venues for expert participation in WikipediaVenues for expert participation in Wikipedia
Venues for expert participation in Wikipedia
 
New editors not welcome: When Wikipedia articles trend
New editors not welcome: When Wikipedia articles trendNew editors not welcome: When Wikipedia articles trend
New editors not welcome: When Wikipedia articles trend
 
Transparency in measures of scientific impact
Transparency in measures of scientific impactTransparency in measures of scientific impact
Transparency in measures of scientific impact
 
Measuring Wiki viability
Measuring Wiki viabilityMeasuring Wiki viability
Measuring Wiki viability
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 

Metrics standardization. Wikimedia Research & Data Showcase, March 2014