SlideShare una empresa de Scribd logo
1 de 86
Descargar para leer sin conexión
Ed H. Chi

Area Manager and Principal Scientist
Augmented Social Cognition Area
Palo Alto Research Center
    Cognition:	
  the	
  ability	
  to	
  remember,	
  think,	
  and	
  reason;	
  the	
  faculty	
  of	
  
      knowing.	
  
     Social	
  Cognition:	
  the	
  ability	
  of	
  a	
  group	
  to	
  remember,	
  think,	
  and	
  
      reason;	
  the	
  construction	
  of	
  knowledge	
  structures	
  by	
  a	
  group.	
  
        –  (not	
  quite	
  the	
  same	
  as	
  in	
  the	
  branch	
  of	
  psychology	
  that	
  studies	
  the	
  
           cognitive	
  processes	
  involved	
  in	
  social	
  interaction,	
  though	
  included)	
  
     Augmented	
  Social	
  Cognition:	
  Supported	
  by	
  systems,	
  the	
  
      enhancement	
  	
  of	
  the	
  ability	
  of	
  a	
  group	
  to	
  remember,	
  think,	
  and	
  
      reason;	
  the	
  system-­‐supported	
  construction	
  of	
  knowledge	
  
      structures	
  by	
  a	
  group.	
  	
  

 Citation:	
  Chi,	
  IEEE	
  Computer,	
  Sept	
  2008	
  



2010-02-22                                    Ed H. Chi ASC Overview                                                     2
                                                                                                                             2
Characteriza*on	
         Models	
  




                    Evalua*ons	
           Prototypes	
  



    Characterize activity on social systems with analytics
    Model interaction social and community dynamics and variables
    Prototype tools to increase benefits or reduce cost
    Evaluate prototypes via Living Laboratories with real users

                                                                 3
2010-02-22                  Ed H. Chi ASC Overview                   3
    Characterization and Modeling:
     –  Community Analytics and Wikipedia Dynamics
    Prototyping:
     –  Social Transparency thru WikiDashboard
    Evaluation:
     –  Evaluations using Amazon Mechanical Turk




                                                     4
2010-02-22           Ed H. Chi ASC Overview              4
Characteriza*on	
        Models	
  




 Evalua*ons	
         Prototypes	
  
Conflict/Coordination	
  Effects	
  in	
  Wikipedia	
  




2010-02-22           Ed H. Chi ASC Overview            6
Mediator	
  Pattern	
  -­‐	
  Terri	
  Schiavo	
  
                                          Anonymous (vandals/
                                          spammers)




             Sympathetic to
             husband


                                                  Mediators




                              Sympathetic to parents


2010-02-22               Ed H. Chi ASC Overview                 7
Measure	
  of	
  controversy	
  
•       Controversial”	
  tag	
  



• Use	
  #	
  revisions	
  tagged	
  controversial	
  




2010-02-22              Ed H. Chi ASC Overview           8
Page	
  metrics	
  
•  Possible	
  metrics	
  for	
  identifying	
  conflict	
  in	
  articles	
  

                   Metric type                    Page Type
                     Revisions (#)             Article, talk, article/talk
                      Page length              Article, talk, article/talk
                     Unique editors            Article, talk, article/talk
               Unique editors / revisions            Article, talk
                Links from other articles            Article, talk
                 Links to other articles             Article, talk
                Anonymous edits (#, %)               Article, talk
               Administrator edits (#, %)            Article, talk
                   Minor edits (#, %)                Article, talk
                 Reverts (#, by unique
                                                         Article
                       editors)

2010-02-22                      Ed H. Chi ASC Overview                          9
Performance:	
  Cross-­‐validation	
  
• 5x	
  cross-­‐validation,	
  R2	
  =	
  0.897	
  




2010-02-22               Ed H. Chi ASC Overview       10
Determinants	
  of	
  conflict	
  
             Highly weighted features of conflict model:

                  Revisions	
  (talk)	
  
                  Minor	
  edits	
  (talk)	
  
                  Unique	
  editors	
  (talk)	
  
                  Revisions	
  (article)	
  
                  Unique	
  editors	
  (article)	
  
                  Anonymous	
  edits	
  (talk)	
  
                  Anonymous	
  edits	
  (article)	
  

2010-02-22                 Ed H. Chi ASC Overview          11
Number of Articles (Log Scale)




     http://en.wikipedia.org/wiki/Wikipedia:Modelling_Wikipedia’s_growth
2010-02-22                            Ed H. Chi ASC Overview               12
                                                                                12
2010-02-22   Ed H. Chi ASC Overview   13
                                           13
Monthly Edits




2010-02-22     Ed H. Chi ASC Overview   14
                                             14
Monthly Edits




2010-02-22     Ed H. Chi ASC Overview   15
                                             15
Monthly Active Editors




2010-02-22        Ed H. Chi ASC Overview   16
                                                16
Characteriza*on	
        Models	
  




 Evalua*ons	
         Prototypes	
  
2010-02-22   Ed H. Chi ASC Overview   18
                                           18
    Edits beget edits
          –  more number of previous edits, more number of new edits

         Growth rate depends on current population N
         r = growth rate of the population


                                                    N(t) = N 0 ⋅ e rt
                 dN
                    = r⋅ N
                 dt
             Growth rate      Current
            of population           €
                             population

€   2010-02-22                  Ed H. Chi ASC Overview                  19
                                                                             19
    Ecological population growth model
      –  r, growth rate of the population
      –  K, carrying capacity (due to resource limitation)
                                            4000000
                                                                        K
                                            3500000
                                            3000000

dN              N              Population
                                            2500000
   = r ⋅ N ⋅ (1− )                          2000000
dt              K                           1500000
                                            1000000
                                            500000
                                                 0
                                                  2000   2002   2004          2006   2008         2010
                                                                       Year


 2010-02-22                Ed H. Chi ASC Overview                                       20
                                                                                             20
    Follows a logistic growth curve

                                                          New Article




  http://en.wikipedia.org/wiki/Wikipedia:Modelling_Wikipedia’s_growth
2010-02-22                             Ed H. Chi ASC Overview           21
                                                                             21
     Carrying Capacity as a function of time.


                                                 K(t)
     Population




       2000       2001   2002   2003   2004   2005      2006   2007   2008   2009   2010
                                              Year


2010-02-22                             Ed H. Chi ASC Overview                              22
                                                                                                22
     Biological system
       –  Competition increases as
          population hit the limits of the
          ecology
       –  Advantage go to members of the
          population that have competitive
          dominance over others
     Analogy
       –  Limited opportunities to make
          novel contributions
       –  Increased patterns of conflict and
          dominance


     2010-02-22               Ed H. Chi ASC Overview   23
                                                            23
2010-02-22   Ed H. Chi ASC Overview   24
                                           24
    Highly skewed contribution pattern
     –  Top 3% users contribute 50%+ edits
     –  A lot of single-edit users
    Five Editor Classes
     –  Monthly edit count
     –  No bot, vandalism included in the analysis
     –    1000+: editors who made more than 1000 edits in that month
     –    100-999
     –    10-99
     –    2-9
     –    1

2010-02-22                   Ed H. Chi ASC Overview                    25
                                                                            25
Monthly Edits by Editor Class (in thousands)




2010-02-22                Ed H. Chi ASC Overview            26
                                                                 26
2010-02-22   Ed H. Chi ASC Overview   27
                                           27
Monthly Ratio of Reverted Edits




2010-02-22               Ed H. Chi ASC Overview   28
                                                       28
    Two interpretations:
     –  Overall increased resistance
        from the Wikipedia community
        to changing content
     –  Disparity of treatment of edits
             »  Occasional editors have been
                reverted in a higher rate


    Example of increased
     patterns of conflict and
     dominance

                                                  Photo: http://www.flickr.com/photos/efan78/3619921561/


2010-02-22                       Ed H. Chi ASC Overview                                     29
                                                                                                 29
2010-02-22   Ed H. Chi ASC Overview   30
                                           30
Bongwon Suh, Gregorio Convertino, Ed H. Chi, Peter Pirolli. WikiSym 2009




2010-02-22              Ed H. Chi ASC Overview                                          31
                                                                                             31
Characteriza*on	
        Models	
  




 Evalua*ons	
         Prototypes	
  
“Wikipedia is the best thing ever. Anyone in the world can write
anything they want about any subject, so you know you’re getting the
                     best possible information.”
                      – Steve Carell, The Office

2010-02-22               Ed H. Chi ASC Overview                  33
                                                                      33
    Content in Wikipedia can be added or
     changed by anyone
    Because of this, WP has become one of the
     most important resources on the web
     –  Hundreds of thousands of contributors
     –  Over 2 million articles
     –  5th most used websites (Alexa.com)
    Also because of this, is viewed with
     skepticism by readers, press, researchers

2010-02-22            Ed H. Chi ASC Overview     34
                                                      34
2010-02-22   Ed H. Chi ASC Overview   35
                                           35
Nothing


2010-02-22   Ed H. Chi ASC Overview   36
                                           36
“Wikipedia, just by its nature, is
             impossible to trust completely. I don't
             think this can necessarily be
             changed.”




2010-02-22               Ed H. Chi ASC Overview        37
                                                            37
    Risks with using Wikipedia
     –    Accuracy of content
     –    Motives of editors
     –    Expertise of editors
     –    Stability of article
     –    Coverage of topics
     –    Quality of cited information

              Insufficient information to evaluate
                         trustworthiness

2010-02-22               Ed H. Chi ASC Overview      38
                                                          38
     Transparency of social dynamics can reduce conflict and coordination
        issues
       Attribution encourages contribution
         –  WikiDashboard: Social dashboard for wikis
         –  Prototype system: http://wikidashboard.parc.com



       Visualization for every wiki page
        showing edit history timeline and
        top individual editors

       Can drill down into activity history
        for specific editors and view edits
        to see changes side-by-side

Citation: Suh et al.
CHI 2008 Proceedings
                                                                               39
  2010-02-22                        Ed H. Chi ASC Overview                          39
2010-02-22   Ed H. Chi ASC Overview   40
                                           40
2010-02-22   Ed H. Chi ASC Overview   41
Characteriza*on	
        Models	
  




 Evalua*ons	
         Prototypes	
  
Surfacing information

•  Numerous studies mining Wikipedia revision
   history to surface trust-relevant information
   –  Adler & Alfaro, 2007; Dondio et al., 2006; Kittur et al., 2007;
      Viegas et al., 2004; Zeng et al., 2006




                                          Suh, Chi, Kittur, & Pendleton, CHI2008


•  But how much impact can this have on user
   perceptions in a system which is inherently
   mutable?
                                                                              43
Hypotheses

1.  Visualization will impact perceptions of trust
2.  Compared to baseline, visualization will
    impact trust both positively and negatively
3.  Visualization should have most impact when
    high uncertainty about article
   •    Low quality
   •    High controversy




                                                     44
Design

        •  3 x 2 x 2 design


                          Controversial    Uncontroversial


Visualization              Abortion          Volcano
                                                             High quality
•    High stability     George Bush           Shark
•    Low stability
•    Baseline (none)   Pro-life feminism        Disk
                                           defragmenter      Low quality
                       Scientology and
                          celebrities        Beeswax




                                                                           45
Example: High trust visualization




                                    46
Example: Low trust visualization




                                   47
Summary info

          •  % from anonymous
             users




                                48
Summary info

          •  % from anonymous
             users
          •  Last change by
             anonymous or
             established user




                                49
Summary info

          •  % from anonymous
             users
          •  Last change by
             anonymous or
             established user
          •  Stability of words




                                  50
Graph

•  Instability




                         51
Graph

•  Instability
•  Revert activity




                             52
Method

•  Users recruited via Amazon’s Mechanical Turk
   –    253 participants
   –    673 ratings
   –    7 cents per rating
   –    Kittur, Chi, & Suh, CHI 2008: Crowdsourcing user studies
•  To ensure salience and valid answers, participants
   answered:
   –    In what time period was this article the least stable?
   –    How stable has this article been for the last month?
   –    Who was the last editor?
   –    How trustworthy do you consider the above editor?




                                                                 53
Results




main effects of quality and controversy:
• high-quality articles > low-quality articles (F(1, 425) = 25.37, p < .001)
• uncontroversial articles > controversial articles (F(1, 425) = 4.69, p = .
031)

                                                                               54
Results




interaction effects of quality and controversy:
• high quality articles were rated equally trustworthy whether controversial
or not, while
• low quality articles were rated lower when they were controversial than
when they were uncontroversial service.
                                                                               55
Results

1.  Significant effect of
    visualization
   –  High > low, p < .001
2.  Viz has both positive and
    negative effects
   –  High > baseline, p < .001
   –  Low > baseline, p < .01
3.  No interaction of
    visualization with either
    quality or controversy
   –  Robust across conditions



                                       56
Results

1.  Significant effect of
    visualization
   –  High > low, p < .001
2.  Viz has both positive and
    negative effects
   –  High > baseline, p < .001
   –  Low > baseline, p < .01
3.  No interaction of
    visualization with either
    quality or controversy
   –  Robust across conditions



                                       57
Results

1.  Significant effect of
    visualization
   –  High > low, p < .001
2.  Viz has both positive and
    negative effects
   –  High > baseline, p < .001
   –  Low > baseline, p < .01
3.  No interaction effect of
    visualization with either
    quality or controversy
   –  Robust across conditions



                                       58
Characteriza*on	
        Models	
  




           Methodology

 Evalua*ons	
         Prototypes	
  
User studies

•  Getting input from users is important in HCI
   –    surveys
   –    rapid prototyping
   –    usability tests
   –    cognitive walkthroughs
   –    performance measures
   –    quantitative ratings
User studies

•  Getting input from users is expensive
   –  Time costs
   –  Monetary costs
•  Often have to trade off costs with sample size
Online solutions

•    Online user surveys
•    Remote usability testing
•    Online experiments
•    But still have difficulties
     –  Rely on practitioner for recruiting participants
     –  Limited pool of participants
Crowdsourcing

•  Make tasks available for anyone online to complete
•  Quickly access a large user pool, collect data, and
   compensate users

•  Experiences at PARC:
    –  CSL UbiComp group
    –  ISL’s NLTT group
Crowdsourcing

•  Make tasks available for anyone online to complete
•  Quickly access a large user pool, collect data, and
   compensate users
•  Example: NASA Clickworkers
    –  100k+ volunteers identified Mars craters from
       space photographs
    –  Aggregate results “virtually indistinguishable” from
       expert geologists

                                                     experts

                                                    crowds

                http://clickworkers.arc.nasa.gov
Amazon’s Mechanical turk

•  Market for “human intelligence tasks”
•  Typically short, objective tasks
   –  Tag an image
   –  Find a webpage
   –  Evaluate relevance of search results
•  Users complete for a few pennies each
Example task
Using Mechanical Turk for user studies

                       Traditional user        Mechanical Turk
                           studies
Task complexity           Complex                   Simple
                           Long                     Short
Task subjectivity         Subjective               Objective
                          Opinions                 Verifiable
User information    Targeted demographics   Unknown demographics
                       High interactivity     Limited interactivity


    Can Mechanical Turk be usefully used for user studies?
Task

•  Assess quality of Wikipedia articles
•  Started with ratings from expert Wikipedians
    –  14 articles (e.g., “Germany”, “Noam Chomsky”)
    –  7-point scale
•  Can we get matching ratings with mechanical turk?
Experiment 1

•  Rate articles on 7-point scales:
   –  Well written
   –  Factually accurate
   –  Overall quality
•  Free-text input:
   –  What improvements does the article need?
•  Paid $0.05 each
Experiment 1: Good news

•  58 users made 210 ratings (15 per article)
   –  $10.50 total
•  Fast results
   –  44% within a day, 100% within two days
   –  Many completed within minutes
Experiment 1: Bad news

•  Correlation between turkers and Wikipedians
   only marginally significant (r=.50, p=.07)
•  Worse, 59% potentially invalid responses
                         Experiment 1
           Invalid           49%
         comments
           <1 min            31%
         responses

•  Nearly 75% of these done by only 8 users
Not a good start
•  Summary so far:
   –  Only marginal correlation with experts.
   –  Heavy gaming of the system by a minority
•  Possible Response:
   –  Can make sure these gamers are not rewarded
   –  Ban them from doing your hits in the future
   –  Create a reputation system [Delores Lab]
•  Can we change how we collect user input ?
Design changes

•  Use verifiable questions to signal monitoring
   –  “How many sections does the article have?”
   –  “How many images does the article have?”
   –  “How many references does the article have?”
Design changes

•  Use verifiable questions to signal monitoring
•  Make malicious answers as high cost as
   good-faith answers
   –  “Provide 4-6 keywords that would give someone a
      good summary of the contents of the article”
Design changes

•  Use verifiable questions to signal monitoring
•  Make malicious answers as high cost as
   good-faith answers
•  Make verifiable answers useful for completing
   task
   –  Used tasks similar to how Wikipedians described
      evaluating quality (organization, presentation,
      references)
Design changes

•  Use verifiable questions to signal monitoring
•  Make malicious answers as high cost as
   good-faith answers
•  Make verifiable answers useful for completing
   task
•  Put verifiable tasks before subjective
   responses
   –  First do objective tasks and summarization
   –  Only then evaluate subjective quality
   –  Ecological validity?
Experiment 2: Results

   •  124 users provided 277 ratings (~20 per article)
   •  Significant positive correlation with Wikipedians (r=.
      66, p=.01)

   •  Smaller proportion malicious responses
   •  Increased time on task

                      Experiment 1        Experiment 2
  Invalid                49%                  3%
comments
  <1 min                 31%                  7%
responses
Median time              1:30                4:06
Generalizing to other user studies

•  Combine objective and subjective questions
   –  Rapid prototyping: ask verifiable questions about
      content/design of prototype before subjective
      evaluation
   –  User surveys: ask common-knowledge questions
      before asking for opinions
Limitations of mechanical turk

•  No control of users’ environment
   –  Potential for different browsers, physical
      distractions
   –  General problem with online experimentation
•  Not designed for user studies
   –  Difficult to do between-subjects design
   –  Involves some programming
•  Users
   –  Uncertainty about user demographics, expertise
Conclusion

•  Mechanical Turk offers the practitioner a way to
   access a large user pool and quickly collect data at
   low cost
•  Good results require careful task design
  1.  Use verifiable questions to signal monitoring
  2.  Make malicious answers as high cost as good-faith
      answers
  3.  Make verifiable answers useful for completing task
  4.  Put verifiable tasks before subjective responses
Ed	
  H.	
  Chi	
  (manager,	
  PS)	
  
Peter	
  Pirolli	
  (RF)	
  
Lichan	
  Hong	
  
Bongwon	
  Suh	
  
Les	
  Nelson	
  
Rowan	
  Nairn	
  	
  
Gregorio	
  Convertino	
  	
  

Interns/Collaborators:	
  	
  
Sanjay	
  Kairam,	
  Jilin	
  Chen	
  (UMinn),	
  Michael	
  Bernstein	
  (MIT)	
  
                         http://asc-­‐parc.blogspot.com	
  
                                                       	
  
                            Ed H. Chi ASC Overview             81
2010-02-22
2010-02-22   Ed H. Chi ASC Overview   82
    r, growth rate                                      dN        N
                                                            = rN(1− )
    K, carrying capacity                                dt        K

             4000000
             3500000
             3000000
                                   €                                K dominates
             2500000    r dominates                                 when N K
             2000000
                       when N is small                                   N
             1500000
                             N                                        (1− ) ≈ 0
             1000000
                          (1− ) ≈ 1                                      K
             500000          K
                  0
                   2000     2002    2004          2006       2008   2010
                                           Year
2010-02-22                     Ed H. Chi ASC Overview                             83
             €                                           €
    r-Strategist
     –  Growth or exploitation
     –  Less-crowded niches / produce many offspring


    K-Strategist
     –  Conservation
     –  Strong competitors in crowded niches / invest more heavily in
        fewer offspring


    Evolution cycle
     –  Resilience of an ecological system
     –  Gunderson & Holling 2001
2010-02-22               Ed H. Chi ASC Overview                   84
    Exponential growth model                               dN
     –  Growth rate depends on the current N
                                                               = r*N
                                                            dt
    Ecological population growth model
     –  r, growth rate of the population
     –  K, carrying capacity (due to resource limitation)
                                            €
                         dN        N
                            = rN(1− )
                         dt        K



2010-02-22
             €            Ed H. Chi ASC Overview                 85
    People-ware
      –  Growing resistance to changing content
      –  Coordination cost and bureaucracy
    Knowledge-ware: Availability of easy topics to write about
    Tool-ware: Quality of tools used by editors and admins




        http://www.aerostich.com/
        http://www.mikestreetmedia.co.uk/blog/wp-content/uploads/2009/01/knowledge.jpg
2010-02-22                            Ed H. Chi ASC Overview                             86
        http://youropenbook.agitprop.co.uk/growing.php?p=2

Más contenido relacionado

Destacado

Style Guide for Short-Film 'The Ordinary'
Style Guide for Short-Film 'The Ordinary'Style Guide for Short-Film 'The Ordinary'
Style Guide for Short-Film 'The Ordinary'
John Smith
 
Entrevista a D. Miguel, profesor de alemán en la E.O.I
Entrevista a D. Miguel,  profesor de alemán en la E.O.IEntrevista a D. Miguel,  profesor de alemán en la E.O.I
Entrevista a D. Miguel, profesor de alemán en la E.O.I
europadejahuella
 
Concurso la papa - Mentoring equipo y elevator pitch
Concurso la papa - Mentoring equipo y elevator pitchConcurso la papa - Mentoring equipo y elevator pitch
Concurso la papa - Mentoring equipo y elevator pitch
concursolapapa
 
Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...
Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...
Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...
Chih-Ju Lin
 

Destacado (17)

Alerta y valora
Alerta y valoraAlerta y valora
Alerta y valora
 
Informe ejecutivo rendicion
Informe ejecutivo rendicionInforme ejecutivo rendicion
Informe ejecutivo rendicion
 
Accses 2
Accses 2Accses 2
Accses 2
 
DataManagement at Cummins_Mod
DataManagement at Cummins_ModDataManagement at Cummins_Mod
DataManagement at Cummins_Mod
 
Oracle12c threaded execution - Ressourcen sparen zum Nulltarif?!?
Oracle12c threaded execution - Ressourcen sparen zum Nulltarif?!?Oracle12c threaded execution - Ressourcen sparen zum Nulltarif?!?
Oracle12c threaded execution - Ressourcen sparen zum Nulltarif?!?
 
Style Guide for Short-Film 'The Ordinary'
Style Guide for Short-Film 'The Ordinary'Style Guide for Short-Film 'The Ordinary'
Style Guide for Short-Film 'The Ordinary'
 
Bonne Impression
Bonne ImpressionBonne Impression
Bonne Impression
 
South Orange Police Operational Report
South Orange Police Operational ReportSouth Orange Police Operational Report
South Orange Police Operational Report
 
Entrevista a D. Miguel, profesor de alemán en la E.O.I
Entrevista a D. Miguel,  profesor de alemán en la E.O.IEntrevista a D. Miguel,  profesor de alemán en la E.O.I
Entrevista a D. Miguel, profesor de alemán en la E.O.I
 
Inicio rapido
Inicio rapidoInicio rapido
Inicio rapido
 
Concurso la papa - Mentoring equipo y elevator pitch
Concurso la papa - Mentoring equipo y elevator pitchConcurso la papa - Mentoring equipo y elevator pitch
Concurso la papa - Mentoring equipo y elevator pitch
 
Litebi Truebi - Cloud Business Intelligence for your Department
Litebi   Truebi - Cloud Business Intelligence for your DepartmentLitebi   Truebi - Cloud Business Intelligence for your Department
Litebi Truebi - Cloud Business Intelligence for your Department
 
Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...
Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...
Thermoelectric and magnetic properties of Ca3Co4-xCuxO9+ δ with x = 0.00, 0.0...
 
IxDA Oslo Kick-off
IxDA Oslo Kick-offIxDA Oslo Kick-off
IxDA Oslo Kick-off
 
Proyecto
ProyectoProyecto
Proyecto
 
Bba 1 ibo u 5.3 e-commerce
Bba 1 ibo u 5.3 e-commerceBba 1 ibo u 5.3 e-commerce
Bba 1 ibo u 5.3 e-commerce
 
Ppt presentación proyecto fao mana
Ppt presentación proyecto fao manaPpt presentación proyecto fao mana
Ppt presentación proyecto fao mana
 

Similar a 2010-02-22 Wikipedia MTurk Research talk given in Taiwan's Academica Sinica

Marsh territorial world_bank
Marsh territorial world_bankMarsh territorial world_bank
Marsh territorial world_bank
jexxon
 
Wikimedia Foundation presentation to MS Technology Management students at Uni...
Wikimedia Foundation presentation to MS Technology Management students at Uni...Wikimedia Foundation presentation to MS Technology Management students at Uni...
Wikimedia Foundation presentation to MS Technology Management students at Uni...
webbyj
 

Similar a 2010-02-22 Wikipedia MTurk Research talk given in Taiwan's Academica Sinica (17)

China HCI Symposium 2010 March: Augmented Social Cognition Research from PARC...
China HCI Symposium 2010 March: Augmented Social Cognition Research from PARC...China HCI Symposium 2010 March: Augmented Social Cognition Research from PARC...
China HCI Symposium 2010 March: Augmented Social Cognition Research from PARC...
 
Research on Social Dynamics in Wikipedia
Research on Social Dynamics in WikipediaResearch on Social Dynamics in Wikipedia
Research on Social Dynamics in Wikipedia
 
BayCHI: Wikimania Redux
BayCHI: Wikimania ReduxBayCHI: Wikimania Redux
BayCHI: Wikimania Redux
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Openness as an Organizing Principle: Open for Diversity or Exclusionary Openn...
Openness as an Organizing Principle: Open for Diversity or Exclusionary Openn...Openness as an Organizing Principle: Open for Diversity or Exclusionary Openn...
Openness as an Organizing Principle: Open for Diversity or Exclusionary Openn...
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Model-Driven Research in Social Computing
Model-Driven Research in Social ComputingModel-Driven Research in Social Computing
Model-Driven Research in Social Computing
 
Offenheit als Organisationsprinzip: Offen für Diversität oder Exklusion durch...
Offenheit als Organisationsprinzip: Offen für Diversität oder Exklusion durch...Offenheit als Organisationsprinzip: Offen für Diversität oder Exklusion durch...
Offenheit als Organisationsprinzip: Offen für Diversität oder Exklusion durch...
 
Repurposing Wikipedia: Wikipedia as data set and analytical device
Repurposing Wikipedia: Wikipedia as data set and analytical deviceRepurposing Wikipedia: Wikipedia as data set and analytical device
Repurposing Wikipedia: Wikipedia as data set and analytical device
 
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
A Tale of Two Platforms: Emerging communicative patterns in two scientific bl...
 
Building better knowledge graphs through social computing
Building better knowledge graphs through social computingBuilding better knowledge graphs through social computing
Building better knowledge graphs through social computing
 
BioWikis BSB10
BioWikis BSB10BioWikis BSB10
BioWikis BSB10
 
Sustainability Committee 2009 SDAT Proposal
Sustainability Committee 2009 SDAT ProposalSustainability Committee 2009 SDAT Proposal
Sustainability Committee 2009 SDAT Proposal
 
Sustainability Committee 2009 SDAT Proposal
Sustainability Committee 2009 SDAT ProposalSustainability Committee 2009 SDAT Proposal
Sustainability Committee 2009 SDAT Proposal
 
Citation indexing
Citation indexingCitation indexing
Citation indexing
 
Marsh territorial world_bank
Marsh territorial world_bankMarsh territorial world_bank
Marsh territorial world_bank
 
Wikimedia Foundation presentation to MS Technology Management students at Uni...
Wikimedia Foundation presentation to MS Technology Management students at Uni...Wikimedia Foundation presentation to MS Technology Management students at Uni...
Wikimedia Foundation presentation to MS Technology Management students at Uni...
 

Más de Ed Chi

2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...
Ed Chi
 
Location and Language in Social Media (Stanford Mobi Social Invited Talk)
Location and Language in Social Media (Stanford Mobi Social Invited Talk)Location and Language in Social Media (Stanford Mobi Social Invited Talk)
Location and Language in Social Media (Stanford Mobi Social Invited Talk)
Ed Chi
 
Crowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI researchCrowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI research
Ed Chi
 
CIKM 2011 Social Computing Industry Invited Talk
CIKM 2011 Social Computing Industry Invited TalkCIKM 2011 Social Computing Industry Invited Talk
CIKM 2011 Social Computing Industry Invited Talk
Ed Chi
 
WikiSym 2011 Closing Keynote
WikiSym 2011 Closing KeynoteWikiSym 2011 Closing Keynote
WikiSym 2011 Closing Keynote
Ed Chi
 
Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...
Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...
Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...
Ed Chi
 
Zerozero88 Twitter URL Item Recommender
Zerozero88 Twitter URL Item RecommenderZerozero88 Twitter URL Item Recommender
Zerozero88 Twitter URL Item Recommender
Ed Chi
 
ASC Disaster Response Proposal from Aug 2007
ASC Disaster Response Proposal from Aug 2007ASC Disaster Response Proposal from Aug 2007
ASC Disaster Response Proposal from Aug 2007
Ed Chi
 

Más de Ed Chi (20)

2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...2017 10-10 (netflix ml platform meetup) learning item and user representation...
2017 10-10 (netflix ml platform meetup) learning item and user representation...
 
HCI Korea 2012 Keynote Talk on Model-Driven Research in Social Computing
HCI Korea 2012 Keynote Talk on Model-Driven Research in Social ComputingHCI Korea 2012 Keynote Talk on Model-Driven Research in Social Computing
HCI Korea 2012 Keynote Talk on Model-Driven Research in Social Computing
 
Location and Language in Social Media (Stanford Mobi Social Invited Talk)
Location and Language in Social Media (Stanford Mobi Social Invited Talk)Location and Language in Social Media (Stanford Mobi Social Invited Talk)
Location and Language in Social Media (Stanford Mobi Social Invited Talk)
 
Crowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI researchCrowdsourcing using MTurk for HCI research
Crowdsourcing using MTurk for HCI research
 
CIKM 2011 Social Computing Industry Invited Talk
CIKM 2011 Social Computing Industry Invited TalkCIKM 2011 Social Computing Industry Invited Talk
CIKM 2011 Social Computing Industry Invited Talk
 
WikiSym 2011 Closing Keynote
WikiSym 2011 Closing KeynoteWikiSym 2011 Closing Keynote
WikiSym 2011 Closing Keynote
 
CSCL 2011 Keynote on Social Computing and eLearning
CSCL 2011 Keynote on Social Computing and eLearningCSCL 2011 Keynote on Social Computing and eLearning
CSCL 2011 Keynote on Social Computing and eLearning
 
Replication is more than Duplication: Position slides for CHI2011 panel on re...
Replication is more than Duplication: Position slides for CHI2011 panel on re...Replication is more than Duplication: Position slides for CHI2011 panel on re...
Replication is more than Duplication: Position slides for CHI2011 panel on re...
 
Tutorial on Using Amazon Mechanical Turk (MTurk) for HCI Research
Tutorial on Using Amazon Mechanical Turk (MTurk) for HCI ResearchTutorial on Using Amazon Mechanical Turk (MTurk) for HCI Research
Tutorial on Using Amazon Mechanical Turk (MTurk) for HCI Research
 
Crowdsourcing for HCI Research with Amazon Mechanical Turk
Crowdsourcing for HCI Research with Amazon Mechanical TurkCrowdsourcing for HCI Research with Amazon Mechanical Turk
Crowdsourcing for HCI Research with Amazon Mechanical Turk
 
Eddi: Topic Browsing of Twitter Streams
Eddi: Topic Browsing of Twitter StreamsEddi: Topic Browsing of Twitter Streams
Eddi: Topic Browsing of Twitter Streams
 
Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...
Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...
Large Scale Social Analytics on Wikipedia, Delicious, and Twitter (presented ...
 
Model-based Research in Human-Computer Interaction (HCI): Keynote at Mensch u...
Model-based Research in Human-Computer Interaction (HCI): Keynote at Mensch u...Model-based Research in Human-Computer Interaction (HCI): Keynote at Mensch u...
Model-based Research in Human-Computer Interaction (HCI): Keynote at Mensch u...
 
Zerozero88 Twitter URL Item Recommender
Zerozero88 Twitter URL Item RecommenderZerozero88 Twitter URL Item Recommender
Zerozero88 Twitter URL Item Recommender
 
Smart eBooks: ScentIndex and ScentHighlight research published at VAST2006
Smart eBooks: ScentIndex and ScentHighlight research published at VAST2006Smart eBooks: ScentIndex and ScentHighlight research published at VAST2006
Smart eBooks: ScentIndex and ScentHighlight research published at VAST2006
 
ASC Disaster Response Proposal from Aug 2007
ASC Disaster Response Proposal from Aug 2007ASC Disaster Response Proposal from Aug 2007
ASC Disaster Response Proposal from Aug 2007
 
Using Information Scent to Model Users in Web1.0 and Web2.0
Using Information Scent to Model Users in Web1.0 and Web2.0Using Information Scent to Model Users in Web1.0 and Web2.0
Using Information Scent to Model Users in Web1.0 and Web2.0
 
2010-03-10 PARC Augmented Social Cognition Research Overview
2010-03-10 PARC Augmented Social Cognition Research Overview2010-03-10 PARC Augmented Social Cognition Research Overview
2010-03-10 PARC Augmented Social Cognition Research Overview
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
 
Slowing Growth of Wikipedia and Models of its Dynamic (Presented at Wikimedia...
Slowing Growth of Wikipedia and Models of its Dynamic (Presented at Wikimedia...Slowing Growth of Wikipedia and Models of its Dynamic (Presented at Wikimedia...
Slowing Growth of Wikipedia and Models of its Dynamic (Presented at Wikimedia...
 

Último

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

2010-02-22 Wikipedia MTurk Research talk given in Taiwan's Academica Sinica

  • 1. Ed H. Chi Area Manager and Principal Scientist Augmented Social Cognition Area Palo Alto Research Center
  • 2.   Cognition:  the  ability  to  remember,  think,  and  reason;  the  faculty  of   knowing.     Social  Cognition:  the  ability  of  a  group  to  remember,  think,  and   reason;  the  construction  of  knowledge  structures  by  a  group.   –  (not  quite  the  same  as  in  the  branch  of  psychology  that  studies  the   cognitive  processes  involved  in  social  interaction,  though  included)     Augmented  Social  Cognition:  Supported  by  systems,  the   enhancement    of  the  ability  of  a  group  to  remember,  think,  and   reason;  the  system-­‐supported  construction  of  knowledge   structures  by  a  group.     Citation:  Chi,  IEEE  Computer,  Sept  2008   2010-02-22 Ed H. Chi ASC Overview 2 2
  • 3. Characteriza*on   Models   Evalua*ons   Prototypes     Characterize activity on social systems with analytics   Model interaction social and community dynamics and variables   Prototype tools to increase benefits or reduce cost   Evaluate prototypes via Living Laboratories with real users 3 2010-02-22 Ed H. Chi ASC Overview 3
  • 4.   Characterization and Modeling: –  Community Analytics and Wikipedia Dynamics   Prototyping: –  Social Transparency thru WikiDashboard   Evaluation: –  Evaluations using Amazon Mechanical Turk 4 2010-02-22 Ed H. Chi ASC Overview 4
  • 5. Characteriza*on   Models   Evalua*ons   Prototypes  
  • 6. Conflict/Coordination  Effects  in  Wikipedia   2010-02-22 Ed H. Chi ASC Overview 6
  • 7. Mediator  Pattern  -­‐  Terri  Schiavo   Anonymous (vandals/ spammers) Sympathetic to husband Mediators Sympathetic to parents 2010-02-22 Ed H. Chi ASC Overview 7
  • 8. Measure  of  controversy   •  Controversial”  tag   • Use  #  revisions  tagged  controversial   2010-02-22 Ed H. Chi ASC Overview 8
  • 9. Page  metrics   •  Possible  metrics  for  identifying  conflict  in  articles   Metric type Page Type Revisions (#) Article, talk, article/talk Page length Article, talk, article/talk Unique editors Article, talk, article/talk Unique editors / revisions Article, talk Links from other articles Article, talk Links to other articles Article, talk Anonymous edits (#, %) Article, talk Administrator edits (#, %) Article, talk Minor edits (#, %) Article, talk Reverts (#, by unique Article editors) 2010-02-22 Ed H. Chi ASC Overview 9
  • 10. Performance:  Cross-­‐validation   • 5x  cross-­‐validation,  R2  =  0.897   2010-02-22 Ed H. Chi ASC Overview 10
  • 11. Determinants  of  conflict   Highly weighted features of conflict model:  Revisions  (talk)    Minor  edits  (talk)    Unique  editors  (talk)    Revisions  (article)    Unique  editors  (article)    Anonymous  edits  (talk)    Anonymous  edits  (article)   2010-02-22 Ed H. Chi ASC Overview 11
  • 12. Number of Articles (Log Scale) http://en.wikipedia.org/wiki/Wikipedia:Modelling_Wikipedia’s_growth 2010-02-22 Ed H. Chi ASC Overview 12 12
  • 13. 2010-02-22 Ed H. Chi ASC Overview 13 13
  • 14. Monthly Edits 2010-02-22 Ed H. Chi ASC Overview 14 14
  • 15. Monthly Edits 2010-02-22 Ed H. Chi ASC Overview 15 15
  • 16. Monthly Active Editors 2010-02-22 Ed H. Chi ASC Overview 16 16
  • 17. Characteriza*on   Models   Evalua*ons   Prototypes  
  • 18. 2010-02-22 Ed H. Chi ASC Overview 18 18
  • 19.   Edits beget edits –  more number of previous edits, more number of new edits Growth rate depends on current population N r = growth rate of the population N(t) = N 0 ⋅ e rt dN = r⋅ N dt Growth rate Current of population € population € 2010-02-22 Ed H. Chi ASC Overview 19 19
  • 20.   Ecological population growth model –  r, growth rate of the population –  K, carrying capacity (due to resource limitation) 4000000 K 3500000 3000000 dN N Population 2500000 = r ⋅ N ⋅ (1− ) 2000000 dt K 1500000 1000000 500000 0 2000 2002 2004 2006 2008 2010 Year 2010-02-22 Ed H. Chi ASC Overview 20 20
  • 21.   Follows a logistic growth curve New Article http://en.wikipedia.org/wiki/Wikipedia:Modelling_Wikipedia’s_growth 2010-02-22 Ed H. Chi ASC Overview 21 21
  • 22.   Carrying Capacity as a function of time. K(t) Population 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Year 2010-02-22 Ed H. Chi ASC Overview 22 22
  • 23.   Biological system –  Competition increases as population hit the limits of the ecology –  Advantage go to members of the population that have competitive dominance over others   Analogy –  Limited opportunities to make novel contributions –  Increased patterns of conflict and dominance 2010-02-22 Ed H. Chi ASC Overview 23 23
  • 24. 2010-02-22 Ed H. Chi ASC Overview 24 24
  • 25.   Highly skewed contribution pattern –  Top 3% users contribute 50%+ edits –  A lot of single-edit users   Five Editor Classes –  Monthly edit count –  No bot, vandalism included in the analysis –  1000+: editors who made more than 1000 edits in that month –  100-999 –  10-99 –  2-9 –  1 2010-02-22 Ed H. Chi ASC Overview 25 25
  • 26. Monthly Edits by Editor Class (in thousands) 2010-02-22 Ed H. Chi ASC Overview 26 26
  • 27. 2010-02-22 Ed H. Chi ASC Overview 27 27
  • 28. Monthly Ratio of Reverted Edits 2010-02-22 Ed H. Chi ASC Overview 28 28
  • 29.   Two interpretations: –  Overall increased resistance from the Wikipedia community to changing content –  Disparity of treatment of edits »  Occasional editors have been reverted in a higher rate   Example of increased patterns of conflict and dominance Photo: http://www.flickr.com/photos/efan78/3619921561/ 2010-02-22 Ed H. Chi ASC Overview 29 29
  • 30. 2010-02-22 Ed H. Chi ASC Overview 30 30
  • 31. Bongwon Suh, Gregorio Convertino, Ed H. Chi, Peter Pirolli. WikiSym 2009 2010-02-22 Ed H. Chi ASC Overview 31 31
  • 32. Characteriza*on   Models   Evalua*ons   Prototypes  
  • 33. “Wikipedia is the best thing ever. Anyone in the world can write anything they want about any subject, so you know you’re getting the best possible information.” – Steve Carell, The Office 2010-02-22 Ed H. Chi ASC Overview 33 33
  • 34.   Content in Wikipedia can be added or changed by anyone   Because of this, WP has become one of the most important resources on the web –  Hundreds of thousands of contributors –  Over 2 million articles –  5th most used websites (Alexa.com)   Also because of this, is viewed with skepticism by readers, press, researchers 2010-02-22 Ed H. Chi ASC Overview 34 34
  • 35. 2010-02-22 Ed H. Chi ASC Overview 35 35
  • 36. Nothing 2010-02-22 Ed H. Chi ASC Overview 36 36
  • 37. “Wikipedia, just by its nature, is impossible to trust completely. I don't think this can necessarily be changed.” 2010-02-22 Ed H. Chi ASC Overview 37 37
  • 38.   Risks with using Wikipedia –  Accuracy of content –  Motives of editors –  Expertise of editors –  Stability of article –  Coverage of topics –  Quality of cited information Insufficient information to evaluate trustworthiness 2010-02-22 Ed H. Chi ASC Overview 38 38
  • 39.   Transparency of social dynamics can reduce conflict and coordination issues   Attribution encourages contribution –  WikiDashboard: Social dashboard for wikis –  Prototype system: http://wikidashboard.parc.com   Visualization for every wiki page showing edit history timeline and top individual editors   Can drill down into activity history for specific editors and view edits to see changes side-by-side Citation: Suh et al. CHI 2008 Proceedings 39 2010-02-22 Ed H. Chi ASC Overview 39
  • 40. 2010-02-22 Ed H. Chi ASC Overview 40 40
  • 41. 2010-02-22 Ed H. Chi ASC Overview 41
  • 42. Characteriza*on   Models   Evalua*ons   Prototypes  
  • 43. Surfacing information •  Numerous studies mining Wikipedia revision history to surface trust-relevant information –  Adler & Alfaro, 2007; Dondio et al., 2006; Kittur et al., 2007; Viegas et al., 2004; Zeng et al., 2006 Suh, Chi, Kittur, & Pendleton, CHI2008 •  But how much impact can this have on user perceptions in a system which is inherently mutable? 43
  • 44. Hypotheses 1.  Visualization will impact perceptions of trust 2.  Compared to baseline, visualization will impact trust both positively and negatively 3.  Visualization should have most impact when high uncertainty about article •  Low quality •  High controversy 44
  • 45. Design •  3 x 2 x 2 design Controversial Uncontroversial Visualization Abortion Volcano High quality •  High stability George Bush Shark •  Low stability •  Baseline (none) Pro-life feminism Disk defragmenter Low quality Scientology and celebrities Beeswax 45
  • 46. Example: High trust visualization 46
  • 47. Example: Low trust visualization 47
  • 48. Summary info •  % from anonymous users 48
  • 49. Summary info •  % from anonymous users •  Last change by anonymous or established user 49
  • 50. Summary info •  % from anonymous users •  Last change by anonymous or established user •  Stability of words 50
  • 53. Method •  Users recruited via Amazon’s Mechanical Turk –  253 participants –  673 ratings –  7 cents per rating –  Kittur, Chi, & Suh, CHI 2008: Crowdsourcing user studies •  To ensure salience and valid answers, participants answered: –  In what time period was this article the least stable? –  How stable has this article been for the last month? –  Who was the last editor? –  How trustworthy do you consider the above editor? 53
  • 54. Results main effects of quality and controversy: • high-quality articles > low-quality articles (F(1, 425) = 25.37, p < .001) • uncontroversial articles > controversial articles (F(1, 425) = 4.69, p = . 031) 54
  • 55. Results interaction effects of quality and controversy: • high quality articles were rated equally trustworthy whether controversial or not, while • low quality articles were rated lower when they were controversial than when they were uncontroversial service. 55
  • 56. Results 1.  Significant effect of visualization –  High > low, p < .001 2.  Viz has both positive and negative effects –  High > baseline, p < .001 –  Low > baseline, p < .01 3.  No interaction of visualization with either quality or controversy –  Robust across conditions 56
  • 57. Results 1.  Significant effect of visualization –  High > low, p < .001 2.  Viz has both positive and negative effects –  High > baseline, p < .001 –  Low > baseline, p < .01 3.  No interaction of visualization with either quality or controversy –  Robust across conditions 57
  • 58. Results 1.  Significant effect of visualization –  High > low, p < .001 2.  Viz has both positive and negative effects –  High > baseline, p < .001 –  Low > baseline, p < .01 3.  No interaction effect of visualization with either quality or controversy –  Robust across conditions 58
  • 59. Characteriza*on   Models   Methodology Evalua*ons   Prototypes  
  • 60. User studies •  Getting input from users is important in HCI –  surveys –  rapid prototyping –  usability tests –  cognitive walkthroughs –  performance measures –  quantitative ratings
  • 61. User studies •  Getting input from users is expensive –  Time costs –  Monetary costs •  Often have to trade off costs with sample size
  • 62. Online solutions •  Online user surveys •  Remote usability testing •  Online experiments •  But still have difficulties –  Rely on practitioner for recruiting participants –  Limited pool of participants
  • 63. Crowdsourcing •  Make tasks available for anyone online to complete •  Quickly access a large user pool, collect data, and compensate users •  Experiences at PARC: –  CSL UbiComp group –  ISL’s NLTT group
  • 64. Crowdsourcing •  Make tasks available for anyone online to complete •  Quickly access a large user pool, collect data, and compensate users •  Example: NASA Clickworkers –  100k+ volunteers identified Mars craters from space photographs –  Aggregate results “virtually indistinguishable” from expert geologists experts crowds http://clickworkers.arc.nasa.gov
  • 65. Amazon’s Mechanical turk •  Market for “human intelligence tasks” •  Typically short, objective tasks –  Tag an image –  Find a webpage –  Evaluate relevance of search results •  Users complete for a few pennies each
  • 67. Using Mechanical Turk for user studies Traditional user Mechanical Turk studies Task complexity Complex Simple Long Short Task subjectivity Subjective Objective Opinions Verifiable User information Targeted demographics Unknown demographics High interactivity Limited interactivity Can Mechanical Turk be usefully used for user studies?
  • 68. Task •  Assess quality of Wikipedia articles •  Started with ratings from expert Wikipedians –  14 articles (e.g., “Germany”, “Noam Chomsky”) –  7-point scale •  Can we get matching ratings with mechanical turk?
  • 69. Experiment 1 •  Rate articles on 7-point scales: –  Well written –  Factually accurate –  Overall quality •  Free-text input: –  What improvements does the article need? •  Paid $0.05 each
  • 70. Experiment 1: Good news •  58 users made 210 ratings (15 per article) –  $10.50 total •  Fast results –  44% within a day, 100% within two days –  Many completed within minutes
  • 71. Experiment 1: Bad news •  Correlation between turkers and Wikipedians only marginally significant (r=.50, p=.07) •  Worse, 59% potentially invalid responses Experiment 1 Invalid 49% comments <1 min 31% responses •  Nearly 75% of these done by only 8 users
  • 72. Not a good start •  Summary so far: –  Only marginal correlation with experts. –  Heavy gaming of the system by a minority •  Possible Response: –  Can make sure these gamers are not rewarded –  Ban them from doing your hits in the future –  Create a reputation system [Delores Lab] •  Can we change how we collect user input ?
  • 73. Design changes •  Use verifiable questions to signal monitoring –  “How many sections does the article have?” –  “How many images does the article have?” –  “How many references does the article have?”
  • 74. Design changes •  Use verifiable questions to signal monitoring •  Make malicious answers as high cost as good-faith answers –  “Provide 4-6 keywords that would give someone a good summary of the contents of the article”
  • 75. Design changes •  Use verifiable questions to signal monitoring •  Make malicious answers as high cost as good-faith answers •  Make verifiable answers useful for completing task –  Used tasks similar to how Wikipedians described evaluating quality (organization, presentation, references)
  • 76. Design changes •  Use verifiable questions to signal monitoring •  Make malicious answers as high cost as good-faith answers •  Make verifiable answers useful for completing task •  Put verifiable tasks before subjective responses –  First do objective tasks and summarization –  Only then evaluate subjective quality –  Ecological validity?
  • 77. Experiment 2: Results •  124 users provided 277 ratings (~20 per article) •  Significant positive correlation with Wikipedians (r=. 66, p=.01) •  Smaller proportion malicious responses •  Increased time on task Experiment 1 Experiment 2 Invalid 49% 3% comments <1 min 31% 7% responses Median time 1:30 4:06
  • 78. Generalizing to other user studies •  Combine objective and subjective questions –  Rapid prototyping: ask verifiable questions about content/design of prototype before subjective evaluation –  User surveys: ask common-knowledge questions before asking for opinions
  • 79. Limitations of mechanical turk •  No control of users’ environment –  Potential for different browsers, physical distractions –  General problem with online experimentation •  Not designed for user studies –  Difficult to do between-subjects design –  Involves some programming •  Users –  Uncertainty about user demographics, expertise
  • 80. Conclusion •  Mechanical Turk offers the practitioner a way to access a large user pool and quickly collect data at low cost •  Good results require careful task design 1.  Use verifiable questions to signal monitoring 2.  Make malicious answers as high cost as good-faith answers 3.  Make verifiable answers useful for completing task 4.  Put verifiable tasks before subjective responses
  • 81. Ed  H.  Chi  (manager,  PS)   Peter  Pirolli  (RF)   Lichan  Hong   Bongwon  Suh   Les  Nelson   Rowan  Nairn     Gregorio  Convertino     Interns/Collaborators:     Sanjay  Kairam,  Jilin  Chen  (UMinn),  Michael  Bernstein  (MIT)   http://asc-­‐parc.blogspot.com     Ed H. Chi ASC Overview 81 2010-02-22
  • 82. 2010-02-22 Ed H. Chi ASC Overview 82
  • 83.   r, growth rate dN N = rN(1− )   K, carrying capacity dt K 4000000 3500000 3000000 € K dominates 2500000 r dominates when N K 2000000 when N is small N 1500000 N (1− ) ≈ 0 1000000 (1− ) ≈ 1 K 500000 K 0 2000 2002 2004 2006 2008 2010 Year 2010-02-22 Ed H. Chi ASC Overview 83 € €
  • 84.   r-Strategist –  Growth or exploitation –  Less-crowded niches / produce many offspring   K-Strategist –  Conservation –  Strong competitors in crowded niches / invest more heavily in fewer offspring   Evolution cycle –  Resilience of an ecological system –  Gunderson & Holling 2001 2010-02-22 Ed H. Chi ASC Overview 84
  • 85.   Exponential growth model dN –  Growth rate depends on the current N = r*N dt   Ecological population growth model –  r, growth rate of the population –  K, carrying capacity (due to resource limitation) € dN N = rN(1− ) dt K 2010-02-22 € Ed H. Chi ASC Overview 85
  • 86.   People-ware –  Growing resistance to changing content –  Coordination cost and bureaucracy   Knowledge-ware: Availability of easy topics to write about   Tool-ware: Quality of tools used by editors and admins http://www.aerostich.com/ http://www.mikestreetmedia.co.uk/blog/wp-content/uploads/2009/01/knowledge.jpg 2010-02-22 Ed H. Chi ASC Overview 86 http://youropenbook.agitprop.co.uk/growing.php?p=2