Introducing the Cognitive-Biases-in-Crowdsourcing Checklist

•Download as PPTX, PDF•

0 likes•68 views

TimDraws

Paper presentation at the CSCW 2021 Workshop on Investigating and Mitigating Biases in Crowdsourced Data

Science

1
WIS
Web
Information
Systems
Introducing the
Cognitive-Biases-in-Crowdsourcing
Checklist
CSCW2021Workshop–InvestigatingandMitigatingBiasesinCrowdsourcedData,October23,2021,Virtual
Tim Draws1, Alisa Rieger1, Oana Inel1, Ujwal Gadiraju1, and Nava Tintarev2
t.a.draws@tudelft.nl
@tmdrws
https://timdraws.net
1Delft University of Technology, 2Maastricht University

2
WIS
Web
Information
Systems
Cognitive Biases in Crowdsourcing
• Cognitive biases of crowdworkers can negatively
affect annotation quality
– Anchoring effect
– Confirmation bias
• Combating cognitive biases is tricky
– Many cognitive biases exist
– Unclear which bias may apply where
References: Eickhoff (2018); Hube, Fetahu, & Gadiraju (2019); Tversky & Kahneman (1974)

3
WIS
Web
Information
Systems
Introducing a Checklist
• Starting point: bias checklist for business decisions
• Adaptation to fit the crowdsourcing context
• Result: 12-item checklist to combat cognitive biases
in crowdsourcing (incl. running example)
References: Kahneman, Lovallo, & Sibony (2011)

4
WIS
Web
Information
Systems
Example
(3) Groupthink or Bandwagon Effect. Does my task design give crowd
workers some notion of other people’s evaluation of the items they annotate?
For example, crowd workers may judge products as more likely to be relevant to
“paella pan” when they see that a majority of other crowd workers have judged
this product as being relevant or if it has received high ratings from consumers.

5
WIS
Web
Information
Systems
Using the Checklist
1.Measure / assess cognitive biases
2.Mitigate cognitive biases
3.Document cognitive biases

6
WIS
Web
Information
Systems
Discussion & Conclusion
• Covering all different types of biases in crowdsourcing
• Updated version of checklist available on repository (link below)
• HCOMP paper: case study + retrospective analysis
t.a.draws@tudelft.nl
@tmdrws
https://timdraws.net
Paper: https://timdraws.net/files/papers/A_Checklist_to_Combat_Cognitive_Biases_in_Crowdsourcing.pdf
Preregistration and supplementary material: https://osf.io/rbucj/

7
WIS
Web
Information
Systems
References
Carsten Eickhoff. 2018. Cognitive biases in crowdsourcing. WSDM 2018 - Proceedings of the 11th ACM International Conference on
Web Search and Data Mining 2018-Febua (2018), 162–170. https://doi.org/10.1145/3159652.3159654
Tim Draws, Alisa Rieger, Oana Inel, Ujwal Gadiraju, and Nava Tintarev. 2021. A Checklist to Combat Cognitive Biases in
Crowdsourcing. Proceedings on the Ninth AAAI Conference on Human Computation and Crowdsourcing (2021).
https://timdraws.net/files/papers/A_Checklist_to_Combat_ Cognitive_Biases_in_Crowdsourcing.pdf
Christoph Hube, Besnik Fetahu, and Ujwal Gadiraju. 2019. Understanding and mitigating worker biases in the crowdsourced collection
of subjective judgments. Conference on Human Factors in Computing Systems - Proceedings (2019).
https://doi.org/10.1145/3290605.3300637
Daniel Kahneman, Dan Lovallo, and Olivier Sibony. 2011. Before you make that big decision... Harvard business review 89, 6 (2011).
Amos Tversky and Daniel Kahneman. 1974. Judgment under Uncertainty: Heuristics and Biases. Science 185 (Sept. 1974), 1124–
1131. https://doi.org/10.1126/science.185.4157.1124

What's hot

Code4 lib2012William Gunn

The Analytics and Data Science LandscapePhilip Bourne

tools for communicating in the computational sciencesBrian Bot

How can machine learning and AI in the cloud improve research?Jisc

The Future of FAIR Data: An international social, legal and technological inf...Michel Dumontier

Keynote Talk - Gaining Powerful Insights into Social Media ListeningDr Wasim Ahmed

LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...William Gunn

State of Florida Neo4J Graph Briefing - KeynoteNeo4j

Big Data Brown Bagusmanqureshi

Towards an integrated research management digital ecosystemJisc

Getting (and giving) credit for all that we domhaendel

infrastructure for communicating data-intensive scienceBrian Bot

20160414 23 Research Data ThingsKatina Toufexis

Big Data as a Catalyst for Collaboration & InnovationPhilip Bourne

Trust threads: Provenance for Data Reuse in Long Tail ScienceBeth Plale

Towards a Data CommonsMichael Becich

SGCI Science Gateways: Ushering in a New Era of Sustainability Sandra Gesing

Noshir Contractor's view on the future of Linked DataCarlos Pedrinaci

20160301 23 Research Data ThingsKatina Toufexis

VALA14_burrowsDeb Verhoeven

What's hot (20)

Code4 lib2012

The Analytics and Data Science Landscape

tools for communicating in the computational sciences

How can machine learning and AI in the cloud improve research?

The Future of FAIR Data: An international social, legal and technological inf...

Keynote Talk - Gaining Powerful Insights into Social Media Listening

LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...

State of Florida Neo4J Graph Briefing - Keynote

Big Data Brown Bag

Towards an integrated research management digital ecosystem

Getting (and giving) credit for all that we do

infrastructure for communicating data-intensive science

20160414 23 Research Data Things

Big Data as a Catalyst for Collaboration & Innovation

Trust threads: Provenance for Data Reuse in Long Tail Science

Towards a Data Commons

SGCI Science Gateways: Ushering in a New Era of Sustainability

Noshir Contractor's view on the future of Linked Data

20160301 23 Research Data Things

VALA14_burrows

Similar to Introducing the Cognitive-Biases-in-Crowdsourcing Checklist

The Internet of Things: What's next? PayamBarnaghi

Opportunities and methodological challenges of Big Data for official statist...Piet J.H. Daas

Service-oriented Cognitive Analytics in Smart Service SystemsDr.-Ing. Robin Hirt

Distributed Trust Architecture: The New Foundation of EverythingLiming Zhu

KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdfDr. Radhey Shyam

Introduction to Data Analytics and data analytics life cycleDr. Radhey Shyam

wireless sensor networkparry prabhu

The Science of Data Science James Hendler

Distributed Trust Architecture: The New Reality of ML-based SystemsLiming Zhu

Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...Anna De Liddo

Dynamic Data Analytics for the Internet of Things: Challenges and OpportunitiesPayamBarnaghi

Search, Discovery and Analysis of Sensory Data StreamsPayamBarnaghi

BIMCV, Banco de Imagen Medica de la Comunidad Valenciana. María de la IglesiaMaria de la Iglesia

Responsible AI & Cybersecurity: A tale of two technology risksLiming Zhu

Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...IT Network marcus evans

Information entanglementWillard Van De Bogart

CI_for_NAwebuploader

Big data: Challenges, Practices and TechnologiesNavneet Randhawa

Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard

Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014

Similar to Introducing the Cognitive-Biases-in-Crowdsourcing Checklist (20)

The Internet of Things: What's next?

Opportunities and methodological challenges of Big Data for official statist...

Service-oriented Cognitive Analytics in Smart Service Systems

Distributed Trust Architecture: The New Foundation of Everything

KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf

Introduction to Data Analytics and data analytics life cycle

wireless sensor network

The Science of Data Science

Distributed Trust Architecture: The New Reality of ML-based Systems

Collective Intelligence and Online Deliberation Platforms for Citizen Engagem...

Dynamic Data Analytics for the Internet of Things: Challenges and Opportunities

Search, Discovery and Analysis of Sensory Data Streams

BIMCV, Banco de Imagen Medica de la Comunidad Valenciana. María de la Iglesia

Responsible AI & Cybersecurity: A tale of two technology risks

Bigger and Better: Employing a Holistic Strategy for Big Data toward a Strong...

Information entanglement

CI_for_NA

Big data: Challenges, Practices and Technologies

Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...

Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour

Recently uploaded

SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2

Conjugation, transduction and transformationAreesha Ahmad

High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju

IDENTIFICATION OF THE LIVING- forensic medicinesherlingomez2

9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha

GBSN - Biochemistry (Unit 1)Areesha Ahmad

Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee

Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju

Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1

STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONrouseeyyy

Clean In Place(CIP).pptx .Poonam Aher Patil

GBSN - Microbiology (Unit 1)Areesha Ahmad

Unit5-Cloud.pptx for lpu course cse121 oManavSingh202607

Seismic Method Estimate velocity from seismic data.pptxAlMamun560346

Formation of low mass protostars and their circumstellar disksSérgio Sacani

Forensic Biology & Its biological significance.pdfrohankumarsinghrore1

Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour

Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385

Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav

Recently uploaded (20)

SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx

Conjugation, transduction and transformation

High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf

IDENTIFICATION OF THE LIVING- forensic medicine

9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000

GBSN - Biochemistry (Unit 1)

Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf

Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service

STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION

Clean In Place(CIP).pptx .

GBSN - Microbiology (Unit 1)

Unit5-Cloud.pptx for lpu course cse121 o

Seismic Method Estimate velocity from seismic data.pptx

Formation of low mass protostars and their circumstellar disks

Forensic Biology & Its biological significance.pdf

Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...

Pulmonary drug delivery system M.pharm -2nd sem P'ceutics

Zoology 5th semester notes( Sumit_yadav).pdf

Introducing the Cognitive-Biases-in-Crowdsourcing Checklist

1. 1 WIS Web Information Systems Introducing the Cognitive-Biases-in-Crowdsourcing Checklist CSCW2021Workshop–InvestigatingandMitigatingBiasesinCrowdsourcedData,October23,2021,Virtual Tim Draws1, Alisa Rieger1, Oana Inel1, Ujwal Gadiraju1, and Nava Tintarev2 t.a.draws@tudelft.nl @tmdrws https://timdraws.net 1Delft University of Technology, 2Maastricht University

2. 2 WIS Web Information Systems Cognitive Biases in Crowdsourcing • Cognitive biases of crowdworkers can negatively affect annotation quality – Anchoring effect – Confirmation bias • Combating cognitive biases is tricky – Many cognitive biases exist – Unclear which bias may apply where References: Eickhoff (2018); Hube, Fetahu, & Gadiraju (2019); Tversky & Kahneman (1974)

3. 3 WIS Web Information Systems Introducing a Checklist • Starting point: bias checklist for business decisions • Adaptation to fit the crowdsourcing context • Result: 12-item checklist to combat cognitive biases in crowdsourcing (incl. running example) References: Kahneman, Lovallo, & Sibony (2011)

4. 4 WIS Web Information Systems Example (3) Groupthink or Bandwagon Effect. Does my task design give crowd workers some notion of other people’s evaluation of the items they annotate? For example, crowd workers may judge products as more likely to be relevant to “paella pan” when they see that a majority of other crowd workers have judged this product as being relevant or if it has received high ratings from consumers.

5. 5 WIS Web Information Systems Using the Checklist 1.Measure / assess cognitive biases 2.Mitigate cognitive biases 3.Document cognitive biases

6. 6 WIS Web Information Systems Discussion & Conclusion • Covering all different types of biases in crowdsourcing • Updated version of checklist available on repository (link below) • HCOMP paper: case study + retrospective analysis t.a.draws@tudelft.nl @tmdrws https://timdraws.net Paper: https://timdraws.net/files/papers/A_Checklist_to_Combat_Cognitive_Biases_in_Crowdsourcing.pdf Preregistration and supplementary material: https://osf.io/rbucj/

7. 7 WIS Web Information Systems References Carsten Eickhoff. 2018. Cognitive biases in crowdsourcing. WSDM 2018 - Proceedings of the 11th ACM International Conference on Web Search and Data Mining 2018-Febua (2018), 162–170. https://doi.org/10.1145/3159652.3159654 Tim Draws, Alisa Rieger, Oana Inel, Ujwal Gadiraju, and Nava Tintarev. 2021. A Checklist to Combat Cognitive Biases in Crowdsourcing. Proceedings on the Ninth AAAI Conference on Human Computation and Crowdsourcing (2021). https://timdraws.net/files/papers/A_Checklist_to_Combat_ Cognitive_Biases_in_Crowdsourcing.pdf Christoph Hube, Besnik Fetahu, and Ujwal Gadiraju. 2019. Understanding and mitigating worker biases in the crowdsourced collection of subjective judgments. Conference on Human Factors in Computing Systems - Proceedings (2019). https://doi.org/10.1145/3290605.3300637 Daniel Kahneman, Dan Lovallo, and Olivier Sibony. 2011. Before you make that big decision... Harvard business review 89, 6 (2011). Amos Tversky and Daniel Kahneman. 1974. Judgment under Uncertainty: Heuristics and Biases. Science 185 (Sept. 1974), 1124– 1131. https://doi.org/10.1126/science.185.4157.1124

Editor's Notes

1. Cognitive biases of crowd workers are an impactful but often neglected type of systemic bias that can reduce the quality of crowdsourced data labels. 2. These cognitive biases are general human tendencies towards irrational decision-making that often occur subconsciously. 3. For example, crowd workers may fall prey to the *anchoring effect* when they are overly influenced by information they encounter first or the *confirmation bias* when judging in line with some (false) preexisting beliefs. 4. Previous work that a plurality of cognitive biases can occur in different types of crowdsourcing tasks. 5. So why do requesters rarely consider the influence of cognitive biases in the tasks they design? 6. One important reason here may be that combating cognitive biases is simply tricky. 7. A large number of different cognitive biases have been identified in psychological literature and may often be unclear which specific cognitive bias may occur in a given crowdsourcing task. To efficiently document, assess, and mitigate cognitive biases in crowdsourcing, requesters need a practical tool to help them navigate this space.
1. Proposing such a practical tool is what we aimed to do in this research. 2. As a starting point, we used a checklist proposed by Kahneman, Lovallo, and Sibony (2011) for the context of business psychology. 3. The big advantage of using a checklist is that it can reduce a complex space (e.g., cognitive biases) to a considerable degree while retaining useful information. 4. Specifically, the checklist proposed by Kahneman, Lovallo, and Sibony aims to help business decision-makers to avoid falling prey to cognitive biases using 12 items. 5. These 12 items cover the vast majority of potential rational mistakes by focusing on most commonly occurring ones and grouping biases together (give example). 6. To develop a similar checklist tool for our context, we adapted the checklist proposed by Kahneman, Lovallo, and Sibony to the context of cognitive biases in crowdsourcing; which we propose in an upcoming paper at HCOMP this year. 7. Our proposed checklist similarly contains 12 items that requesters can consider to identify potential cognitive biases elicited by their crowdsourcing task.
8. Each of the 12 items covers a specific cognitive bias or family of biases that may occur in the crowdsourcing context. 9. For example, the third item in our checklist concerns *groupthink* or the *bandwagon effect*. (read out loud) 10. Going through each of the 12 items in this format should help requesters efficiently combat cognitive biases in crowdsourcing.
What can requesters do with the information they get from the checklist? 1. Having identified one or more cognitive biases that may affect crowd workers in the task at hand (ideally before collecting the data), the requester may use this information for three different purposes. 2. First, they may wish to *measure* the cognitive biases in question to assess whether crowd workers are actually affected by them. This may require adding additional items or metrics to the crowdsourcing task. 3. Second, requesters could *mitigate* the cognitive biases. Earlier work has already proposed a couple of solutions for this. 4. Third, requesters may *document* the cognitive biases so that the collected data is put in the right perspective. Pointing out such potential limitations can help others when interpreting results or re-using the data.
Of course, checklist is limited That’s why we have a live version of it In our upcoming HCOMP paper, we illustrate the use of our checklist at the hand of a case study. We there also present a retrospective analysis of past HCOMP papers to show that cognitive biases may affect crowd workers in a majority of crowdsourcing tasks but are rarely dealt with. We hope that our proposed checklist can meaningfully contribute to general efforts towards more reliable human-labeled data.

Introducing the Cognitive-Biases-in-Crowdsourcing Checklist

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Introducing the Cognitive-Biases-in-Crowdsourcing Checklist

Similar to Introducing the Cognitive-Biases-in-Crowdsourcing Checklist (20)

Recently uploaded

Recently uploaded (20)

Introducing the Cognitive-Biases-in-Crowdsourcing Checklist

Editor's Notes