SlideShare una empresa de Scribd logo
1 de 45
Descargar para leer sin conexión
ONE DOES NOT SIMPLY CROWDSOURCE
THE SEMANTIC WEB:
10 YEARS WITH PEOPLE, URIS, GRAPHS
AND INCENTIVES
Elena Simperl
Sematics 2018, Vienna
CROWDSOURCING = OUTSOURCING + CROWD
2
AUGMENTED INTELLIGENCE:
CROWDSOURCING CREATES LABELLED DATA FOR ML
ALGORITHMS
CROWDSOURCING AND
THE SEMANTIC WEB
Semantic applications
developers use
crowdsourcing to
achieve a goal
The Semantic Web is a
giant crowdsourcing
project
THE DESIGN SPACE OF A CROWDSOURCING
PROJECT
What
Goal
Who
Staffing
How
Process
Why
Incentives
CROWDSOURCING - WHAT
Tasks based on human
skills, not easily replicable
by machines
Editing knowledge graphs
Adding semantic annotations to
media
Adding multilingual labels to entities
Defining links between entities
…
Most effective when used at scale
(‘open call’), in combination w/
machine intelligence
CROWDSOURCING – WHO
Acosta, M., Zaveri, A., Simperl, E., Kontokostas, D., Flöck, F., &
Lehmann, J. (2016). Detecting Linked Data quality issues via
crowdsourcing: A DBpedia study. Semantic Web Journal, 1-34.
There is more to crowdsourcing than
Mechnical Turk
Use the right crowd for the right task
BACKGROUND
Varying quality of Linked Data sources
Detecting and correcting errors may require
manual inspection
dbpedia:Dave_Dobbyn dbprop:dateOfBirth “3”.
Contest
LD Experts
Difficult task
Final prize
Find Verify
Microtasks
Workers
Easy task
Micropayments
TripleCheckMate MTurk
Incorrect object
Incorrect data type
Incorrect outlink
Object values Data types Interlinks
Linked Data
experts
0.7151 0.8270 0.1525
MTurk
(majority voting)
0.8977 0.4752 0.9412
Results: Precision
Approach MTurk interfaces Findings
Use the right
crowd for the
right task
Experts detect a range
of issues, but will not
invest additional effort
Turkers can carry out the
three tasks and are
exceptionally good at
data comparisons
Diverse crowds are better
Piscopo, A., Phethean, C., & Simperl, E. (2017). What Makes a
Good Collaborative Knowledge Graph: Group Composition and
Quality in Wikidata. International Conference on Social
Informatics, 305-322, Springer.
BACKGROUND
Items and statements in Wikidata are edited by teams of
editors
Editors have varied tenure and interests
Group composition impacts outcomes
Diversity can have multiple effects
Moderate tenure diversity increases outcome quality
Interest diversity leads to increased group productivity
Chen, J., Ren, Y., Riedl, J.: The effects of diversity on group productivity and member withdrawal in online volunteer groups. In: Proceedings of the
28th international conference on human factors in computing systems - CHI ’10. p. 821. ACM Press, New York, USA (2010)
STUDY
Analysed the edit history of items
Corpus of 5k items, whose quality has been manually
assessed (5 levels)*
Edit history focused on community make-up
Community is defined as set of editors of item
Considered features from group diversity literature and
Wikidata-specific aspects
*https://www.wikidata.org/wiki/Wikidata:Item_quality
HYPOTHESES
Activity Outcome
H1 Bots edits Item quality
H2 Bot-human interaction Item quality
H3 Anonymous edits Item quality
H4 Tenure diversity Item quality
H5 Interest diversity Item quality
RESULTS
ALL HYPOTHESES SUPPORTED
H1
H2
H3 H4
H5
SUMMARY AND IMPLICATIONS
The more is
not always
the merrier
01
Bot edits are
key for quality,
but bots and
humans are
better
02
Registered
editors have
a positive
impact
Diversity
matters
04
Encourage
registration
01
Identify
further areas
for bot editing
02
Design
effective
human-bot
workflows
03
Suggest items
to edit based
on tenure and
interests
04
03
CROWDSOURCING - HOW
Bu, Q., Simperl, E., Zerr, S., & Li, Y. (2016). Using microtasks to
crowdsource DBpedia entity classification: A study in workflow
design. Semantic Web Journal, 1-18.
There are different ways to carry
out a task using crowdsourcing
They will produce different results
THREE WORKFLOWS TO
CROWDSOURCE ENTITY
TYPING
Free associations
Validating the machine
Exploring the DBpedia ontology
Findings
 Shortlists are easy & fast
 Popular classes are not enough
 Alternative ways to explore the taxonomy
 Freedom comes with a price
 Unclassified entities might be unclassifiable
 Different human data interfaces
 Working at the basic level of abstraction achieves
greatest precision
 But when given the freedom to choose, users suggest more specific
classes
4.58M
things
Kaffee, L. A., Elsahar, H., Vougiouklis, P., Gravier, C., Laforest, F.,
Hare, J., & Simperl, E. (2018). Mind the (Language) Gap:
Generation of Multilingual Wikipedia Summaries from Wikidata
for ArticlePlaceholders. In European Semantic Web Conference (pp.
319-334). Springer.
Crowds need human-readable
interfaces to KGs
BACKGROUND
Wikipedia is available in 287 languages,
but content is unevenly distributed
Wikidata is cross-lingual
ArticlePlaceholders display Wikidata
triples as stubs for articles in underserved
Wikipedia’s
Currently deployed in 11 Wikipedia’s
STUDY
Enrich ArticlePlaceholders with textual summaries generated from Wikidata
triples
Train a neural network to generate one sentence summaries resembling the
opening paragraph of a Wikipedia article
Test the approach on two languages, Esperanto and Arabic with readers and
editors of those Wikipedia’s
APPROACH
NEURAL NETWORK TRAINED ON WIKIDATA/WIKIPEDIA
Feed-forward architecture encodes
triples from the ArticlePlaceholder into
vector of fixed dimensionality
RNN-based decoder generates text
summaries, one token at a time
Optimisations for different entity
verbalisations, rare entities etc.
AUTOMATIC EVALUATION
APPROACH OUTPERFORMS BASELINES
Trained on corpus of Wikipedia sentences and corresponding Wikidata triples (205k Arabic; 102k Esperanto)
Tested against three baselines: machine translation (MT) and template retrieval (TR, TRext)
Using standard metrics: BLEU, METEOR, ROUGEL
The image part with relationship ID rId3 was not found in the file.
USER STUDIES
SUMMARIES ARE USEFUL FOR THE COMMUNITY
Readers study,
15 days, mixed
corpus of 60
articles
Editors study, 15
days, 30 summaries
The image part with relationship ID rId4 was not found in the file.
CROWDSOURCING - WHY
29
THEORY OF MOTIVATION
People do things for three
reasons
Love and glory keep costs
down
Money and glory deliver
faster
LOVE
MONEY
GLORY
30
PAID MICROTASKS
More money makes the
crowd work faster*
How about love and
glory?
*[Mason &Watts, 2009]
31
EXPERIMENT 1
Make paid microtasks more
cost-effective w/ gamification
Workers will perform better if tasks are more
engaging
 Increased accuracy through higher inter-annotator
agreement
 Cost savings through reduced unit costs
Micro-targeting incentives when players
attempt to quit improves retention
MICROTASK DESIGN
Image labelling tasks, published on microtask
platform
 Free-text labels, varying numbers of labels per image,
taboo words
 Workers can skip images, play as much as they want
Baseline: ‘standard’ tasks w/ basic spam control
vs
Gamified: same requirements & rewards, but
crowd asked to complete tasks in Wordsmith
vs
Gamified & furtherance incentives: additional
rewards to stay (random, personalised)
32
33
EVALUATION
ESP data set as gold standard
#labels, agreement, mean & max
#labels/worker
Three tasks
 Nano: 1 image
 Micro: 11 images
 Small: up to 2000 images
Probabilistic reasoning to predict worker
exit and personalize furtherance
incentives
RESULTS (GAMIFICATION, 1 IMAGE)
BETTER, CHEAPER, BUT FEWER WORKERS
34
Metric CrowdFlower Wordsmith
Total workers 600 423
Total keywords 1,200 41,206
Unique keywords 111 5,708
Avg. agreement 5.72% 37.7%
Avg. images/person 1 32
Max images/person 1 200
RESULTS (GAMIFICATION, 11 IMAGES)
COMPARABLE QUALITY, HIGHER UNIT COSTS, FEWER DROPOUTS
35
Metric CrowdFlower Wordsmith
Total workers 600 514
Total keywords 13,200 35,890
Unique keywords 1,323 4,091
Avg. agreement 6.32% 10.9%
Avg. images/person 11 27
Max images/person 1 351
RESULTS (WITH FURTHERANCE INCENTIVES)
MORE ENGAGEMENT, TARGETING WORKS
Increased participation
 People come back (20 times) and play longer (43 hours vs 3 hours without incentives)
 Financial incentives play important role
Targeted incentives work
 77% players stayed vs. 27% in the randomised condition
 19% more labels compared to no incentives condition
36
37
EXPERIMENT 2
Make paid microtasks more cost-effective
w/ social incentives
Working in pairs is more effective than the
baseline
 Increased higher inter-annotator agreement
 Higher output
Social incentives improve retention past
payment threshold
MICROTASK DESIGN
Image labelling tasks published on microtask platform
 Free-text labels, varying numbers of labels per image,
taboo words
Baseline: ‘standard’ tasks w/ basic spam control
vs
Pairs: Wordsmith-based, randomly formed pairs, people
join and leave all the time, in time more partner switches
vs
Pairs & social incentives: let’s play vs please stay
offered to worker when we expect their partner to leave
38
INCENTIVES
39
No global leaderboard
Empathic social pressure: stay (and help your partner get paid)
Social flow: keep playing and having fun together
40
EVALUATION
ESP data set as gold standard
Evaluated #labels, agreement, avg/max
#labels/worker
Two tasks
 Low threshold: 1 image
 High threshold: 11 images
Probabilistic reasoning to predict worker
exit* and offer social incentive
* [Kobren et al, 2015] extended w/ utility
features
RESULTS (COLLABORATION)
BETTER, CHEAPER, FEWER WORKERS, ADDS COMPLEXITY
41
RESULTS (SOCIAL INCENTIVES)
IMPROVED RETENTION, PLEASE STAY MORE EFFECTIVE
42
43
SUMMARY OF FINDINGS
Social incentives generate more tags and improve retention
Social dynamics: different responses if partner has been paid or not
 Paid worker 76% more likely to stay after social pressure, unpaid worker:
95% more likely to stay
 Paid workers annotate more if they decide to stay than unpaid workers
Social flow more effective than social pressure in generating more tags: 99%
of unpaid workers are likely to stay
Social pressure works more often overall
ONE DOES NOT SIMPLY
CROWDSOURCE THE SEMANTIC WEB
45
CONCLUSIONS
With AI and ML on the rise, crowdsourcing
is a critical for any Semantic Web
developer
Explore the what, who, how, why design
space
Use the full range of approaches and
techniques to scale to large datasets

Más contenido relacionado

La actualidad más candente

Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Elena Simperl
 
Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...
Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...
Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...FIA2010
 
Reality Mining
Reality MiningReality Mining
Reality MiningCI&T
 
Open data for smart cities
Open data for smart citiesOpen data for smart cities
Open data for smart citiesSören Auer
 
E psi open data - rejseplanen
E psi   open data - rejseplanenE psi   open data - rejseplanen
E psi open data - rejseplanenePSI Platform
 
Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)Jan Sifra
 
Engines of Order. Social Media and the Rise of Algorithmic Knowing.
Engines of Order. Social Media and the Rise of Algorithmic Knowing.Engines of Order. Social Media and the Rise of Algorithmic Knowing.
Engines of Order. Social Media and the Rise of Algorithmic Knowing.Bernhard Rieder
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachAndry Alamsyah
 
Tweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% SampleTweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% SampleBernhard Rieder
 
Knime social media_white_paper
Knime social media_white_paperKnime social media_white_paper
Knime social media_white_paperFiras Husseini
 
SP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiSP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiJohn Breslin
 
Memory Connected
Memory ConnectedMemory Connected
Memory ConnectedLi Ding
 
Re-identification of Anomized CDR datasets using Social networlk Data
Re-identification of Anomized CDR datasets using Social networlk DataRe-identification of Anomized CDR datasets using Social networlk Data
Re-identification of Anomized CDR datasets using Social networlk DataAlket Cecaj
 
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...Istituto nazionale di statistica
 
Platforms and Analytical Gestures
Platforms and Analytical GesturesPlatforms and Analytical Gestures
Platforms and Analytical GesturesBernhard Rieder
 
To trust, or not to trust: Highlighting the need for data provenance in mobil...
To trust, or not to trust: Highlighting the need for data provenance in mobil...To trust, or not to trust: Highlighting the need for data provenance in mobil...
To trust, or not to trust: Highlighting the need for data provenance in mobil...Jon Lazaro Aduna
 
A Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine PerceptionA Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine PerceptionCory Andrew Henson
 
Truth, Justice, and Technicity: from Bias to the Politics of Systems
Truth, Justice, and Technicity: from Bias to the Politics of SystemsTruth, Justice, and Technicity: from Bias to the Politics of Systems
Truth, Justice, and Technicity: from Bias to the Politics of SystemsBernhard Rieder
 
On Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric DiversificationOn Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric DiversificationBernhard Rieder
 

La actualidad más candente (20)

Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?Are our knowledge graphs trustworthy?
Are our knowledge graphs trustworthy?
 
EU FP7 CityPulse Project
EU FP7 CityPulse ProjectEU FP7 CityPulse Project
EU FP7 CityPulse Project
 
Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...
Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...
Real World Internet, Smart Cities and Linked Data: Mirko Presser (Alexandrea ...
 
Reality Mining
Reality MiningReality Mining
Reality Mining
 
Open data for smart cities
Open data for smart citiesOpen data for smart cities
Open data for smart cities
 
E psi open data - rejseplanen
E psi   open data - rejseplanenE psi   open data - rejseplanen
E psi open data - rejseplanen
 
Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)Reality Mining (Nathan Eagle)
Reality Mining (Nathan Eagle)
 
Engines of Order. Social Media and the Rise of Algorithmic Knowing.
Engines of Order. Social Media and the Rise of Algorithmic Knowing.Engines of Order. Social Media and the Rise of Algorithmic Knowing.
Engines of Order. Social Media and the Rise of Algorithmic Knowing.
 
Big Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network ApproachBig Data Analytics : A Social Network Approach
Big Data Analytics : A Social Network Approach
 
Tweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% SampleTweets are Not Created Equal. Intersecting Devices in the 1% Sample
Tweets are Not Created Equal. Intersecting Devices in the 1% Sample
 
Knime social media_white_paper
Knime social media_white_paperKnime social media_white_paper
Knime social media_white_paper
 
SP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with GephiSP1: Exploratory Network Analysis with Gephi
SP1: Exploratory Network Analysis with Gephi
 
Memory Connected
Memory ConnectedMemory Connected
Memory Connected
 
Re-identification of Anomized CDR datasets using Social networlk Data
Re-identification of Anomized CDR datasets using Social networlk DataRe-identification of Anomized CDR datasets using Social networlk Data
Re-identification of Anomized CDR datasets using Social networlk Data
 
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
Giorgio Alleva, Data Innovation in Official Statistics: the Leading Role of O...
 
Platforms and Analytical Gestures
Platforms and Analytical GesturesPlatforms and Analytical Gestures
Platforms and Analytical Gestures
 
To trust, or not to trust: Highlighting the need for data provenance in mobil...
To trust, or not to trust: Highlighting the need for data provenance in mobil...To trust, or not to trust: Highlighting the need for data provenance in mobil...
To trust, or not to trust: Highlighting the need for data provenance in mobil...
 
A Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine PerceptionA Semantics-based Approach to Machine Perception
A Semantics-based Approach to Machine Perception
 
Truth, Justice, and Technicity: from Bias to the Politics of Systems
Truth, Justice, and Technicity: from Bias to the Politics of SystemsTruth, Justice, and Technicity: from Bias to the Politics of Systems
Truth, Justice, and Technicity: from Bias to the Politics of Systems
 
On Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric DiversificationOn Digital Markets, Data, and Concentric Diversification
On Digital Markets, Data, and Concentric Diversification
 

Similar a One does not simply crowdsource the Semantic Web: 10 years with people, URIs, graphs and incentives

Beyond monetary incentives: experiments with paid microtasks
Beyond monetary incentives: experiments with paid microtasksBeyond monetary incentives: experiments with paid microtasks
Beyond monetary incentives: experiments with paid microtasksElena Simperl
 
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...Stephanie Steinhardt
 
Visualization for Software Analytics
Visualization for Software AnalyticsVisualization for Software Analytics
Visualization for Software AnalyticsMargaret-Anne Storey
 
ChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai searchChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai searchrohitcse52
 
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)Margaret-Anne Storey
 
2021006 jim spohrer mc gill_precision_convergence_panel v3
2021006 jim spohrer mc gill_precision_convergence_panel v32021006 jim spohrer mc gill_precision_convergence_panel v3
2021006 jim spohrer mc gill_precision_convergence_panel v3ISSIP
 
Doctoral seminar (DBIS RWTH Aachen)
Doctoral seminar  (DBIS RWTH Aachen)Doctoral seminar  (DBIS RWTH Aachen)
Doctoral seminar (DBIS RWTH Aachen)Zina Petrushyna
 
Web2.0 and What it Means for Business
Web2.0 and What it Means for BusinessWeb2.0 and What it Means for Business
Web2.0 and What it Means for BusinessRich Miller
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Ed Chi
 
The Language of Cyberspace: UTS Lecture
The Language of Cyberspace: UTS LectureThe Language of Cyberspace: UTS Lecture
The Language of Cyberspace: UTS LectureTatiana Pentes
 
Community benefits for all 20210511 v3
Community benefits for all 20210511 v3Community benefits for all 20210511 v3
Community benefits for all 20210511 v3ISSIP
 
Proactive Displays IIIA 20080627
Proactive Displays IIIA 20080627Proactive Displays IIIA 20080627
Proactive Displays IIIA 20080627Joe McCarthy
 
Usability First - Introduction to User-Centered Design
Usability First - Introduction to User-Centered DesignUsability First - Introduction to User-Centered Design
Usability First - Introduction to User-Centered Design@cristobalcobo
 
<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...
<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...
<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...Nicolas Brousse
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest linkCS, NcState
 
Greater Expectations: The Layered Meanings of Print and Web Style
Greater Expectations:  The Layered Meanings of Print and Web StyleGreater Expectations:  The Layered Meanings of Print and Web Style
Greater Expectations: The Layered Meanings of Print and Web StyleCasey McArdle
 
Comarch Loyalty Breakfast - behavioural analysis in gamification
Comarch Loyalty Breakfast - behavioural analysis in gamification Comarch Loyalty Breakfast - behavioural analysis in gamification
Comarch Loyalty Breakfast - behavioural analysis in gamification Malgorzata Kendziorek
 

Similar a One does not simply crowdsource the Semantic Web: 10 years with people, URIs, graphs and incentives (20)

Beyond monetary incentives: experiments with paid microtasks
Beyond monetary incentives: experiments with paid microtasksBeyond monetary incentives: experiments with paid microtasks
Beyond monetary incentives: experiments with paid microtasks
 
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
Designing for Collaboration: Challenges & Considerations of Multi-Use Informa...
 
Summ11 useinterx
Summ11 useinterxSumm11 useinterx
Summ11 useinterx
 
UCIDesign.ppt
UCIDesign.pptUCIDesign.ppt
UCIDesign.ppt
 
Visualization for Software Analytics
Visualization for Software AnalyticsVisualization for Software Analytics
Visualization for Software Analytics
 
ChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai searchChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai search
 
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
Towards the Social Programmer (MSR 2012 Keynote by M. Storey)
 
2021006 jim spohrer mc gill_precision_convergence_panel v3
2021006 jim spohrer mc gill_precision_convergence_panel v32021006 jim spohrer mc gill_precision_convergence_panel v3
2021006 jim spohrer mc gill_precision_convergence_panel v3
 
Doctoral seminar (DBIS RWTH Aachen)
Doctoral seminar  (DBIS RWTH Aachen)Doctoral seminar  (DBIS RWTH Aachen)
Doctoral seminar (DBIS RWTH Aachen)
 
Web2.0 and What it Means for Business
Web2.0 and What it Means for BusinessWeb2.0 and What it Means for Business
Web2.0 and What it Means for Business
 
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
Information Seeking with Social Signals: Anatomy of a Social Tag-based Explor...
 
The Language of Cyberspace: UTS Lecture
The Language of Cyberspace: UTS LectureThe Language of Cyberspace: UTS Lecture
The Language of Cyberspace: UTS Lecture
 
Community benefits for all 20210511 v3
Community benefits for all 20210511 v3Community benefits for all 20210511 v3
Community benefits for all 20210511 v3
 
Proactive Displays IIIA 20080627
Proactive Displays IIIA 20080627Proactive Displays IIIA 20080627
Proactive Displays IIIA 20080627
 
Atlas.ti tutorial
Atlas.ti tutorialAtlas.ti tutorial
Atlas.ti tutorial
 
Usability First - Introduction to User-Centered Design
Usability First - Introduction to User-Centered DesignUsability First - Introduction to User-Centered Design
Usability First - Introduction to User-Centered Design
 
<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...
<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...
<Programming> 2019 - ICW'19: The Issue of Monorepo and Polyrepo In Large Ente...
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Greater Expectations: The Layered Meanings of Print and Web Style
Greater Expectations:  The Layered Meanings of Print and Web StyleGreater Expectations:  The Layered Meanings of Print and Web Style
Greater Expectations: The Layered Meanings of Print and Web Style
 
Comarch Loyalty Breakfast - behavioural analysis in gamification
Comarch Loyalty Breakfast - behavioural analysis in gamification Comarch Loyalty Breakfast - behavioural analysis in gamification
Comarch Loyalty Breakfast - behavioural analysis in gamification
 

Más de Elena Simperl

This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceElena Simperl
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationElena Simperl
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backElena Simperl
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so farElena Simperl
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringElena Simperl
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactElena Simperl
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfElena Simperl
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringElena Simperl
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfElena Simperl
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterElena Simperl
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesElena Simperl
 
Inclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachInclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachElena Simperl
 
Loops of humans and bots in Wikidata
Loops of humans and bots in WikidataLoops of humans and bots in Wikidata
Loops of humans and bots in WikidataElena Simperl
 
Making transport smarter, leveraging the human factor
Making transport smarter, leveraging the human factorMaking transport smarter, leveraging the human factor
Making transport smarter, leveraging the human factorElena Simperl
 
Quality and collaboration in Wikidata
Quality and collaboration in WikidataQuality and collaboration in Wikidata
Quality and collaboration in WikidataElena Simperl
 
The business of open data
The business of open dataThe business of open data
The business of open dataElena Simperl
 
Open data – are we done
Open data – are we doneOpen data – are we done
Open data – are we doneElena Simperl
 

Más de Elena Simperl (20)

This talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing scienceThis talk was not generated with ChatGPT: how AI is changing science
This talk was not generated with ChatGPT: how AI is changing science
 
Knowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generationKnowledge graph use cases in natural language generation
Knowledge graph use cases in natural language generation
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
The web of data: how are we doing so far
The web of data: how are we doing so farThe web of data: how are we doing so far
The web of data: how are we doing so far
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Open government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impactOpen government data portals: from publishing to use and impact
Open government data portals: from publishing to use and impact
 
Ten myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdfTen myths about knowledge graphs.pdf
Ten myths about knowledge graphs.pdf
 
What Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineeringWhat Wikidata teaches us about knowledge engineering
What Wikidata teaches us about knowledge engineering
 
Data commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdfData commons and their role in fighting misinformation.pdf
Data commons and their role in fighting misinformation.pdf
 
Pie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on TwitterPie chart or pizza: identifying chart types and their virality on Twitter
Pie chart or pizza: identifying chart types and their virality on Twitter
 
Qrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart citiesQrowd and the city: designing people-centric smart cities
Qrowd and the city: designing people-centric smart cities
 
Qrowd and the city
Qrowd and the cityQrowd and the city
Qrowd and the city
 
Inclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approachInclusive cities: a crowdsourcing approach
Inclusive cities: a crowdsourcing approach
 
Loops of humans and bots in Wikidata
Loops of humans and bots in WikidataLoops of humans and bots in Wikidata
Loops of humans and bots in Wikidata
 
Making transport smarter, leveraging the human factor
Making transport smarter, leveraging the human factorMaking transport smarter, leveraging the human factor
Making transport smarter, leveraging the human factor
 
Data storytelling
Data storytelling Data storytelling
Data storytelling
 
Quality and collaboration in Wikidata
Quality and collaboration in WikidataQuality and collaboration in Wikidata
Quality and collaboration in Wikidata
 
The Data Pitch call
The Data Pitch callThe Data Pitch call
The Data Pitch call
 
The business of open data
The business of open dataThe business of open data
The business of open data
 
Open data – are we done
Open data – are we doneOpen data – are we done
Open data – are we done
 

Último

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 

Último (20)

Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 

One does not simply crowdsource the Semantic Web: 10 years with people, URIs, graphs and incentives

  • 1. ONE DOES NOT SIMPLY CROWDSOURCE THE SEMANTIC WEB: 10 YEARS WITH PEOPLE, URIS, GRAPHS AND INCENTIVES Elena Simperl Sematics 2018, Vienna
  • 3. AUGMENTED INTELLIGENCE: CROWDSOURCING CREATES LABELLED DATA FOR ML ALGORITHMS
  • 4. CROWDSOURCING AND THE SEMANTIC WEB Semantic applications developers use crowdsourcing to achieve a goal The Semantic Web is a giant crowdsourcing project
  • 5. THE DESIGN SPACE OF A CROWDSOURCING PROJECT What Goal Who Staffing How Process Why Incentives
  • 7. Tasks based on human skills, not easily replicable by machines Editing knowledge graphs Adding semantic annotations to media Adding multilingual labels to entities Defining links between entities …
  • 8. Most effective when used at scale (‘open call’), in combination w/ machine intelligence
  • 10. Acosta, M., Zaveri, A., Simperl, E., Kontokostas, D., Flöck, F., & Lehmann, J. (2016). Detecting Linked Data quality issues via crowdsourcing: A DBpedia study. Semantic Web Journal, 1-34. There is more to crowdsourcing than Mechnical Turk Use the right crowd for the right task
  • 11. BACKGROUND Varying quality of Linked Data sources Detecting and correcting errors may require manual inspection dbpedia:Dave_Dobbyn dbprop:dateOfBirth “3”.
  • 12. Contest LD Experts Difficult task Final prize Find Verify Microtasks Workers Easy task Micropayments TripleCheckMate MTurk Incorrect object Incorrect data type Incorrect outlink Object values Data types Interlinks Linked Data experts 0.7151 0.8270 0.1525 MTurk (majority voting) 0.8977 0.4752 0.9412 Results: Precision Approach MTurk interfaces Findings Use the right crowd for the right task Experts detect a range of issues, but will not invest additional effort Turkers can carry out the three tasks and are exceptionally good at data comparisons
  • 13. Diverse crowds are better Piscopo, A., Phethean, C., & Simperl, E. (2017). What Makes a Good Collaborative Knowledge Graph: Group Composition and Quality in Wikidata. International Conference on Social Informatics, 305-322, Springer.
  • 14. BACKGROUND Items and statements in Wikidata are edited by teams of editors Editors have varied tenure and interests Group composition impacts outcomes Diversity can have multiple effects Moderate tenure diversity increases outcome quality Interest diversity leads to increased group productivity Chen, J., Ren, Y., Riedl, J.: The effects of diversity on group productivity and member withdrawal in online volunteer groups. In: Proceedings of the 28th international conference on human factors in computing systems - CHI ’10. p. 821. ACM Press, New York, USA (2010)
  • 15. STUDY Analysed the edit history of items Corpus of 5k items, whose quality has been manually assessed (5 levels)* Edit history focused on community make-up Community is defined as set of editors of item Considered features from group diversity literature and Wikidata-specific aspects *https://www.wikidata.org/wiki/Wikidata:Item_quality
  • 16. HYPOTHESES Activity Outcome H1 Bots edits Item quality H2 Bot-human interaction Item quality H3 Anonymous edits Item quality H4 Tenure diversity Item quality H5 Interest diversity Item quality
  • 18. SUMMARY AND IMPLICATIONS The more is not always the merrier 01 Bot edits are key for quality, but bots and humans are better 02 Registered editors have a positive impact Diversity matters 04 Encourage registration 01 Identify further areas for bot editing 02 Design effective human-bot workflows 03 Suggest items to edit based on tenure and interests 04 03
  • 20. Bu, Q., Simperl, E., Zerr, S., & Li, Y. (2016). Using microtasks to crowdsource DBpedia entity classification: A study in workflow design. Semantic Web Journal, 1-18. There are different ways to carry out a task using crowdsourcing They will produce different results
  • 21. THREE WORKFLOWS TO CROWDSOURCE ENTITY TYPING Free associations Validating the machine Exploring the DBpedia ontology Findings  Shortlists are easy & fast  Popular classes are not enough  Alternative ways to explore the taxonomy  Freedom comes with a price  Unclassified entities might be unclassifiable  Different human data interfaces  Working at the basic level of abstraction achieves greatest precision  But when given the freedom to choose, users suggest more specific classes 4.58M things
  • 22. Kaffee, L. A., Elsahar, H., Vougiouklis, P., Gravier, C., Laforest, F., Hare, J., & Simperl, E. (2018). Mind the (Language) Gap: Generation of Multilingual Wikipedia Summaries from Wikidata for ArticlePlaceholders. In European Semantic Web Conference (pp. 319-334). Springer. Crowds need human-readable interfaces to KGs
  • 23. BACKGROUND Wikipedia is available in 287 languages, but content is unevenly distributed Wikidata is cross-lingual ArticlePlaceholders display Wikidata triples as stubs for articles in underserved Wikipedia’s Currently deployed in 11 Wikipedia’s
  • 24. STUDY Enrich ArticlePlaceholders with textual summaries generated from Wikidata triples Train a neural network to generate one sentence summaries resembling the opening paragraph of a Wikipedia article Test the approach on two languages, Esperanto and Arabic with readers and editors of those Wikipedia’s
  • 25. APPROACH NEURAL NETWORK TRAINED ON WIKIDATA/WIKIPEDIA Feed-forward architecture encodes triples from the ArticlePlaceholder into vector of fixed dimensionality RNN-based decoder generates text summaries, one token at a time Optimisations for different entity verbalisations, rare entities etc.
  • 26. AUTOMATIC EVALUATION APPROACH OUTPERFORMS BASELINES Trained on corpus of Wikipedia sentences and corresponding Wikidata triples (205k Arabic; 102k Esperanto) Tested against three baselines: machine translation (MT) and template retrieval (TR, TRext) Using standard metrics: BLEU, METEOR, ROUGEL The image part with relationship ID rId3 was not found in the file.
  • 27. USER STUDIES SUMMARIES ARE USEFUL FOR THE COMMUNITY Readers study, 15 days, mixed corpus of 60 articles Editors study, 15 days, 30 summaries The image part with relationship ID rId4 was not found in the file.
  • 29. 29 THEORY OF MOTIVATION People do things for three reasons Love and glory keep costs down Money and glory deliver faster LOVE MONEY GLORY
  • 30. 30 PAID MICROTASKS More money makes the crowd work faster* How about love and glory? *[Mason &Watts, 2009]
  • 31. 31 EXPERIMENT 1 Make paid microtasks more cost-effective w/ gamification Workers will perform better if tasks are more engaging  Increased accuracy through higher inter-annotator agreement  Cost savings through reduced unit costs Micro-targeting incentives when players attempt to quit improves retention
  • 32. MICROTASK DESIGN Image labelling tasks, published on microtask platform  Free-text labels, varying numbers of labels per image, taboo words  Workers can skip images, play as much as they want Baseline: ‘standard’ tasks w/ basic spam control vs Gamified: same requirements & rewards, but crowd asked to complete tasks in Wordsmith vs Gamified & furtherance incentives: additional rewards to stay (random, personalised) 32
  • 33. 33 EVALUATION ESP data set as gold standard #labels, agreement, mean & max #labels/worker Three tasks  Nano: 1 image  Micro: 11 images  Small: up to 2000 images Probabilistic reasoning to predict worker exit and personalize furtherance incentives
  • 34. RESULTS (GAMIFICATION, 1 IMAGE) BETTER, CHEAPER, BUT FEWER WORKERS 34 Metric CrowdFlower Wordsmith Total workers 600 423 Total keywords 1,200 41,206 Unique keywords 111 5,708 Avg. agreement 5.72% 37.7% Avg. images/person 1 32 Max images/person 1 200
  • 35. RESULTS (GAMIFICATION, 11 IMAGES) COMPARABLE QUALITY, HIGHER UNIT COSTS, FEWER DROPOUTS 35 Metric CrowdFlower Wordsmith Total workers 600 514 Total keywords 13,200 35,890 Unique keywords 1,323 4,091 Avg. agreement 6.32% 10.9% Avg. images/person 11 27 Max images/person 1 351
  • 36. RESULTS (WITH FURTHERANCE INCENTIVES) MORE ENGAGEMENT, TARGETING WORKS Increased participation  People come back (20 times) and play longer (43 hours vs 3 hours without incentives)  Financial incentives play important role Targeted incentives work  77% players stayed vs. 27% in the randomised condition  19% more labels compared to no incentives condition 36
  • 37. 37 EXPERIMENT 2 Make paid microtasks more cost-effective w/ social incentives Working in pairs is more effective than the baseline  Increased higher inter-annotator agreement  Higher output Social incentives improve retention past payment threshold
  • 38. MICROTASK DESIGN Image labelling tasks published on microtask platform  Free-text labels, varying numbers of labels per image, taboo words Baseline: ‘standard’ tasks w/ basic spam control vs Pairs: Wordsmith-based, randomly formed pairs, people join and leave all the time, in time more partner switches vs Pairs & social incentives: let’s play vs please stay offered to worker when we expect their partner to leave 38
  • 39. INCENTIVES 39 No global leaderboard Empathic social pressure: stay (and help your partner get paid) Social flow: keep playing and having fun together
  • 40. 40 EVALUATION ESP data set as gold standard Evaluated #labels, agreement, avg/max #labels/worker Two tasks  Low threshold: 1 image  High threshold: 11 images Probabilistic reasoning to predict worker exit* and offer social incentive * [Kobren et al, 2015] extended w/ utility features
  • 41. RESULTS (COLLABORATION) BETTER, CHEAPER, FEWER WORKERS, ADDS COMPLEXITY 41
  • 42. RESULTS (SOCIAL INCENTIVES) IMPROVED RETENTION, PLEASE STAY MORE EFFECTIVE 42
  • 43. 43 SUMMARY OF FINDINGS Social incentives generate more tags and improve retention Social dynamics: different responses if partner has been paid or not  Paid worker 76% more likely to stay after social pressure, unpaid worker: 95% more likely to stay  Paid workers annotate more if they decide to stay than unpaid workers Social flow more effective than social pressure in generating more tags: 99% of unpaid workers are likely to stay Social pressure works more often overall
  • 44. ONE DOES NOT SIMPLY CROWDSOURCE THE SEMANTIC WEB
  • 45. 45 CONCLUSIONS With AI and ML on the rise, crowdsourcing is a critical for any Semantic Web developer Explore the what, who, how, why design space Use the full range of approaches and techniques to scale to large datasets