SlideShare una empresa de Scribd logo
1 de 29
Text, Topics, and Turkers. Hypertext 2015 1
Text, Topics, and Turkers:
A Consensus Measure for Statistical Topics
Fred Morstatter†, Jürgen Pfeffer‡,
Katja Mayer*, Huan Liu†
†Arizona State University
Tempe, Arizona, USA
‡Carnegie Mellon University
Pittsburgh, Pennsylvania, USA
*University of Vienna
Vienna, Austria
Text, Topics, and Turkers. Hypertext 2015 2
Text
• Text is everywhere in research.
• Text is huge:
• Too much data to read.
• How can we understand what is going on in
big text data?
Source Size
Wikipedia 36 million pages
World Wide Web 100+ billion static web pages
Social Media 500 million new tweets/day
Text, Topics, and Turkers. Hypertext 2015 3
Topics
• Topic Modeling
• Latent Dirichlet Allocation (LDA)
– Most commonly-used topic modeling algorithm
– Discovers “topics” within a corpus
Corpus
LDA
K
Topic ID Words
Topic 1 cat, dog, horse, ...
Topic 2 ball, field, player, ...
... ...
Topic K red, green, blue, ...
Topic 1 Topic 2 ... Topic K
Document1 0.2 0.1 0.01
Document2 0.7 0.02 0.1
...
Documentn 0.1 0.3 0.01
Text, Topics, and Turkers. Hypertext 2015 4
Topics
LDA
K = 10
Topic ID Words
Topic 1 river, lake, island, mountain, area, park, antarctic, south, mountains, dam
Topic 2 relay, athletics, metres, freestyle, hurdles, ret, divisão, athletes, bundesliga,
medals
... ...
Topic 10 courcelles, centimeters, mattythewhite, wine, stamps, oko, perennial, stubs,
ovate, greyish
Topic 1 Topic 2
...
Topic 10
Document1 0.2 0.1 0.01
Document2 0.7 0.02 0.1
...
Documentn 0.1 0.3 0.01
Text, Topics, and Turkers. Hypertext 2015 5
Topics
• How can we measure the quality of statistical
topics?
• We don’t know how well humans can
interpret topics.
• Problem: Does their understanding match
what is going on in the corpus?
Text, Topics, and Turkers. Hypertext 2015 6
Turkers
• One Solution: Crowdsourcing
• Example: Amazon’s Mechanical Turk
– Show LDA results to Turkers
– Gauge their understanding
– How to effectively measure understanding?
Text, Topics, and Turkers. Hypertext 2015 7
Turkers
• Previous Work: Chang et. al 2009
– “Word Intrusion”
– “Topic Intrusion”
Corpus
LDA
K
Topic ID Words
Topic 1 cat, dog, horse, ...
Topic 2 ball, field, player, ...
... ...
Topic K red, green, blue, ...
Topic 1 Topic 2 ... Topic K
Document1 0.2 0.1 0.01
Document2 0.7 0.02 0.1
...
Documentn 0.1 0.3 0.01
“Word Intrusion”
“Topic Intrusion”
Text, Topics, and Turkers. Hypertext 2015 8
Word Intrusion
• Show the Turker 6 words in random order
– Top 5 words from topic
– 1 “Intruded” word
– Ask Turker to choose “Intruded” word
cat dog bird truck horse snake
Topic i:
[Chang et. al 2009]
Text, Topics, and Turkers. Hypertext 2015 9
Topic Intrusion
• Show the Turker a document
• Show the Turker 4 topics
– 3 most probable topics
– 1 “Intruded” topic
– Ask Turker to choose “Intruded” Topic
Documenti
Topic A Topic B Topic C Topic D
[Chang et. al 2009]
Text, Topics, and Turkers. Hypertext 2015 10
New Measure: Topic Consensus
Corpus
LDA
K
Topic ID Words
Topic 1 cat, dog, horse, ...
Topic 2 ball, field, player, ...
... ...
Topic K red, green, blue, ...
Topic 1 Topic 2 ... Topic K
Document1 0.2 0.1 0.01
Document2 0.7 0.02 0.1
...
Documentn 0.1 0.3 0.01
“Word Intrusion”
“Topic Intrusion”
• Complements existing framework
• Measures topic quality with corpus.
“Topic Consensus”
Text, Topics, and Turkers. Hypertext 2015 11
Topic Consensus: Intuition
• Measures the agreement between topics and
“sections” they come from.
LDA Distribution Turker Distribution
Text, Topics, and Turkers. Hypertext 2015 12
Topic Consensus: Calculation
• We are comparing probability distributions.
• Jensen-Shannon Divergence.
Turker Distribution LDA Distribution
Text, Topics, and Turkers. Hypertext 2015 13
Dataset
• Scientific Abstracts
• All available abstracts
since 2007.
• Classified into three areas:
– Social Sciences & Humanities (SH)
– Life Sciences (LS)
– Physical Sciences (PE)
• Ran LDA on this dataset:
– K = [10, 25, 50, 100]
– 185 topics; 4 topic sets.
Text, Topics, and Turkers. Hypertext 2015 14
Turkers
• One task:
• Turkers have 3 + 1 options.
• Each task solved 8 times.
Text, Topics, and Turkers. Hypertext 2015 15
Results
Topic Set
ERC-10
ERC-25
ERC-50
ERC-100
new, group, results, plan, class, ...
selection, variation, population,
genetic, natural, ...
Text, Topics, and Turkers. Hypertext 2015 16
Other Topic Sets
• LDA Topics
– Use New York Times dataset from one day.
25 topics, 1 topic set
• Hand-Picked Topics
– Pure “Social Science & Humanities”
• Sampled words that occur only in these documents.
11 topics, 1 topic set
– Random Topics
• Randomly choose topics according to word distribution
of corpus.
25 topics, 1 topic set
Text, Topics, and Turkers. Hypertext 2015 17
Results
Topic Set
ERC-10
ERC-25
ERC-50
ERC-100
NYT-25
RAND-25
SH-25
Text, Topics, and Turkers. Hypertext 2015 18
Overview of the Process
• Topic Consensus can reveal new information
about the topics being studied.
– Can measure topics from a new perspective.
– Can help reveal topic confusion.
• Drawbacks:
– Expensive
– Time Consuming
– Scalability
Text, Topics, and Turkers. Hypertext 2015 19
Automated Measures
1. Topic Size: Number of tokens assigned to the
topic.
2. Topic Coherence: Probability that the top
words co-occur in documents in the corpus.
3. Topic Coherence Significance: Significance of
Topic Coherence compared to other topics.
4. Normalized Pointwise Mutual Information:
Measures the association between the top
words in the topics.
Text, Topics, and Turkers. Hypertext 2015 20
Measures
• Herfindahl-Hirschman Index (HHI)
– Measures concentration of a market.
– Used to find monopolies.
– Viewed from two perspectives:
Word Probability HHI5. 6.
Social Sciences Physical Sciences Life Sciences
ERC Section HHI
Text, Topics, and Turkers. Hypertext 2015 21
Results - Correlation
Automated Measure Correlation
Topic Size -0.532
Topic Coherence -0.584
Topic Coherence Significance -0.788
Normalized Pointwise
Mutual Information
-0.774
HHI (Word Probability) -0.885
HHI (ERC Section) -0.478
Text, Topics, and Turkers. Hypertext 2015 22
Results - Prediction
• Build classifier to predict actual Topic
Consensus value.
• Build linear regression model:
– Takes automated measures.
– Predicts Topic Consensus.
• RMSE: 0.12 ± 0.02.
Text, Topics, and Turkers. Hypertext 2015 23
Acknowledgements
• Members of the DMML lab
• Office of Naval Research through grant
N000141410095
• LexisNexis and HPCC Systems
Text, Topics, and Turkers. Hypertext 2015 24
Conclusion
• Introduced a new method for evaluating the
interpretability of statistical topics.
• Demonstrated this measure on a real-world
dataset.
• Automated this measure for scalability.
Text, Topics, and Turkers. Hypertext 2015 25
Future Work
• How sensitive are measures to top words?
– Word Intrusion uses 5
– Topic Intrusion uses 5
– Topic Consensus uses 25
• How do measures fare on different datasets?
• Other measures that can reveal quality topics?
Text, Topics, and Turkers. Hypertext 2015 26
Auxiliary Slides
Text, Topics, and Turkers. Hypertext 2015 27
User Demographics
Sex Education Age
First Language Country of Origin
Text, Topics, and Turkers. Hypertext 2015 28
Results – Confusion Matrix
Text, Topics, and Turkers. Hypertext 2015 29
Dataset Statistics

Más contenido relacionado

La actualidad más candente

NIPS 2016 Highlights - Sebastian Ruder
NIPS 2016 Highlights - Sebastian RuderNIPS 2016 Highlights - Sebastian Ruder
NIPS 2016 Highlights - Sebastian RuderSebastian Ruder
 
Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...Lifeng (Aaron) Han
 
Blendle: Diverse recommendations from a vast archive
Blendle: Diverse recommendations from a vast archiveBlendle: Diverse recommendations from a vast archive
Blendle: Diverse recommendations from a vast archiveJasper Oosterman
 
Text Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowText Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowTony Russell-Rose
 

La actualidad más candente (6)

BDACA - Lecture4
BDACA - Lecture4BDACA - Lecture4
BDACA - Lecture4
 
NIPS 2016 Highlights - Sebastian Ruder
NIPS 2016 Highlights - Sebastian RuderNIPS 2016 Highlights - Sebastian Ruder
NIPS 2016 Highlights - Sebastian Ruder
 
Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...
 
Blendle: Diverse recommendations from a vast archive
Blendle: Diverse recommendations from a vast archiveBlendle: Diverse recommendations from a vast archive
Blendle: Diverse recommendations from a vast archive
 
Text Mining
Text MiningText Mining
Text Mining
 
Text Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and TomorrowText Analytics: Yesterday, Today and Tomorrow
Text Analytics: Yesterday, Today and Tomorrow
 

Similar a Text, Topics, and Turkers: A Consensus Measure for Statistical Topics

Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval GESIS
 
HyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive ComputingHyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive ComputingJack Park
 
6_Big Data Sources part3-Day 3_A_text_mining.pptx
6_Big Data Sources part3-Day 3_A_text_mining.pptx6_Big Data Sources part3-Day 3_A_text_mining.pptx
6_Big Data Sources part3-Day 3_A_text_mining.pptxShowravDuttaAnkur
 
Intro 2 document
Intro 2 documentIntro 2 document
Intro 2 documentUma Kant
 
A Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic ModellingA Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic Modellingcsandit
 
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLINGA TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLINGcscpconf
 
THe HathiTrust Research Center: Digital Humanities at Scale
THe HathiTrust Research Center: Digital Humanities at ScaleTHe HathiTrust Research Center: Digital Humanities at Scale
THe HathiTrust Research Center: Digital Humanities at ScaleRobert H. McDonald
 
4 text mining and open ended questions in sample surveys ludovic lebart cnrs
4 text mining and open ended questions in sample surveys ludovic lebart cnrs4 text mining and open ended questions in sample surveys ludovic lebart cnrs
4 text mining and open ended questions in sample surveys ludovic lebart cnrsEvelyn Femat
 
Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Deep Kayal
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Roi Blanco
 
Semi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesSemi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesElsevier
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Data Works MD
 
Introduction to nlp
Introduction to nlpIntroduction to nlp
Introduction to nlpAmaan Shaikh
 
The aggregator database cornerstone or annex
The aggregator database cornerstone or annexThe aggregator database cornerstone or annex
The aggregator database cornerstone or annexNASIG
 
Natural_Language_Processing_1.ppt
Natural_Language_Processing_1.pptNatural_Language_Processing_1.ppt
Natural_Language_Processing_1.ppttestbest6
 
Arc 323 human studies in architecture fall 2018 lecture 3-literature review
Arc 323 human studies in architecture fall 2018 lecture 3-literature reviewArc 323 human studies in architecture fall 2018 lecture 3-literature review
Arc 323 human studies in architecture fall 2018 lecture 3-literature reviewGalala University
 
Frontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text AnalysisFrontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text AnalysisJonathan Stray
 

Similar a Text, Topics, and Turkers: A Consensus Measure for Statistical Topics (20)

Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval Intra- and interdisciplinary cross-concordances for information retrieval
Intra- and interdisciplinary cross-concordances for information retrieval
 
HyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive ComputingHyperMembrane Structures for Open Source Cognitive Computing
HyperMembrane Structures for Open Source Cognitive Computing
 
6_Big Data Sources part3-Day 3_A_text_mining.pptx
6_Big Data Sources part3-Day 3_A_text_mining.pptx6_Big Data Sources part3-Day 3_A_text_mining.pptx
6_Big Data Sources part3-Day 3_A_text_mining.pptx
 
Intro 2 document
Intro 2 documentIntro 2 document
Intro 2 document
 
A Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic ModellingA Text Mining Research Based on LDA Topic Modelling
A Text Mining Research Based on LDA Topic Modelling
 
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLINGA TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
A TEXT MINING RESEARCH BASED ON LDA TOPIC MODELLING
 
THe HathiTrust Research Center: Digital Humanities at Scale
THe HathiTrust Research Center: Digital Humanities at ScaleTHe HathiTrust Research Center: Digital Humanities at Scale
THe HathiTrust Research Center: Digital Humanities at Scale
 
4 text mining and open ended questions in sample surveys ludovic lebart cnrs
4 text mining and open ended questions in sample surveys ludovic lebart cnrs4 text mining and open ended questions in sample surveys ludovic lebart cnrs
4 text mining and open ended questions in sample surveys ludovic lebart cnrs
 
Topic Pages. From articles to answers.
Topic Pages. From articles to answers.Topic Pages. From articles to answers.
Topic Pages. From articles to answers.
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations
 
Semi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific TablesSemi-automated Exploration and Extraction of Data in Scientific Tables
Semi-automated Exploration and Extraction of Data in Scientific Tables
 
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
Exploring Correlation Between Sentiment of Environmental Tweets and the Stock...
 
Some Information Retrieval Models and Our Experiments for TREC KBA
Some Information Retrieval Models and Our Experiments for TREC KBASome Information Retrieval Models and Our Experiments for TREC KBA
Some Information Retrieval Models and Our Experiments for TREC KBA
 
Introduction to nlp
Introduction to nlpIntroduction to nlp
Introduction to nlp
 
The aggregator database cornerstone or annex
The aggregator database cornerstone or annexThe aggregator database cornerstone or annex
The aggregator database cornerstone or annex
 
Natural_Language_Processing_1.ppt
Natural_Language_Processing_1.pptNatural_Language_Processing_1.ppt
Natural_Language_Processing_1.ppt
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Paul Groth
Paul GrothPaul Groth
Paul Groth
 
Arc 323 human studies in architecture fall 2018 lecture 3-literature review
Arc 323 human studies in architecture fall 2018 lecture 3-literature reviewArc 323 human studies in architecture fall 2018 lecture 3-literature review
Arc 323 human studies in architecture fall 2018 lecture 3-literature review
 
Frontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text AnalysisFrontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text Analysis
 

Último

When-technology-and-Humanity-Cross-1.pptx
When-technology-and-Humanity-Cross-1.pptxWhen-technology-and-Humanity-Cross-1.pptx
When-technology-and-Humanity-Cross-1.pptxReaper61
 
Unveiling SOCIO COSMOS: Where Socializing Meets the Stars
Unveiling SOCIO COSMOS: Where Socializing Meets the StarsUnveiling SOCIO COSMOS: Where Socializing Meets the Stars
Unveiling SOCIO COSMOS: Where Socializing Meets the StarsSocioCosmos
 
办理伯明翰大学毕业证书文凭学位证书
办理伯明翰大学毕业证书文凭学位证书办理伯明翰大学毕业证书文凭学位证书
办理伯明翰大学毕业证书文凭学位证书saphesg8
 
Mastering Wealth with YouTube Content Marketing.pdf
Mastering Wealth with YouTube Content Marketing.pdfMastering Wealth with YouTube Content Marketing.pdf
Mastering Wealth with YouTube Content Marketing.pdfTirupati Social Media
 
Cosmic Conversations with Sociocosmos...
Cosmic Conversations with Sociocosmos...Cosmic Conversations with Sociocosmos...
Cosmic Conversations with Sociocosmos...SocioCosmos
 
YouScan Company Overview - Social Media Listening with Visual Insights.pdf
YouScan Company Overview - Social Media Listening with Visual Insights.pdfYouScan Company Overview - Social Media Listening with Visual Insights.pdf
YouScan Company Overview - Social Media Listening with Visual Insights.pdfAlexander Sirach
 
Music Video Codes and Conventions 2 .pptx
Music Video Codes and Conventions 2 .pptxMusic Video Codes and Conventions 2 .pptx
Music Video Codes and Conventions 2 .pptxjenrobinson12
 
Amplify Your Brand with Our Tailored Social Media Marketing Services
Amplify Your Brand with Our Tailored Social Media Marketing ServicesAmplify Your Brand with Our Tailored Social Media Marketing Services
Amplify Your Brand with Our Tailored Social Media Marketing ServicesNetqom Solutions
 
AI Virtual Influencers: The Future of Influencer Marketing
AI Virtual Influencers:  The Future of Influencer MarketingAI Virtual Influencers:  The Future of Influencer Marketing
AI Virtual Influencers: The Future of Influencer MarketingCut-the-SaaS
 
O9654467111 Call Girls In Shahdara Women Seeking Men
O9654467111 Call Girls In Shahdara Women Seeking MenO9654467111 Call Girls In Shahdara Women Seeking Men
O9654467111 Call Girls In Shahdara Women Seeking MenSapana Sha
 
social media advantages and disadvantages
social media advantages and disadvantagessocial media advantages and disadvantages
social media advantages and disadvantagesmehwishkhan1018786
 
Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...
Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...
Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...jicagig173
 
定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一
定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一
定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一ra6e69ou
 
Protecting Your Little Explorer at Home!
Protecting Your Little Explorer at Home!Protecting Your Little Explorer at Home!
Protecting Your Little Explorer at Home!andrekr997
 
fraud storyboards powerpoint media project
fraud storyboards powerpoint media projectfraud storyboards powerpoint media project
fraud storyboards powerpoint media project17mos052
 

Último (20)

When-technology-and-Humanity-Cross-1.pptx
When-technology-and-Humanity-Cross-1.pptxWhen-technology-and-Humanity-Cross-1.pptx
When-technology-and-Humanity-Cross-1.pptx
 
young call girls in Greater Noida 🔝 9953056974 🔝 Delhi escort Service
young call girls in  Greater Noida 🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in  Greater Noida 🔝 9953056974 🔝 Delhi escort Service
young call girls in Greater Noida 🔝 9953056974 🔝 Delhi escort Service
 
Unveiling SOCIO COSMOS: Where Socializing Meets the Stars
Unveiling SOCIO COSMOS: Where Socializing Meets the StarsUnveiling SOCIO COSMOS: Where Socializing Meets the Stars
Unveiling SOCIO COSMOS: Where Socializing Meets the Stars
 
办理伯明翰大学毕业证书文凭学位证书
办理伯明翰大学毕业证书文凭学位证书办理伯明翰大学毕业证书文凭学位证书
办理伯明翰大学毕业证书文凭学位证书
 
looking for escort 9953056974 Low Rate Call Girls In Vinod Nagar
looking for escort 9953056974 Low Rate Call Girls In  Vinod Nagarlooking for escort 9953056974 Low Rate Call Girls In  Vinod Nagar
looking for escort 9953056974 Low Rate Call Girls In Vinod Nagar
 
Mastering Wealth with YouTube Content Marketing.pdf
Mastering Wealth with YouTube Content Marketing.pdfMastering Wealth with YouTube Content Marketing.pdf
Mastering Wealth with YouTube Content Marketing.pdf
 
Cosmic Conversations with Sociocosmos...
Cosmic Conversations with Sociocosmos...Cosmic Conversations with Sociocosmos...
Cosmic Conversations with Sociocosmos...
 
YouScan Company Overview - Social Media Listening with Visual Insights.pdf
YouScan Company Overview - Social Media Listening with Visual Insights.pdfYouScan Company Overview - Social Media Listening with Visual Insights.pdf
YouScan Company Overview - Social Media Listening with Visual Insights.pdf
 
Music Video Codes and Conventions 2 .pptx
Music Video Codes and Conventions 2 .pptxMusic Video Codes and Conventions 2 .pptx
Music Video Codes and Conventions 2 .pptx
 
Amplify Your Brand with Our Tailored Social Media Marketing Services
Amplify Your Brand with Our Tailored Social Media Marketing ServicesAmplify Your Brand with Our Tailored Social Media Marketing Services
Amplify Your Brand with Our Tailored Social Media Marketing Services
 
AI Virtual Influencers: The Future of Influencer Marketing
AI Virtual Influencers:  The Future of Influencer MarketingAI Virtual Influencers:  The Future of Influencer Marketing
AI Virtual Influencers: The Future of Influencer Marketing
 
O9654467111 Call Girls In Shahdara Women Seeking Men
O9654467111 Call Girls In Shahdara Women Seeking MenO9654467111 Call Girls In Shahdara Women Seeking Men
O9654467111 Call Girls In Shahdara Women Seeking Men
 
social media advantages and disadvantages
social media advantages and disadvantagessocial media advantages and disadvantages
social media advantages and disadvantages
 
Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...
Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...
Models Call Girls Shettihalli - 7001305949 Escorts Service 50% Off with Cash ...
 
Hot Sexy call girls in Ramesh Nagar🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Ramesh Nagar🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Ramesh Nagar🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Ramesh Nagar🔝 9953056974 🔝 Delhi escort Service
 
Enjoy ➥8448380779▻ Call Girls In Noida Sector 93 Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Noida Sector 93 Escorts Delhi NCREnjoy ➥8448380779▻ Call Girls In Noida Sector 93 Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Noida Sector 93 Escorts Delhi NCR
 
定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一
定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一
定制(ENU毕业证书)英国爱丁堡龙比亚大学毕业证成绩单原版一比一
 
FULL ENJOY Call Girls In Mohammadpur (Delhi) Call Us 9953056974
FULL ENJOY Call Girls In Mohammadpur  (Delhi) Call Us 9953056974FULL ENJOY Call Girls In Mohammadpur  (Delhi) Call Us 9953056974
FULL ENJOY Call Girls In Mohammadpur (Delhi) Call Us 9953056974
 
Protecting Your Little Explorer at Home!
Protecting Your Little Explorer at Home!Protecting Your Little Explorer at Home!
Protecting Your Little Explorer at Home!
 
fraud storyboards powerpoint media project
fraud storyboards powerpoint media projectfraud storyboards powerpoint media project
fraud storyboards powerpoint media project
 

Text, Topics, and Turkers: A Consensus Measure for Statistical Topics

  • 1. Text, Topics, and Turkers. Hypertext 2015 1 Text, Topics, and Turkers: A Consensus Measure for Statistical Topics Fred Morstatter†, Jürgen Pfeffer‡, Katja Mayer*, Huan Liu† †Arizona State University Tempe, Arizona, USA ‡Carnegie Mellon University Pittsburgh, Pennsylvania, USA *University of Vienna Vienna, Austria
  • 2. Text, Topics, and Turkers. Hypertext 2015 2 Text • Text is everywhere in research. • Text is huge: • Too much data to read. • How can we understand what is going on in big text data? Source Size Wikipedia 36 million pages World Wide Web 100+ billion static web pages Social Media 500 million new tweets/day
  • 3. Text, Topics, and Turkers. Hypertext 2015 3 Topics • Topic Modeling • Latent Dirichlet Allocation (LDA) – Most commonly-used topic modeling algorithm – Discovers “topics” within a corpus Corpus LDA K Topic ID Words Topic 1 cat, dog, horse, ... Topic 2 ball, field, player, ... ... ... Topic K red, green, blue, ... Topic 1 Topic 2 ... Topic K Document1 0.2 0.1 0.01 Document2 0.7 0.02 0.1 ... Documentn 0.1 0.3 0.01
  • 4. Text, Topics, and Turkers. Hypertext 2015 4 Topics LDA K = 10 Topic ID Words Topic 1 river, lake, island, mountain, area, park, antarctic, south, mountains, dam Topic 2 relay, athletics, metres, freestyle, hurdles, ret, divisão, athletes, bundesliga, medals ... ... Topic 10 courcelles, centimeters, mattythewhite, wine, stamps, oko, perennial, stubs, ovate, greyish Topic 1 Topic 2 ... Topic 10 Document1 0.2 0.1 0.01 Document2 0.7 0.02 0.1 ... Documentn 0.1 0.3 0.01
  • 5. Text, Topics, and Turkers. Hypertext 2015 5 Topics • How can we measure the quality of statistical topics? • We don’t know how well humans can interpret topics. • Problem: Does their understanding match what is going on in the corpus?
  • 6. Text, Topics, and Turkers. Hypertext 2015 6 Turkers • One Solution: Crowdsourcing • Example: Amazon’s Mechanical Turk – Show LDA results to Turkers – Gauge their understanding – How to effectively measure understanding?
  • 7. Text, Topics, and Turkers. Hypertext 2015 7 Turkers • Previous Work: Chang et. al 2009 – “Word Intrusion” – “Topic Intrusion” Corpus LDA K Topic ID Words Topic 1 cat, dog, horse, ... Topic 2 ball, field, player, ... ... ... Topic K red, green, blue, ... Topic 1 Topic 2 ... Topic K Document1 0.2 0.1 0.01 Document2 0.7 0.02 0.1 ... Documentn 0.1 0.3 0.01 “Word Intrusion” “Topic Intrusion”
  • 8. Text, Topics, and Turkers. Hypertext 2015 8 Word Intrusion • Show the Turker 6 words in random order – Top 5 words from topic – 1 “Intruded” word – Ask Turker to choose “Intruded” word cat dog bird truck horse snake Topic i: [Chang et. al 2009]
  • 9. Text, Topics, and Turkers. Hypertext 2015 9 Topic Intrusion • Show the Turker a document • Show the Turker 4 topics – 3 most probable topics – 1 “Intruded” topic – Ask Turker to choose “Intruded” Topic Documenti Topic A Topic B Topic C Topic D [Chang et. al 2009]
  • 10. Text, Topics, and Turkers. Hypertext 2015 10 New Measure: Topic Consensus Corpus LDA K Topic ID Words Topic 1 cat, dog, horse, ... Topic 2 ball, field, player, ... ... ... Topic K red, green, blue, ... Topic 1 Topic 2 ... Topic K Document1 0.2 0.1 0.01 Document2 0.7 0.02 0.1 ... Documentn 0.1 0.3 0.01 “Word Intrusion” “Topic Intrusion” • Complements existing framework • Measures topic quality with corpus. “Topic Consensus”
  • 11. Text, Topics, and Turkers. Hypertext 2015 11 Topic Consensus: Intuition • Measures the agreement between topics and “sections” they come from. LDA Distribution Turker Distribution
  • 12. Text, Topics, and Turkers. Hypertext 2015 12 Topic Consensus: Calculation • We are comparing probability distributions. • Jensen-Shannon Divergence. Turker Distribution LDA Distribution
  • 13. Text, Topics, and Turkers. Hypertext 2015 13 Dataset • Scientific Abstracts • All available abstracts since 2007. • Classified into three areas: – Social Sciences & Humanities (SH) – Life Sciences (LS) – Physical Sciences (PE) • Ran LDA on this dataset: – K = [10, 25, 50, 100] – 185 topics; 4 topic sets.
  • 14. Text, Topics, and Turkers. Hypertext 2015 14 Turkers • One task: • Turkers have 3 + 1 options. • Each task solved 8 times.
  • 15. Text, Topics, and Turkers. Hypertext 2015 15 Results Topic Set ERC-10 ERC-25 ERC-50 ERC-100 new, group, results, plan, class, ... selection, variation, population, genetic, natural, ...
  • 16. Text, Topics, and Turkers. Hypertext 2015 16 Other Topic Sets • LDA Topics – Use New York Times dataset from one day. 25 topics, 1 topic set • Hand-Picked Topics – Pure “Social Science & Humanities” • Sampled words that occur only in these documents. 11 topics, 1 topic set – Random Topics • Randomly choose topics according to word distribution of corpus. 25 topics, 1 topic set
  • 17. Text, Topics, and Turkers. Hypertext 2015 17 Results Topic Set ERC-10 ERC-25 ERC-50 ERC-100 NYT-25 RAND-25 SH-25
  • 18. Text, Topics, and Turkers. Hypertext 2015 18 Overview of the Process • Topic Consensus can reveal new information about the topics being studied. – Can measure topics from a new perspective. – Can help reveal topic confusion. • Drawbacks: – Expensive – Time Consuming – Scalability
  • 19. Text, Topics, and Turkers. Hypertext 2015 19 Automated Measures 1. Topic Size: Number of tokens assigned to the topic. 2. Topic Coherence: Probability that the top words co-occur in documents in the corpus. 3. Topic Coherence Significance: Significance of Topic Coherence compared to other topics. 4. Normalized Pointwise Mutual Information: Measures the association between the top words in the topics.
  • 20. Text, Topics, and Turkers. Hypertext 2015 20 Measures • Herfindahl-Hirschman Index (HHI) – Measures concentration of a market. – Used to find monopolies. – Viewed from two perspectives: Word Probability HHI5. 6. Social Sciences Physical Sciences Life Sciences ERC Section HHI
  • 21. Text, Topics, and Turkers. Hypertext 2015 21 Results - Correlation Automated Measure Correlation Topic Size -0.532 Topic Coherence -0.584 Topic Coherence Significance -0.788 Normalized Pointwise Mutual Information -0.774 HHI (Word Probability) -0.885 HHI (ERC Section) -0.478
  • 22. Text, Topics, and Turkers. Hypertext 2015 22 Results - Prediction • Build classifier to predict actual Topic Consensus value. • Build linear regression model: – Takes automated measures. – Predicts Topic Consensus. • RMSE: 0.12 ± 0.02.
  • 23. Text, Topics, and Turkers. Hypertext 2015 23 Acknowledgements • Members of the DMML lab • Office of Naval Research through grant N000141410095 • LexisNexis and HPCC Systems
  • 24. Text, Topics, and Turkers. Hypertext 2015 24 Conclusion • Introduced a new method for evaluating the interpretability of statistical topics. • Demonstrated this measure on a real-world dataset. • Automated this measure for scalability.
  • 25. Text, Topics, and Turkers. Hypertext 2015 25 Future Work • How sensitive are measures to top words? – Word Intrusion uses 5 – Topic Intrusion uses 5 – Topic Consensus uses 25 • How do measures fare on different datasets? • Other measures that can reveal quality topics?
  • 26. Text, Topics, and Turkers. Hypertext 2015 26 Auxiliary Slides
  • 27. Text, Topics, and Turkers. Hypertext 2015 27 User Demographics Sex Education Age First Language Country of Origin
  • 28. Text, Topics, and Turkers. Hypertext 2015 28 Results – Confusion Matrix
  • 29. Text, Topics, and Turkers. Hypertext 2015 29 Dataset Statistics

Notas del editor

  1. Topic modeling --- text summarization These algorithms are widely used for
  2. Why do I need to measure these topics? Finding quality topics Setting value of K in LDA Choosing the best topic model (LDA, ...)
  3. We need objective measures to evaluate the quality of topics.
  4. Each document gets a score. Can aggregate to get a sense of the model. This is a measure of the model, by looking at the document.
  5. The Previous measures are good. Specifically, we are looking at properties of the corpus.
  6. Sections can be like newspaper Blue is SPORTS Red is BUSINESS In reality, no topic is going to purely sports or business. Topics are mixtures over these sections. We want to know how humans can interpret these mixtures. Sections can be like Twitter Blue is protest Red is This slide just illustrates the process, I’ll get into more details later. This is a TC calculation for ONE TOPIC
  7. Topic Consensus is calculated as... K is Kullback-Leibler divergence; M is the middle of the distribution One side effect of using this measure is that lower scores indicate a better consensus.
  8. Mention undecidable
  9. If you want good topics you might choose 100...., If you want a good model you might choose 25.... The worst from TC are often “stopwords” topics Connection to Word Intrusion Are they really good topics?
  10. Each bar is a group of topics Bar in the middle is the median SH does the best ... This is good! Random does the worse ... This is also good! NYT does the worst ... Why?
  11. Is it possible to find a way to address all of these drawbacks? Explain the remainder of this paper here.
  12. These are methods used throughout the literature to measure topic quality, we repeat them here.
  13. Similar to Entropy
  14. Negative correlation makes sense. A high HHI means concentration, and a low Topic Consensus means a good score. Spearman’s Rho
  15. Other measures both in terms of automatic and crowdsourced.
  16. Can I provide an example? What do I want people to remember?
  17. LS: 0.36 PE: 0.45 SH: 0.19