SlideShare una empresa de Scribd logo
1 de 28
Descargar para leer sin conexión
End-to-end Plural Coreference
Resolution on TV Show Transcripts
Jose Coves
Advisor: Dr. Jinho D. Choi
Emory University, Department of Computer Science
Contents
■ Introduction
■ Coreference Resolution
■ End-to-end Coreference
■ Plural Coreference + metrics
■ Approaches
■ Results
Coreference Resolution
■ Coreference Resolution
– Find expressions that refer to the same entity
– Very important for higher-level NLP tasks
– Natural language understanding: QA, summarization, information extraction, etc.
– Unresolved fundamental NLP task
– Syntactical structures, speakers, sequential order, text comprehension
– Ambiguity: She told Monica she was smart. She told Joey she was smart.
– Entities: General (locations, objects, etc.) or specific (people).
– Mentions: Nested (The Wall of China). Plural (Mom and dad, they)
Corpus: Friends TV Show
■ Entities
– Known entities: Main characters. Eg. Joey is great.
– GENERIC: Characters whose identity is not revealed. Eg. I like the waitress.
– GENERAL: A class of people. Eg. The ideal girl doesn’t exist.
– OTHER: Identity unknown from local context. Eg. The guy next to me.
■ Annotation
– No nested plural entities (mom and dad).
– Plural mentions are not coreferent.
– Plural mentions added to coreferent entities’ clusters (they à mom, dad)
– GENERAL, OTHER à Singletons: One mention. Eg: I like women.
Neural Networks
■ FFNN – Feed output forward,
as input to next layer, without
forming a cycle
■ LSTM – Artificial RNN with
loops that allow information
to persist
■ CNN – Deep NN, extracts
most important features in
condensed form
End-to-end Coreference Resolution
■ Produce coreferent clusters by assigning antecedents to top span
■ No syntactic parser or mention-detector
■ Y(i) = {ε, 1, . . . , i−1}, for each top span i
■ Dummy Antecedent ε à Not a mention or not coreferent with any antecedent
■ Optimize marginal log-likelihood
■ Random initial prunning,
■ Only gold mentions get positive updates
End-to-end: Span representations
■ Max span width
■ Rank spans by mention score
■ Keep top K spans
End-to-end System
■ Element-wise similarity
■ Feature vector:
– speaker information
– distance factor
Higher-Order Coreference and Coarse-
to-Fine Inference
■ Higher-Order Coreference
– Expected antecedent
– Gate vector
– Update as weighted average
■ Coarse-to-Fine Inference
– Span ratio r, keep K = rT top spans, T is document length
– For each span, keep top C antecedents (Sc)
– Compute final coreference score S(i,j)
Plural Coreference Resolution
■ Uses gold mentions for test set predictions
■ Adds plural annotation to Friends corpus
■ Labeling for plural mentions
■ Clustering algorithm
■ Modify evaluation metrics
■ Feed all mention pairs into Agglomerative Convolutional Neural Network
ACNN classifier
Labeling and Clustering
■ For each span mj, look at each antecedent mi
■ L = S = Singular Antecedent: mi is singular. Assign mj to the cluster of mi
■ R = P = Plural Antecedent: mj is singular and mi is plural. Assign mi to mj’s cluster
Approach
■ End-to-end Neural Coreference + Plural Coreference Resolution
■ Adapt from CoNLL corpus to Friends corpus
– Nested mentions (nested mention detection, F1 72 vs 85)
– Singletons
– Plural speakers
– Character entities
– Plural mentions
■ Predict plural antecedents
■ Merge mentions into entity clusters
Plural speakers
■ Singular speakers à Same speaker binary flag vector
■ Plural speakers à Intersection of speakers, non-empty flag (speaker in common)
– Pros: captures existence of relationship
– Cons: no measure of match strength (exact vs partial)
■ Average speakers
– Assign embeddings to each speaker
– Average speakers’ embeddings
– Both mentions embeddings
– Pair-wise multiplication
Training Labels, Singletons
■ Baseline: End-to-end singular coreference resolution
– Modify gold labels for training set
– Pick ”head” mention, output singular clusters
– Evaluate on plural metrics
– Sort by appearance frequency
– Most popular, least popular, none. Eg. They à mom:2, dad:1
■ Singletons: Mentions not coreferent to other mentions
– Add left-over spans with mention score > threshold t (t=0)
– Only gold mentions receive + updates
Plural mentions
■ Singular Coreference
– Predict a coreferent antecedent for each mention
– Merge to the same cluster (transitive nature)
■ Plural Coreference
– Not transitive. Eg. {me, we}, {you, we} but not {me, you}
– Already have singular antecedents, need plural antecedents
– Predictions for all pairs of mentions (not just one per span)
Singularity
■ Singular + Plural antecedents à Revisit Higher-order coreference
– Weight Singular/Plural with Singularity S
■ Training loss = S*LossSingular + (1-S)*LossPlural 0.6 < S < 0.7
■ Antecedent labels
– Singular if span is singular (gold entity group size)
– Plural if span is plural and antecedent is singular
– Non-coreferent otherwise
Merging clusters + Many antecedents
■ Original clusters from baseline (singular antecedents): Add mi to cluster[mj]
– Then, for each plural prediction, add the span mj to cluster[mi]
– Example: I think we won. You did great. (we,you)àS, (I,we)àP. {you,we}, {I,we}
■ Only top singular antecedent for each span (or dummy)
– Softmax à Antecedents with score > dummy (no output label)
– Limit number of antecedents: maxplural and maxsingular (error accumulation)
– Example: I bought it for me, but we could share it.
Antecedent conflicts
■ Clustering: singular antecedents, then plural antecedents
■ Wrong predictions à Error propagation
■ New order for clustering
– Look at spans in order
– For each span, process its antecedents in order
■ Fix mention pair marked as singular and plural antecedent
– Pick the highest score
■ Example: I think we won. You did great.
■ Later spans, more antecedents
New Plural labels
■ Antecedent labels (Zhou and Choi)
– Singular if antecedent is singular
– Plural if antecedent is plural and span is singular
– Non-coreferent otherwise
– First singular, then plural antecedents
– New Plural
■ Reduce error propagation with span ordering
– New Plural + Ordering
Results
■ F Base + plural New plural
Base + plural New plural
Analisys
■ Many Singular antecedents
– Pros: Helps with missed antecedents. Essential for plurals
Example: I, me, I. Missed (I, me). We, I, you. Need (We,I) and (I,you)
– Cons:
Example: I, we, you. Wrong (I,you)àS.
– f
Conclusion
■ Successfully adapt coreference model to Friends corpus
– Plural speakers, singletons, plural mentions
■ Modify singular coreference (end-to-end) for plural coreference
– Gradually identify weaknesses and improve performance
– Labeling techniques for plural antecedents
– Clustering of antecedents
■ First model to achieve end-to-end neural plural coreference resolution
Bibliography
Henry Y. Chen, Ethan Zhou, and Jinho D. Choi. Robust coreference resolution and entity linking on dialogues: Character identification on tv show transcripts. In Proceedings of
the 21st Conference on Computational Natural Language Learning (CoNLL 2017), pages 216–225. Association for Computational Linguistics, 2017. doi: 28 29
10.18653/v1/K17-1023. URL http://aclweb.org/anthology/K17-1023.
Yu-Hsin Chen and Jinho D. Choi. Character identification on multiparty conversation: Identifying mentions of characters in tv shows. In Proceedings of the 17th Annual
Meeting of the Special Interest Group on Discourse and Dialogue, pages 90–100. Association for Computational Linguistics, 2016. doi: 10.18653/v1/W16-3612. URL
http://aclweb.org/anthology/W16-3612.
Kevin Clark and Christopher D. Manning. Deep reinforcement learning for mention-ranking coreference models. In Proceedings of the 2016 Conference on Empirical Methods
in Natural Language Processing, pages 2256–2262. Association for Computational Linguistics, 2016. doi: 10.18653/v1/D16-1245. URL http: //aclweb.org/anthology/D16-1245.
Kevin Clark and Christopher D. Manning. Improving coreference resolution by learning entity-level distributed representations. CoRR, abs/1606.01323, 2016. URL
http://arxiv.org/abs/1606.01323.
Arzoo Katiyar and Claire Cardie. Nested named entity recognition revisited. In Proceedings of the 2018 Conference of the North American Chapter of the Association for
Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 861–871, New Orleans, Louisiana, June 2018. Association for 30 Computational
Linguistics. doi: 10.18653/v1/N18-1079. URL http://www. aclweb.org/anthology/N18-1079.
Kenton Lee, Luheng He, Mike Lewis, and Luke Zettlemoyer. End-to-end neural coreference resolution. CoRR, abs/1707.07045, 2017. URL http://arxiv.org/ abs/1707.07045.
Kenton Lee, Luheng He, and Luke Zettlemoyer. Higher-order coreference resolution with coarse-to-fine inference. CoRR, abs/1804.05392, 2018. URL http://arxiv.
org/abs/1804.05392.
Sameer Pradhan, Alessandro Moschitti, Nianwen Xue, Olga Uryupina, and Yuchen Zhang. Conll-2012 shared task: Modeling multilingual unrestricted coreference in ontonotes.
In Joint Conference on EMNLP and CoNLL - Shared Task, CoNLL ’12, pages 1–40, Stroudsburg, PA, USA, 2012. Association for Computational Linguistics. URL
http://dl.acm.org/citation.cfm?id=2391181.2391183.
Sam Wiseman, Alexander M. Rush, and Stuart M. Shieber. Learning global features for coreference resolution. CoRR, abs/1604.03035, 2016. URL http://arxiv.
org/abs/1604.03035
Ethan Zhou and Jinho D. Choi. They exist! introducing plural mentions to coreference resolution and entity linking. In Proceedings of the 27th International Conference on 31
Computational Linguistics, pages 24–34. Association for Computational Linguistics, 2018. URL http://aclweb.org/anthology/C18-1003.
https://nlp.stanford.edu/projects/coref.shtml
Metrics

Más contenido relacionado

Similar a End-to-End Plural Coreference Resolution on TV Show Transcripts

Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language ProcessingPranav Gupta
 
Random Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda UruchurtuRandom Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda UruchurtuPyData
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingTed Xiao
 
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William EnckHotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William EnckTao Xie
 
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015RIILP
 
The noun phrase introducers of npChapter 4the noun phr.docx
The noun phrase  introducers of npChapter 4the noun phr.docxThe noun phrase  introducers of npChapter 4the noun phr.docx
The noun phrase introducers of npChapter 4the noun phr.docxarnoldmeredith47041
 
The noun phrase introducers of npChapter 4the noun phr.docx
The noun phrase  introducers of npChapter 4the noun phr.docxThe noun phrase  introducers of npChapter 4the noun phr.docx
The noun phrase introducers of npChapter 4the noun phr.docxdennisa15
 
Text Analytics for Security
Text Analytics for SecurityText Analytics for Security
Text Analytics for SecurityTao Xie
 
語言學概論Syntax
語言學概論Syntax語言學概論Syntax
語言學概論Syntax棠貝 白
 
Vocabulary and listening
Vocabulary and listeningVocabulary and listening
Vocabulary and listeningIvan Aguilar
 
general english
general englishgeneral english
general englishRohadi Mpd
 
Ok 1 general english
Ok 1 general englishOk 1 general english
Ok 1 general englishRohadi Mpd
 

Similar a End-to-End Plural Coreference Resolution on TV Show Transcripts (20)

haenelt.ppt
haenelt.ppthaenelt.ppt
haenelt.ppt
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Random Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda UruchurtuRandom Forests R vs Python by Linda Uruchurtu
Random Forests R vs Python by Linda Uruchurtu
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
 
A Panorama of Natural Language Processing
A Panorama of Natural Language ProcessingA Panorama of Natural Language Processing
A Panorama of Natural Language Processing
 
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William EnckHotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
HotSoS16 Tutorial "Text Analytics for Security" by Tao Xie and William Enck
 
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
 
APA 6Th Edition
APA 6Th EditionAPA 6Th Edition
APA 6Th Edition
 
E10 dec5 2011
E10 dec5 2011E10 dec5 2011
E10 dec5 2011
 
6 POS SA.pptx
6 POS SA.pptx6 POS SA.pptx
6 POS SA.pptx
 
The noun phrase introducers of npChapter 4the noun phr.docx
The noun phrase  introducers of npChapter 4the noun phr.docxThe noun phrase  introducers of npChapter 4the noun phr.docx
The noun phrase introducers of npChapter 4the noun phr.docx
 
The noun phrase introducers of npChapter 4the noun phr.docx
The noun phrase  introducers of npChapter 4the noun phr.docxThe noun phrase  introducers of npChapter 4the noun phr.docx
The noun phrase introducers of npChapter 4the noun phr.docx
 
Exam Preparation
Exam PreparationExam Preparation
Exam Preparation
 
Text Analytics for Security
Text Analytics for SecurityText Analytics for Security
Text Analytics for Security
 
語言學概論Syntax
語言學概論Syntax語言學概論Syntax
語言學概論Syntax
 
Semantics reloaded
Semantics reloadedSemantics reloaded
Semantics reloaded
 
K 12 handouts
K 12 handoutsK 12 handouts
K 12 handouts
 
Vocabulary and listening
Vocabulary and listeningVocabulary and listening
Vocabulary and listening
 
general english
general englishgeneral english
general english
 
Ok 1 general english
Ok 1 general englishOk 1 general english
Ok 1 general english
 

Más de Jinho Choi

Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Jinho Choi
 
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Jinho Choi
 
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Jinho Choi
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Jinho Choi
 
The Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionThe Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionJinho Choi
 
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Jinho Choi
 
Abstract Meaning Representation
Abstract Meaning RepresentationAbstract Meaning Representation
Abstract Meaning RepresentationJinho Choi
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role LabelingJinho Choi
 
CS329 - WordNet Similarities
CS329 - WordNet SimilaritiesCS329 - WordNet Similarities
CS329 - WordNet SimilaritiesJinho Choi
 
CS329 - Lexical Relations
CS329 - Lexical RelationsCS329 - Lexical Relations
CS329 - Lexical RelationsJinho Choi
 
Automatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementAutomatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementJinho Choi
 
Attention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingAttention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingJinho Choi
 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueJinho Choi
 
Real-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingReal-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingJinho Choi
 
Topological Sort
Topological SortTopological Sort
Topological SortJinho Choi
 
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseMulti-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseJinho Choi
 
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsBuilding Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsJinho Choi
 
How to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyHow to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyJinho Choi
 

Más de Jinho Choi (20)

Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
 
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
 
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
 
The Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionThe Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference Resolution
 
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
 
Abstract Meaning Representation
Abstract Meaning RepresentationAbstract Meaning Representation
Abstract Meaning Representation
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role Labeling
 
CKY Parsing
CKY ParsingCKY Parsing
CKY Parsing
 
CS329 - WordNet Similarities
CS329 - WordNet SimilaritiesCS329 - WordNet Similarities
CS329 - WordNet Similarities
 
CS329 - Lexical Relations
CS329 - Lexical RelationsCS329 - Lexical Relations
CS329 - Lexical Relations
 
Automatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementAutomatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue Management
 
Attention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingAttention is All You Need for AMR Parsing
Attention is All You Need for AMR Parsing
 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to Dialogue
 
Real-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingReal-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue Understanding
 
Topological Sort
Topological SortTopological Sort
Topological Sort
 
Tries - Put
Tries - PutTries - Put
Tries - Put
 
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseMulti-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
 
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsBuilding Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
 
How to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyHow to make Emora talk about Sports Intelligently
How to make Emora talk about Sports Intelligently
 

Último

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfOverkill Security
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Último (20)

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

End-to-End Plural Coreference Resolution on TV Show Transcripts

  • 1. End-to-end Plural Coreference Resolution on TV Show Transcripts Jose Coves Advisor: Dr. Jinho D. Choi Emory University, Department of Computer Science
  • 2. Contents ■ Introduction ■ Coreference Resolution ■ End-to-end Coreference ■ Plural Coreference + metrics ■ Approaches ■ Results
  • 3. Coreference Resolution ■ Coreference Resolution – Find expressions that refer to the same entity – Very important for higher-level NLP tasks – Natural language understanding: QA, summarization, information extraction, etc. – Unresolved fundamental NLP task – Syntactical structures, speakers, sequential order, text comprehension – Ambiguity: She told Monica she was smart. She told Joey she was smart. – Entities: General (locations, objects, etc.) or specific (people). – Mentions: Nested (The Wall of China). Plural (Mom and dad, they)
  • 4. Corpus: Friends TV Show ■ Entities – Known entities: Main characters. Eg. Joey is great. – GENERIC: Characters whose identity is not revealed. Eg. I like the waitress. – GENERAL: A class of people. Eg. The ideal girl doesn’t exist. – OTHER: Identity unknown from local context. Eg. The guy next to me. ■ Annotation – No nested plural entities (mom and dad). – Plural mentions are not coreferent. – Plural mentions added to coreferent entities’ clusters (they à mom, dad) – GENERAL, OTHER à Singletons: One mention. Eg: I like women.
  • 5. Neural Networks ■ FFNN – Feed output forward, as input to next layer, without forming a cycle ■ LSTM – Artificial RNN with loops that allow information to persist ■ CNN – Deep NN, extracts most important features in condensed form
  • 6. End-to-end Coreference Resolution ■ Produce coreferent clusters by assigning antecedents to top span ■ No syntactic parser or mention-detector ■ Y(i) = {ε, 1, . . . , i−1}, for each top span i ■ Dummy Antecedent ε à Not a mention or not coreferent with any antecedent ■ Optimize marginal log-likelihood ■ Random initial prunning, ■ Only gold mentions get positive updates
  • 7. End-to-end: Span representations ■ Max span width ■ Rank spans by mention score ■ Keep top K spans
  • 8. End-to-end System ■ Element-wise similarity ■ Feature vector: – speaker information – distance factor
  • 9. Higher-Order Coreference and Coarse- to-Fine Inference ■ Higher-Order Coreference – Expected antecedent – Gate vector – Update as weighted average ■ Coarse-to-Fine Inference – Span ratio r, keep K = rT top spans, T is document length – For each span, keep top C antecedents (Sc) – Compute final coreference score S(i,j)
  • 10. Plural Coreference Resolution ■ Uses gold mentions for test set predictions ■ Adds plural annotation to Friends corpus ■ Labeling for plural mentions ■ Clustering algorithm ■ Modify evaluation metrics ■ Feed all mention pairs into Agglomerative Convolutional Neural Network
  • 12. Labeling and Clustering ■ For each span mj, look at each antecedent mi ■ L = S = Singular Antecedent: mi is singular. Assign mj to the cluster of mi ■ R = P = Plural Antecedent: mj is singular and mi is plural. Assign mi to mj’s cluster
  • 13. Approach ■ End-to-end Neural Coreference + Plural Coreference Resolution ■ Adapt from CoNLL corpus to Friends corpus – Nested mentions (nested mention detection, F1 72 vs 85) – Singletons – Plural speakers – Character entities – Plural mentions ■ Predict plural antecedents ■ Merge mentions into entity clusters
  • 14. Plural speakers ■ Singular speakers à Same speaker binary flag vector ■ Plural speakers à Intersection of speakers, non-empty flag (speaker in common) – Pros: captures existence of relationship – Cons: no measure of match strength (exact vs partial) ■ Average speakers – Assign embeddings to each speaker – Average speakers’ embeddings – Both mentions embeddings – Pair-wise multiplication
  • 15. Training Labels, Singletons ■ Baseline: End-to-end singular coreference resolution – Modify gold labels for training set – Pick ”head” mention, output singular clusters – Evaluate on plural metrics – Sort by appearance frequency – Most popular, least popular, none. Eg. They à mom:2, dad:1 ■ Singletons: Mentions not coreferent to other mentions – Add left-over spans with mention score > threshold t (t=0) – Only gold mentions receive + updates
  • 16. Plural mentions ■ Singular Coreference – Predict a coreferent antecedent for each mention – Merge to the same cluster (transitive nature) ■ Plural Coreference – Not transitive. Eg. {me, we}, {you, we} but not {me, you} – Already have singular antecedents, need plural antecedents – Predictions for all pairs of mentions (not just one per span)
  • 17. Singularity ■ Singular + Plural antecedents à Revisit Higher-order coreference – Weight Singular/Plural with Singularity S ■ Training loss = S*LossSingular + (1-S)*LossPlural 0.6 < S < 0.7 ■ Antecedent labels – Singular if span is singular (gold entity group size) – Plural if span is plural and antecedent is singular – Non-coreferent otherwise
  • 18. Merging clusters + Many antecedents ■ Original clusters from baseline (singular antecedents): Add mi to cluster[mj] – Then, for each plural prediction, add the span mj to cluster[mi] – Example: I think we won. You did great. (we,you)àS, (I,we)àP. {you,we}, {I,we} ■ Only top singular antecedent for each span (or dummy) – Softmax à Antecedents with score > dummy (no output label) – Limit number of antecedents: maxplural and maxsingular (error accumulation) – Example: I bought it for me, but we could share it.
  • 19. Antecedent conflicts ■ Clustering: singular antecedents, then plural antecedents ■ Wrong predictions à Error propagation ■ New order for clustering – Look at spans in order – For each span, process its antecedents in order ■ Fix mention pair marked as singular and plural antecedent – Pick the highest score ■ Example: I think we won. You did great. ■ Later spans, more antecedents
  • 20. New Plural labels ■ Antecedent labels (Zhou and Choi) – Singular if antecedent is singular – Plural if antecedent is plural and span is singular – Non-coreferent otherwise – First singular, then plural antecedents – New Plural ■ Reduce error propagation with span ordering – New Plural + Ordering
  • 22.
  • 23. ■ F Base + plural New plural
  • 24. Base + plural New plural
  • 25. Analisys ■ Many Singular antecedents – Pros: Helps with missed antecedents. Essential for plurals Example: I, me, I. Missed (I, me). We, I, you. Need (We,I) and (I,you) – Cons: Example: I, we, you. Wrong (I,you)àS. – f
  • 26. Conclusion ■ Successfully adapt coreference model to Friends corpus – Plural speakers, singletons, plural mentions ■ Modify singular coreference (end-to-end) for plural coreference – Gradually identify weaknesses and improve performance – Labeling techniques for plural antecedents – Clustering of antecedents ■ First model to achieve end-to-end neural plural coreference resolution
  • 27. Bibliography Henry Y. Chen, Ethan Zhou, and Jinho D. Choi. Robust coreference resolution and entity linking on dialogues: Character identification on tv show transcripts. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), pages 216–225. Association for Computational Linguistics, 2017. doi: 28 29 10.18653/v1/K17-1023. URL http://aclweb.org/anthology/K17-1023. Yu-Hsin Chen and Jinho D. Choi. Character identification on multiparty conversation: Identifying mentions of characters in tv shows. In Proceedings of the 17th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 90–100. Association for Computational Linguistics, 2016. doi: 10.18653/v1/W16-3612. URL http://aclweb.org/anthology/W16-3612. Kevin Clark and Christopher D. Manning. Deep reinforcement learning for mention-ranking coreference models. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pages 2256–2262. Association for Computational Linguistics, 2016. doi: 10.18653/v1/D16-1245. URL http: //aclweb.org/anthology/D16-1245. Kevin Clark and Christopher D. Manning. Improving coreference resolution by learning entity-level distributed representations. CoRR, abs/1606.01323, 2016. URL http://arxiv.org/abs/1606.01323. Arzoo Katiyar and Claire Cardie. Nested named entity recognition revisited. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 861–871, New Orleans, Louisiana, June 2018. Association for 30 Computational Linguistics. doi: 10.18653/v1/N18-1079. URL http://www. aclweb.org/anthology/N18-1079. Kenton Lee, Luheng He, Mike Lewis, and Luke Zettlemoyer. End-to-end neural coreference resolution. CoRR, abs/1707.07045, 2017. URL http://arxiv.org/ abs/1707.07045. Kenton Lee, Luheng He, and Luke Zettlemoyer. Higher-order coreference resolution with coarse-to-fine inference. CoRR, abs/1804.05392, 2018. URL http://arxiv. org/abs/1804.05392. Sameer Pradhan, Alessandro Moschitti, Nianwen Xue, Olga Uryupina, and Yuchen Zhang. Conll-2012 shared task: Modeling multilingual unrestricted coreference in ontonotes. In Joint Conference on EMNLP and CoNLL - Shared Task, CoNLL ’12, pages 1–40, Stroudsburg, PA, USA, 2012. Association for Computational Linguistics. URL http://dl.acm.org/citation.cfm?id=2391181.2391183. Sam Wiseman, Alexander M. Rush, and Stuart M. Shieber. Learning global features for coreference resolution. CoRR, abs/1604.03035, 2016. URL http://arxiv. org/abs/1604.03035 Ethan Zhou and Jinho D. Choi. They exist! introducing plural mentions to coreference resolution and entity linking. In Proceedings of the 27th International Conference on 31 Computational Linguistics, pages 24–34. Association for Computational Linguistics, 2018. URL http://aclweb.org/anthology/C18-1003. https://nlp.stanford.edu/projects/coref.shtml