SlideShare a Scribd company logo
1 of 16
Download to read offline
SemEval 2018 Task 4:
Character Identification
on Multiparty Dialogues
Jinho D. Choi @ Emory University
HenryY. Chen @ Snap Inc.
https://github.com/emorynlp/semeval-2018-task4
http://nlp.mathcs.emory.edu
Character Mining
2
Extract or infer various information about
individual characters in multiparty dialogues
text chat
audio chat
video chat
Character Mining
3
Transcripts from the TV show, Friends
A total of 10 seasons, 236 episodes, 3,107 scenes
Character Mining
4
Character Reading
Comprehension
Emotion
Detection
Personality
TestIdentification
https://github.com/emorynlp/character-mining
5
Ross I told mom and dad last night, they seemed to take it pretty well.
Monica
Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll
never have grandchildren, I'll never have grandchildren." was what? A wrong number?
Ross Sorry.
Joey
Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.

Can I tell you what the answer is?
Jack Monica
Ross Joey
Judy
Character Identification
6
Ross I told mom and dad last night, they seemed to take it pretty well.
Monica
Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll
never have grandchildren, I'll never have grandchildren." was what? A wrong number?
Ross Sorry.
Joey
Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.

Can I tell you what the answer is?
Entity Linking or Coreference Resolution?
? ?
Cross-document resolution is required
Judy & Jack appear in some other dialogue
Character Identification
Corpus
7
Episodes Scenes Speakers Utterances Sentences Tokens
Season 1 24 326 107 5,968 10,790 81,453
Season 2 24 293 107 5,747 9,337 81,910
Total 48 619 171 11,715 20,127 163,363
Two seasons of Friends
Pseudo-annotated then
manually corrected
Manually double-annotated
by crowd workers
Nominals referring to humans Characters in the show
Mentions Entities
Mention Annotation
8
Named Entity Pronoun Gazeteer Total
Season 1 946 7,549 811 9,306
Season 2 1,037 7,459 806 9,302
Total 1,983 15,008 1,617 18,608
A noun phrase is a mention if it is either
1. A PERSON named entity, or
2. A pronoun or possessive pronoun excluding“it” and “they”, or
3. One of the personal noun gazetteers that are 603 common and singular
personal nouns selected from Freebase and DBPedia (e.g., mom, girlfriend).
Gives about the F1 score of 95.93%
Entity Annotation
9
Primary Secondary Generic Collective General Other Total
Season 1 5,160 2,526 178 1,150 111 187 9,312
Season 2 5,385 2,340 120 1,238 44 169 9,296
Total 10,545 4,866 298 2,388 155 356 18,608
1. Primary: six main characters of the show.
2. Secondary: all the non-primary characters.
3. Generic: real entities whose identities are not defined 

(e.g.,That waitress is really cute, I am going to ask her out)
4. Collective: the plural use of the pronoun you, which cannot be deterministically
distinguished from the singular use
5. General: mentions used in reference to a general case rather than an specific entity 

(e.g.,The ideal guy you look for doesn’t exist)
6. Other: indicates all the other kinds of mentions
Used for this year’s
shared task
Evaluation Metrics
10
Labeling Accuracy
# of all mentions
# of correctly identified mentions
Average Macro F1
F1 score of the i’th character
i = 1
N
N
1
6 Main Characters + Others All Characters
Overall Scores
11
Over 90 teams registered at CodaLab
Only 4 teams made submissions :-(
Main + Others All Characters
Acc F1 Acc F1
AMORE-UPF 77.23 79.36 74.72 41.05
KNU-CI 85.10 86.00 69.49 16.98
Kampfpudding 73.36 73.51 59.45 37.37
Zuma-AR 46.85 44.68 33.06 16.09
Main Characters + Others
12
Ross Rachel Chand Joey Pheob Monica OTHER
AMORE-UPF 78.57 82.98 81.36 79.83 86.52 85.22 61.02
KNU-CI 85.86 92.49 84.94 79.67 88.09 91.16 79.79
Kampfpudding 73.48 70.67 79.25 63.38 79.79 73.35 74.61
Zuma-AR 38.72 43.05 43.04 36.10 42.90 46.43 51.78
Evaluation 18.98 13.96 9.80 9.51 9.02 8.97 29.77
Training 13.93 12.37 11.43 9.43 8.79 10.61 33.44
All Characters
13
Be Ca Ed Pa Ju MB Ri Sc Cr Fr Ja OT
AMORE 50.00 57.14 80.60 35.56 72.73 64.52 80.85 10.00 61.54 0.00 42.11 7.89
KNU-CI 38.46 62.79 73.02 15.38 42.55 0.00 66.67 38.46 0.00 18.18 16.00 0.00
Kamp. 31.86 33.33 68.85 33.33 60.32 50.00 61.22 10.00 0.00 0.00 23.53 0.00
Zuma 0.00 12.24 44.44 0.00 27.91 15.38 77.78 0.00 38.46 0.00 12.50 0.00
TST 3.46 1.73 1.56 1.44 1.32 0.86 0.86 0.78 0.74 0.70 0.62 2.92
TRN 1.41 1.46 1.06 0.71 1.15 0.60 1.83 0.21 0.13 0.51 0.43 13.51
Next Step
14
Character Identification v2.0
Four seasons of Friends are completely annotated
Plural mentions are annotated
Ihttps://github.com/emorynlp/character-identification
They Exist! Introducing Plural Mentions to
Coreference Resolution and Entity Linking
Ethan Zhou and Jinho D. Choi
COLING 2018
ThankYou
15
Henry Chen Ethan Zhou
Questions
16

More Related Content

What's hot

What's hot (16)

Silent hill essay script 1
Silent hill essay script 1Silent hill essay script 1
Silent hill essay script 1
 
michael
michaelmichael
michael
 
Gameplay and ludology
Gameplay and ludologyGameplay and ludology
Gameplay and ludology
 
Data gathering on gaming results
Data gathering on gaming resultsData gathering on gaming results
Data gathering on gaming results
 
Script presintation
Script presintation Script presintation
Script presintation
 
Q script
Q scriptQ script
Q script
 
Shotlist
Shotlist Shotlist
Shotlist
 
3. research (1)
3. research (1)3. research (1)
3. research (1)
 
The Zodiac Ciphers: What do we know? And when can we stop trying to solve them?
The Zodiac Ciphers: What do we know?  And when can we stop trying to solve them?The Zodiac Ciphers: What do we know?  And when can we stop trying to solve them?
The Zodiac Ciphers: What do we know? And when can we stop trying to solve them?
 
Game review dead island riptide
Game review  dead island riptideGame review  dead island riptide
Game review dead island riptide
 
What is a_game
What is a_gameWhat is a_game
What is a_game
 
Mafia 2
Mafia 2Mafia 2
Mafia 2
 
Q cards
Q cardsQ cards
Q cards
 
3. research
3. research3. research
3. research
 
Graphical styles
Graphical stylesGraphical styles
Graphical styles
 
Why games?
Why games?Why games?
Why games?
 

More from Jinho Choi

Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Jinho Choi
 
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Jinho Choi
 
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Jinho Choi
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Jinho Choi
 
The Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionThe Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionJinho Choi
 
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Jinho Choi
 
Abstract Meaning Representation
Abstract Meaning RepresentationAbstract Meaning Representation
Abstract Meaning RepresentationJinho Choi
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role LabelingJinho Choi
 
CS329 - WordNet Similarities
CS329 - WordNet SimilaritiesCS329 - WordNet Similarities
CS329 - WordNet SimilaritiesJinho Choi
 
CS329 - Lexical Relations
CS329 - Lexical RelationsCS329 - Lexical Relations
CS329 - Lexical RelationsJinho Choi
 
Automatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementAutomatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementJinho Choi
 
Attention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingAttention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingJinho Choi
 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueJinho Choi
 
Real-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingReal-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingJinho Choi
 
Topological Sort
Topological SortTopological Sort
Topological SortJinho Choi
 
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseMulti-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseJinho Choi
 
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsBuilding Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsJinho Choi
 
How to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyHow to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyJinho Choi
 

More from Jinho Choi (20)

Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal ...
 
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
Analysis of Hierarchical Multi-Content Text Classification Model on B-SHARP D...
 
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...Competence-Level Prediction and Resume & Job Description Matching Using Conte...
Competence-Level Prediction and Resume & Job Description Matching Using Conte...
 
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
Transformers to Learn Hierarchical Contexts in Multiparty Dialogue for Span-b...
 
The Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference ResolutionThe Myth of Higher-Order Inference in Coreference Resolution
The Myth of Higher-Order Inference in Coreference Resolution
 
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
Noise Pollution in Hospital Readmission Prediction: Long Document Classificat...
 
Abstract Meaning Representation
Abstract Meaning RepresentationAbstract Meaning Representation
Abstract Meaning Representation
 
Semantic Role Labeling
Semantic Role LabelingSemantic Role Labeling
Semantic Role Labeling
 
CKY Parsing
CKY ParsingCKY Parsing
CKY Parsing
 
CS329 - WordNet Similarities
CS329 - WordNet SimilaritiesCS329 - WordNet Similarities
CS329 - WordNet Similarities
 
CS329 - Lexical Relations
CS329 - Lexical RelationsCS329 - Lexical Relations
CS329 - Lexical Relations
 
Automatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue ManagementAutomatic Knowledge Base Expansion for Dialogue Management
Automatic Knowledge Base Expansion for Dialogue Management
 
Attention is All You Need for AMR Parsing
Attention is All You Need for AMR ParsingAttention is All You Need for AMR Parsing
Attention is All You Need for AMR Parsing
 
Graph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to DialogueGraph-to-Text Generation and its Applications to Dialogue
Graph-to-Text Generation and its Applications to Dialogue
 
Real-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue UnderstandingReal-time Coreference Resolution for Dialogue Understanding
Real-time Coreference Resolution for Dialogue Understanding
 
Topological Sort
Topological SortTopological Sort
Topological Sort
 
Tries - Put
Tries - PutTries - Put
Tries - Put
 
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's DiseaseMulti-modal Embedding Learning for Early Detection of Alzheimer's Disease
Multi-modal Embedding Learning for Early Detection of Alzheimer's Disease
 
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue ContextsBuilding Widely-Interpretable Semantic Networks for Dialogue Contexts
Building Widely-Interpretable Semantic Networks for Dialogue Contexts
 
How to make Emora talk about Sports Intelligently
How to make Emora talk about Sports IntelligentlyHow to make Emora talk about Sports Intelligently
How to make Emora talk about Sports Intelligently
 

Recently uploaded

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

SemEval 2018 Task 4: Character Identification on Multiparty Dialogues

  • 1. SemEval 2018 Task 4: Character Identification on Multiparty Dialogues Jinho D. Choi @ Emory University HenryY. Chen @ Snap Inc. https://github.com/emorynlp/semeval-2018-task4 http://nlp.mathcs.emory.edu
  • 2. Character Mining 2 Extract or infer various information about individual characters in multiparty dialogues text chat audio chat video chat
  • 3. Character Mining 3 Transcripts from the TV show, Friends A total of 10 seasons, 236 episodes, 3,107 scenes
  • 5. 5 Ross I told mom and dad last night, they seemed to take it pretty well. Monica Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll never have grandchildren, I'll never have grandchildren." was what? A wrong number? Ross Sorry. Joey Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.
 Can I tell you what the answer is? Jack Monica Ross Joey Judy Character Identification
  • 6. 6 Ross I told mom and dad last night, they seemed to take it pretty well. Monica Oh really, so that hysterical phone call I got from a woman at sobbing 3:00 A.M., "I'll never have grandchildren, I'll never have grandchildren." was what? A wrong number? Ross Sorry. Joey Alright Ross, look. You're feeling a lot of pain right now. You're angry. You're hurting.
 Can I tell you what the answer is? Entity Linking or Coreference Resolution? ? ? Cross-document resolution is required Judy & Jack appear in some other dialogue Character Identification
  • 7. Corpus 7 Episodes Scenes Speakers Utterances Sentences Tokens Season 1 24 326 107 5,968 10,790 81,453 Season 2 24 293 107 5,747 9,337 81,910 Total 48 619 171 11,715 20,127 163,363 Two seasons of Friends Pseudo-annotated then manually corrected Manually double-annotated by crowd workers Nominals referring to humans Characters in the show Mentions Entities
  • 8. Mention Annotation 8 Named Entity Pronoun Gazeteer Total Season 1 946 7,549 811 9,306 Season 2 1,037 7,459 806 9,302 Total 1,983 15,008 1,617 18,608 A noun phrase is a mention if it is either 1. A PERSON named entity, or 2. A pronoun or possessive pronoun excluding“it” and “they”, or 3. One of the personal noun gazetteers that are 603 common and singular personal nouns selected from Freebase and DBPedia (e.g., mom, girlfriend). Gives about the F1 score of 95.93%
  • 9. Entity Annotation 9 Primary Secondary Generic Collective General Other Total Season 1 5,160 2,526 178 1,150 111 187 9,312 Season 2 5,385 2,340 120 1,238 44 169 9,296 Total 10,545 4,866 298 2,388 155 356 18,608 1. Primary: six main characters of the show. 2. Secondary: all the non-primary characters. 3. Generic: real entities whose identities are not defined 
 (e.g.,That waitress is really cute, I am going to ask her out) 4. Collective: the plural use of the pronoun you, which cannot be deterministically distinguished from the singular use 5. General: mentions used in reference to a general case rather than an specific entity 
 (e.g.,The ideal guy you look for doesn’t exist) 6. Other: indicates all the other kinds of mentions Used for this year’s shared task
  • 10. Evaluation Metrics 10 Labeling Accuracy # of all mentions # of correctly identified mentions Average Macro F1 F1 score of the i’th character i = 1 N N 1 6 Main Characters + Others All Characters
  • 11. Overall Scores 11 Over 90 teams registered at CodaLab Only 4 teams made submissions :-( Main + Others All Characters Acc F1 Acc F1 AMORE-UPF 77.23 79.36 74.72 41.05 KNU-CI 85.10 86.00 69.49 16.98 Kampfpudding 73.36 73.51 59.45 37.37 Zuma-AR 46.85 44.68 33.06 16.09
  • 12. Main Characters + Others 12 Ross Rachel Chand Joey Pheob Monica OTHER AMORE-UPF 78.57 82.98 81.36 79.83 86.52 85.22 61.02 KNU-CI 85.86 92.49 84.94 79.67 88.09 91.16 79.79 Kampfpudding 73.48 70.67 79.25 63.38 79.79 73.35 74.61 Zuma-AR 38.72 43.05 43.04 36.10 42.90 46.43 51.78 Evaluation 18.98 13.96 9.80 9.51 9.02 8.97 29.77 Training 13.93 12.37 11.43 9.43 8.79 10.61 33.44
  • 13. All Characters 13 Be Ca Ed Pa Ju MB Ri Sc Cr Fr Ja OT AMORE 50.00 57.14 80.60 35.56 72.73 64.52 80.85 10.00 61.54 0.00 42.11 7.89 KNU-CI 38.46 62.79 73.02 15.38 42.55 0.00 66.67 38.46 0.00 18.18 16.00 0.00 Kamp. 31.86 33.33 68.85 33.33 60.32 50.00 61.22 10.00 0.00 0.00 23.53 0.00 Zuma 0.00 12.24 44.44 0.00 27.91 15.38 77.78 0.00 38.46 0.00 12.50 0.00 TST 3.46 1.73 1.56 1.44 1.32 0.86 0.86 0.78 0.74 0.70 0.62 2.92 TRN 1.41 1.46 1.06 0.71 1.15 0.60 1.83 0.21 0.13 0.51 0.43 13.51
  • 14. Next Step 14 Character Identification v2.0 Four seasons of Friends are completely annotated Plural mentions are annotated Ihttps://github.com/emorynlp/character-identification They Exist! Introducing Plural Mentions to Coreference Resolution and Entity Linking Ethan Zhou and Jinho D. Choi COLING 2018