SlideShare una empresa de Scribd logo
1 de 25
Descargar para leer sin conexión
1/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Using Knowledge Base Semantics in
Context-Aware Entity Linking
Cheikh Brahim El Vaigh, François Goasdoué, Guillaume Gravier
and Pascale Sébillot
DocEng ’19, September 23–26, 2019, Berlin, Germany
2/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Context
Exploring large archive of a regional newspaper efficiently
iCODA1 :
Building an unified graph (RDF KB) with all data sources
Providing human friendly visualization for journalists
Bridging content and data : Linking content to the RDF
Knowledge Base
1
https://project.inria.fr/icoda/
3/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
RDF Knowledge Bases (KB)?
Specification of RDF graphs with triples :
(s, p, o) ∈ (U ∪ B)xUx(U ∪ L ∪ B) s
p
−→ o
RDF triples for facts and knowledge
RDF fact Triple notation
Class assertion (s, τ, o)
Property assertion (s, p, o) with p ∈ {τ, subC,
subP, d, r}
RDF knowledge Triple notation
Subclass (s, subC, o)
Subproperty (s, subP, o)
Domain typing (s, d, o)
Range typing (s, r, o)
4/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
KB example
Facts
(Q19837, τ, Person)
(Q19837, name, ”Steve Jobs”)
∗(Q19837, emplBy, Q312)
Knowledge
(Employee, subC, Person)
(emplBy, d, Employee)
(emplBy, r, Organization)
(emplBy, subP, workFor)
5/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Deriving the implicit triples from KB
Facts
(Q19837, τ, Person)
(Q19837, name, ”Steve Jobs”)
∗(Q19837, emplBy, Q312)
(Q19837, τ, Employee)
(Q312, τ, Organization)
∗(Q19837, workFor, Q312)
Knowledge
(Employee, subC, Person)
(emplBy, d, Employee)
(emplBy, r, Organization)
(emplBy, subP, workFor)
(workFor, r, Organization)
(workFor, d, Employee)
(workFor, d, Person)
(p, d, o), (s1, p, o1) → (s1, τ, o)
(p, r, o), (s1, p, o1) → (o1, τ, o)
(s, p1, o), (p1, subP, p) → (s, p, o)
(s, subC, o), (s1, τ, s) → (s1, τ, o)
6/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Standard Entity Linking pipeline
Definition (Entity Linking)
Identifying the entities of a reference knowledge base (KB) that are
mentioned in textual documents
6/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Standard Entity Linking pipeline
Definition (Entity Linking)
Identifying the entities of a reference knowledge base (KB) that are
mentioned in textual documents
Standard pipeline :
Named Entity Recognition (NER)
6/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Standard Entity Linking pipeline
Definition (Entity Linking)
Identifying the entities of a reference knowledge base (KB) that are
mentioned in textual documents
Standard pipeline :
Named Entity Recognition (NER)
Candidate Entity Generation
6/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Standard Entity Linking pipeline
Definition (Entity Linking)
Identifying the entities of a reference knowledge base (KB) that are
mentioned in textual documents
Standard pipeline :
Named Entity Recognition (NER)
Candidate Entity Generation
Candidate Entity Ranking
7/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Challenges
Mentions ambiguity :
Names variants : Jobs; Steve Jobs; Steven Paul Jobs
Indirect mentions : the CEO of Apple, Steve...
Less popular entities : Ploulec’h(Brittany)
8/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Limits of Entity Linking Techniques
Disregarding mentions context : entity-by-entity linking
Leveraging mostly Wikipedia : hyperlink graphs
Limited use of RDF KBs : binary indicator
9/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Collective Entity Linking
Definition (Collective Entity Linking)
Identifying all the entities of a reference knowledge base (KB) that
are mentioned in textual documents at once.
n : number of mention
mi : ith mention in the text
ei : a candidate entity for mi
(ˆe1, ..., ˆen)= arg max
e1,...,n
(
n
i=1
φ(mi |ei )
mention/entity similarity
+
n
i=1
n
j=1;j=i
ψ(ei |ej )
collective coherence score
)
CEL = φ() + ψ() + Optimisation
Optimisation = Graph search or Learning to rank
10/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Candidate Entities Generation + Local score
Candidate Entities Generation :
Dictionary : Cross-Wiki
cross-wiki["stevejobs"] => [[’7412236’, 0.99],[’5042765’,0.01]]
Search engine : Wikipedia search
Local score (φ) :
Cosine similarity : cosine(Vmention, Vcandidate entity)
Wikipedia popularity(mention, candidate entity)
pop(m, e) =
n(m, e)
e ∈W
n(m, e )
(1)
n(m, e) = number of time m occurs as anchor of e
11/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Word2vec computes φ()
Learning word representation based on its context.
Two models
Continuous Bag-of-Words (CBOW) : predicting word from
its context
Skip-Gram : predicting for a given word, its context
Example
dataset : the cat sits on the mat.
half-window : 1
CBOW :
([the,sits],cat),([cat,on],sits),([sits,the],on),([on,mat],the)
Skip-Gram :
(cat,[the,sits]),(sits,[cat,on]),(on,[sits,the]),(the,[on,mat])
Reflects semantic proximity
Computes cosine similarity
12/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
WSRM : Weighted Semantic Relatedness Measure
Contribution : WSRM
Generic definition over RDF KBs
Take advantages of RDF KBs semantics
More relations = strong similarity
WSRM(ei , ej ) =
n(ei , ej )
e ∈E
n(ei , e )
(2)
ψ(ei , ej ) =
1
2
(WSRM(ei , ej ) + WSRM(ej , ei )) . (3)
Pre-computed from the RDF KB
Similar to Wikipedia popularity Eq.1
13/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
WSRM computes ψ() : Example
Facts
(Q19837, τ, Person)
(Q19837, name, ”Steve Jobs”)
(Q19837, τ, Employee)
(Q312, τ, Organization)
∗(Q19837, emplBy, Q312)
∗(Q19837, workFor, Q312)
∗(Q19837, workWith, Q483382)
∗(Q483382, workFor, Q312)
Knowledges
(Employee, subC, Person)
(emplBy, d, Employee)
(emplBy, r, Organization)
(emplBy, subP, workFor)
(workFor, r, Organization)
(workFor, d, Employee)
WSRM(Q312, Q19837) = 0
WSRM(Q19837, Q312) = 2/3
WSRM(Q312, Q483382) = 0
WSRM(Q483382, Q312) = 1
ψ(Q312, Q19837) = 1/3
ψ(Q312, Q483382) = 1/2
14/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Local classifications : problem solved
Learning a matching mention/candidate entity (0 or 1)
Training logistic regression with 6 features : cos, pop, sum,
max@1, max@2, max@3
sum(eij ; m1, .., mn) =
n
l=1,k=i e∈C(ml )
ψ(eij , e) , (4)
maxk(eij ; m1, .., mn) =
n
max @k
l=1,j=i
max
e ∈C(ml )
ψ(ei , e) (5)
Global optimization (argmax) ⇔ local classifications
Rank with Posterior probability
15/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Datasets
AIDA [4] : entity annotated corpus of Reuters news
documents
Reuters128 [9] : Economic news articles
RSS500 [9] : RSS feeds including all major worldwide
newspapers
TAC-KBP 2016-2017 datasets [5, 6] : Newswire and
forum-discussion documents
Dataset Nb. docs Nb. mentions Avg nb. mentions/doc
TAC-KBP 2016 eval 169 9231 54.6
TAC-KBP 2017 eval 167 6915 41.4
AIDA-train 846 18519 21.9
AIDA-valid 216 4784 22.1
AIDA-test 231 4479 19.4
Reuters128 128 881 6.9
RSS-500 500 1000 2
Table: Statistics on the used datasets.
16/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Results (1/2) : Features Study
Features F1-score
popularity 72.3
popularity + cosine 72.9
popularity + cosine + sum 73.2
popularity + cosine + max1,2,3 75.7
popularity + cosine + sum + max1,2,3 75.9
Table: Linking accuracy (F1 score) on the TAC KBP-2017 dataset
Collective coherence improves Entity-by-Entity linking
SUM and MAX complementary
Local classification successfully aggregate local and coherence
scores
17/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Results (2/2) : Comparison to State-of-the-Art Approaches
Approach AIDA-A AIDA-B Reuters128 RSS-500
CNN [2] (Entity-by-Entity) - 85.5 - -
End-to-End [7] 89.4 82.4 54.6 42.2
NCEL [1] 79.0 80.0 - -
AGDISTIS [10] 57.5 57.8 68.9 54.2
Babelfy [8] 71.9 75.5 54.8 64.1
AIDA [4] 74.3 76.5 56.6 65.5
PBoH [3] 79.4 80.0 68.3 55.3
WSRM 90.6 87.7 79.9 79.3
Table: Micro-averaged F1 score for different methods on the four
datasets
WSRM stable for different datasets
18/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Conclusion and perspectives
CEL without Wikipedia
Improving CEL with WSRM = Binary indicator
Opening the door to semantic reasoning : paths of size
m, m > 1
18/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Yixin Cao, Lei Hou, Juanzi Li, and Zhiyuan Liu.
Neural collective entity linking.
In Proceedings of the 27th International Conference on
Computational Linguistics, pages 675–686, 2018.
Matthew Francis-Landau, Greg Durrett, and Dan Klein.
Capturing semantic similarity for entity linking with
convolutional neural networks.
In Proceedings of the 15th Annual Conference of the North
American Chapter of the Association for Computational
Linguistics: Human Language Technologies, pages 1256–1261,
2016.
Octavian-Eugen Ganea, Marina Ganea, Aurelien Lucchi,
Carsten Eickhoff, and Thomas Hofmann.
Probabilistic bag-of-hyperlinks model for entity linking.
In Proceedings of the 25th International Conference on World
Wide Web, pages 927–938, 2016.
18/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Johannes Hoffart, Mohamed Amir Yosef, Ilaria Bordino, Hagen
Fürstenau, Manfred Pinkal, Marc Spaniol, Bilyana Taneva,
Stefan Thater, and Gerhard Weikum.
Robust disambiguation of named entities in text.
In Proceedings of the 2011 Conference on Empirical Methods
in Natural Language Processing, pages 782–792, 2011.
Heng Ji and Nothman.
Overview of tac-kbp2016 tri-lingual edl and its impact on
end-to-end cold-start kbp.
Proceedings of the 2016 Text Analysis Conference, 2016.
Heng Ji, Xiaoman Pan, Boliang Zhang, Joel Nothman, James
Mayfield, Paul McNamee, and Cash Costello.
Overview of tac-kbp2017 13 languages entity discovery and
linking.
In Proceedings of the 2017 Text Analysis Conference, 2017.
Nikolaos Kolitsas, Octavian-Eugen Ganea, and Thomas
Hofmann.
18/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
End-to-end neural entity linking.
In Proceedings of the 22nd Conference on Computational
Natural Language Learning, pages 519–529, 2018.
Andrea Moro, Alessandro Raganato, and Roberto Navigli.
Entity linking meets word sense disambiguation: a unified
approach.
Transactions of the Association for Computational Linguistics,
2:231–244, 2014.
Michael Röder, Ricardo Usbeck, Sebastian Hellmann, Daniel
Gerber, and Andreas Both.
N3- a collection of datasets for named entity recognition and
disambiguation in the nlp interchange format.
In Proceedings of the 9th International Conference on
Language Resources and Evaluation, pages 3529–3533, 2014.
Ricardo Usbeck, Axel-Cyrille Ngonga Ngomo, Michael Röder,
Daniel Gerber, Sandro Athaide Coelho, Sören Auer, and
Andreas Both.
18/18
Introduction Collective Entity Linking Conclusion and perspectives Bibliographie
Agdistis - graph-based disambiguation of named entities using
linked data.
In Proceedings of the International Semantic Web Conference,
pages 457–471, 2014.

Más contenido relacionado

La actualidad más candente

R-programming-training-in-mumbai
R-programming-training-in-mumbaiR-programming-training-in-mumbai
R-programming-training-in-mumbaiUnmesh Baile
 
Joint Word and Entity Embeddings for Entity Retrieval from Knowledge Graph
Joint Word and Entity Embeddings for Entity Retrieval from Knowledge GraphJoint Word and Entity Embeddings for Entity Retrieval from Knowledge Graph
Joint Word and Entity Embeddings for Entity Retrieval from Knowledge GraphFedorNikolaev
 
多媒體資料庫(New)3rd
多媒體資料庫(New)3rd多媒體資料庫(New)3rd
多媒體資料庫(New)3rdKevingo Tsai
 
Gremlin Queries with DataStax Enterprise Graph
Gremlin Queries with DataStax Enterprise GraphGremlin Queries with DataStax Enterprise Graph
Gremlin Queries with DataStax Enterprise GraphStephen Mallette
 
Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...
Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...
Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...Lukas Galke
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationVladimir Alexiev, PhD, PMP
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsbutest
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlBen Healey
 
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...FedorNikolaev
 
Asynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and AlgorithmsAsynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and AlgorithmsFabian Pedregosa
 
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftStrong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftSebastian Ruder
 
Gremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming LanguageGremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming LanguageMarko Rodriguez
 
RDataMining slides-r-programming
RDataMining slides-r-programmingRDataMining slides-r-programming
RDataMining slides-r-programmingYanchang Zhao
 
(Hierarchical) topic modeling
(Hierarchical) topic modeling (Hierarchical) topic modeling
(Hierarchical) topic modeling Yueshen Xu
 

La actualidad más candente (19)

R-programming-training-in-mumbai
R-programming-training-in-mumbaiR-programming-training-in-mumbai
R-programming-training-in-mumbai
 
Ir
IrIr
Ir
 
Joint Word and Entity Embeddings for Entity Retrieval from Knowledge Graph
Joint Word and Entity Embeddings for Entity Retrieval from Knowledge GraphJoint Word and Entity Embeddings for Entity Retrieval from Knowledge Graph
Joint Word and Entity Embeddings for Entity Retrieval from Knowledge Graph
 
多媒體資料庫(New)3rd
多媒體資料庫(New)3rd多媒體資料庫(New)3rd
多媒體資料庫(New)3rd
 
Gremlin Queries with DataStax Enterprise Graph
Gremlin Queries with DataStax Enterprise GraphGremlin Queries with DataStax Enterprise Graph
Gremlin Queries with DataStax Enterprise Graph
 
Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...
Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...
Evaluating the Impact of Word Embeddings on Similarity Scoring in Practical I...
 
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic RepresentationGetty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
 
Incomplete Information in RDF
Incomplete Information in RDFIncomplete Information in RDF
Incomplete Information in RDF
 
Software tookits for machine learning and graphical models
Software tookits for machine learning and graphical modelsSoftware tookits for machine learning and graphical models
Software tookits for machine learning and graphical models
 
Graph mining ppt
Graph mining pptGraph mining ppt
Graph mining ppt
 
Text analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco ControlText analytics in Python and R with examples from Tobacco Control
Text analytics in Python and R with examples from Tobacco Control
 
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of...
 
PAKDD2013
PAKDD2013PAKDD2013
PAKDD2013
 
Asynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and AlgorithmsAsynchronous Stochastic Optimization, New Analysis and Algorithms
Asynchronous Stochastic Optimization, New Analysis and Algorithms
 
A Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF GraphsA Survey of Entity Ranking over RDF Graphs
A Survey of Entity Ranking over RDF Graphs
 
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain ShiftStrong Baselines for Neural Semi-supervised Learning under Domain Shift
Strong Baselines for Neural Semi-supervised Learning under Domain Shift
 
Gremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming LanguageGremlin: A Graph-Based Programming Language
Gremlin: A Graph-Based Programming Language
 
RDataMining slides-r-programming
RDataMining slides-r-programmingRDataMining slides-r-programming
RDataMining slides-r-programming
 
(Hierarchical) topic modeling
(Hierarchical) topic modeling (Hierarchical) topic modeling
(Hierarchical) topic modeling
 

Similar a Collective entity linking with WSRM DocEng'19

Evaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented SearchEvaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented Searchkrisztianbalog
 
Session 1.5 supporting virtual integration of linked data with just-in-time...
Session 1.5   supporting virtual integration of linked data with just-in-time...Session 1.5   supporting virtual integration of linked data with just-in-time...
Session 1.5 supporting virtual integration of linked data with just-in-time...semanticsconference
 
Reconciling Event-Based Knowledge through RDF2VEC
Reconciling Event-Based Knowledge through RDF2VECReconciling Event-Based Knowledge through RDF2VEC
Reconciling Event-Based Knowledge through RDF2VECMehwish Alam
 
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked DataDedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked DataVrije Universiteit Amsterdam
 
Fuzzy formal concept analysis: Approaches, applications and issues
Fuzzy formal concept analysis: Approaches, applications and issuesFuzzy formal concept analysis: Approaches, applications and issues
Fuzzy formal concept analysis: Approaches, applications and issuesCSITiaesprime
 
ForecastCombinations package
ForecastCombinations packageForecastCombinations package
ForecastCombinations packageeraviv
 
Relaxing global-as-view in mediated data integration from linked data
Relaxing global-as-view in mediated data integration from linked dataRelaxing global-as-view in mediated data integration from linked data
Relaxing global-as-view in mediated data integration from linked dataAlessandro Adamou
 
Entity Retrieval (WWW 2013 tutorial)
Entity Retrieval (WWW 2013 tutorial)Entity Retrieval (WWW 2013 tutorial)
Entity Retrieval (WWW 2013 tutorial)krisztianbalog
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overviewdgarijo
 
Semantic Search and Result Presentation with Entity Cards
Semantic Search and Result Presentation with Entity CardsSemantic Search and Result Presentation with Entity Cards
Semantic Search and Result Presentation with Entity CardsFaegheh Hasibi
 
bridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the webbridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the webFabien Gandon
 
Creating a dataset of peer review in computer science conferences published b...
Creating a dataset of peer review in computer science conferences published b...Creating a dataset of peer review in computer science conferences published b...
Creating a dataset of peer review in computer science conferences published b...Aliaksandr Birukou
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligencevini89
 
Building data fusion surrogate models for spacecraft aerodynamic problems wit...
Building data fusion surrogate models for spacecraft aerodynamic problems wit...Building data fusion surrogate models for spacecraft aerodynamic problems wit...
Building data fusion surrogate models for spacecraft aerodynamic problems wit...Shinwoo Jang
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph MaintenancePaul Groth
 
FScaFi: A Core Calculus for Collective Adaptive Systems Programming
FScaFi: A Core Calculus for Collective Adaptive Systems ProgrammingFScaFi: A Core Calculus for Collective Adaptive Systems Programming
FScaFi: A Core Calculus for Collective Adaptive Systems ProgrammingRoberto Casadei
 
Types Working for You, Not Against You
Types Working for You, Not Against YouTypes Working for You, Not Against You
Types Working for You, Not Against YouC4Media
 
Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012
Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012
Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012taxonbytes
 

Similar a Collective entity linking with WSRM DocEng'19 (20)

Evaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented SearchEvaluation Initiatives for Entity-oriented Search
Evaluation Initiatives for Entity-oriented Search
 
Session 1.5 supporting virtual integration of linked data with just-in-time...
Session 1.5   supporting virtual integration of linked data with just-in-time...Session 1.5   supporting virtual integration of linked data with just-in-time...
Session 1.5 supporting virtual integration of linked data with just-in-time...
 
Reconciling Event-Based Knowledge through RDF2VEC
Reconciling Event-Based Knowledge through RDF2VECReconciling Event-Based Knowledge through RDF2VEC
Reconciling Event-Based Knowledge through RDF2VEC
 
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked DataDedalo, looking for Cluster Explanations in a labyrinth of Linked Data
Dedalo, looking for Cluster Explanations in a labyrinth of Linked Data
 
Fuzzy formal concept analysis: Approaches, applications and issues
Fuzzy formal concept analysis: Approaches, applications and issuesFuzzy formal concept analysis: Approaches, applications and issues
Fuzzy formal concept analysis: Approaches, applications and issues
 
Wi presentation
Wi presentationWi presentation
Wi presentation
 
ForecastCombinations package
ForecastCombinations packageForecastCombinations package
ForecastCombinations package
 
Relaxing global-as-view in mediated data integration from linked data
Relaxing global-as-view in mediated data integration from linked dataRelaxing global-as-view in mediated data integration from linked data
Relaxing global-as-view in mediated data integration from linked data
 
Entity Retrieval (WWW 2013 tutorial)
Entity Retrieval (WWW 2013 tutorial)Entity Retrieval (WWW 2013 tutorial)
Entity Retrieval (WWW 2013 tutorial)
 
EDBT 2015: Summer School Overview
EDBT 2015: Summer School OverviewEDBT 2015: Summer School Overview
EDBT 2015: Summer School Overview
 
Semantic Search and Result Presentation with Entity Cards
Semantic Search and Result Presentation with Entity CardsSemantic Search and Result Presentation with Entity Cards
Semantic Search and Result Presentation with Entity Cards
 
3DRepo
3DRepo3DRepo
3DRepo
 
bridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the webbridging formal semantics and social semantics on the web
bridging formal semantics and social semantics on the web
 
Creating a dataset of peer review in computer science conferences published b...
Creating a dataset of peer review in computer science conferences published b...Creating a dataset of peer review in computer science conferences published b...
Creating a dataset of peer review in computer science conferences published b...
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Building data fusion surrogate models for spacecraft aerodynamic problems wit...
Building data fusion surrogate models for spacecraft aerodynamic problems wit...Building data fusion surrogate models for spacecraft aerodynamic problems wit...
Building data fusion surrogate models for spacecraft aerodynamic problems wit...
 
Knowledge Graph Maintenance
Knowledge Graph MaintenanceKnowledge Graph Maintenance
Knowledge Graph Maintenance
 
FScaFi: A Core Calculus for Collective Adaptive Systems Programming
FScaFi: A Core Calculus for Collective Adaptive Systems ProgrammingFScaFi: A Core Calculus for Collective Adaptive Systems Programming
FScaFi: A Core Calculus for Collective Adaptive Systems Programming
 
Types Working for You, Not Against You
Types Working for You, Not Against YouTypes Working for You, Not Against You
Types Working for You, Not Against You
 
Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012
Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012
Franz et. al. 2012. Reconciling Succeeding Classifications, ESA 2012
 

Último

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and ClassificationsAreesha Ahmad
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 

Último (20)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 

Collective entity linking with WSRM DocEng'19

  • 1. 1/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Using Knowledge Base Semantics in Context-Aware Entity Linking Cheikh Brahim El Vaigh, François Goasdoué, Guillaume Gravier and Pascale Sébillot DocEng ’19, September 23–26, 2019, Berlin, Germany
  • 2. 2/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Context Exploring large archive of a regional newspaper efficiently iCODA1 : Building an unified graph (RDF KB) with all data sources Providing human friendly visualization for journalists Bridging content and data : Linking content to the RDF Knowledge Base 1 https://project.inria.fr/icoda/
  • 3. 3/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie RDF Knowledge Bases (KB)? Specification of RDF graphs with triples : (s, p, o) ∈ (U ∪ B)xUx(U ∪ L ∪ B) s p −→ o RDF triples for facts and knowledge RDF fact Triple notation Class assertion (s, τ, o) Property assertion (s, p, o) with p ∈ {τ, subC, subP, d, r} RDF knowledge Triple notation Subclass (s, subC, o) Subproperty (s, subP, o) Domain typing (s, d, o) Range typing (s, r, o)
  • 4. 4/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie KB example Facts (Q19837, τ, Person) (Q19837, name, ”Steve Jobs”) ∗(Q19837, emplBy, Q312) Knowledge (Employee, subC, Person) (emplBy, d, Employee) (emplBy, r, Organization) (emplBy, subP, workFor)
  • 5. 5/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Deriving the implicit triples from KB Facts (Q19837, τ, Person) (Q19837, name, ”Steve Jobs”) ∗(Q19837, emplBy, Q312) (Q19837, τ, Employee) (Q312, τ, Organization) ∗(Q19837, workFor, Q312) Knowledge (Employee, subC, Person) (emplBy, d, Employee) (emplBy, r, Organization) (emplBy, subP, workFor) (workFor, r, Organization) (workFor, d, Employee) (workFor, d, Person) (p, d, o), (s1, p, o1) → (s1, τ, o) (p, r, o), (s1, p, o1) → (o1, τ, o) (s, p1, o), (p1, subP, p) → (s, p, o) (s, subC, o), (s1, τ, s) → (s1, τ, o)
  • 6. 6/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Standard Entity Linking pipeline Definition (Entity Linking) Identifying the entities of a reference knowledge base (KB) that are mentioned in textual documents
  • 7. 6/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Standard Entity Linking pipeline Definition (Entity Linking) Identifying the entities of a reference knowledge base (KB) that are mentioned in textual documents Standard pipeline : Named Entity Recognition (NER)
  • 8. 6/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Standard Entity Linking pipeline Definition (Entity Linking) Identifying the entities of a reference knowledge base (KB) that are mentioned in textual documents Standard pipeline : Named Entity Recognition (NER) Candidate Entity Generation
  • 9. 6/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Standard Entity Linking pipeline Definition (Entity Linking) Identifying the entities of a reference knowledge base (KB) that are mentioned in textual documents Standard pipeline : Named Entity Recognition (NER) Candidate Entity Generation Candidate Entity Ranking
  • 10. 7/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Challenges Mentions ambiguity : Names variants : Jobs; Steve Jobs; Steven Paul Jobs Indirect mentions : the CEO of Apple, Steve... Less popular entities : Ploulec’h(Brittany)
  • 11. 8/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Limits of Entity Linking Techniques Disregarding mentions context : entity-by-entity linking Leveraging mostly Wikipedia : hyperlink graphs Limited use of RDF KBs : binary indicator
  • 12. 9/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Collective Entity Linking Definition (Collective Entity Linking) Identifying all the entities of a reference knowledge base (KB) that are mentioned in textual documents at once. n : number of mention mi : ith mention in the text ei : a candidate entity for mi (ˆe1, ..., ˆen)= arg max e1,...,n ( n i=1 φ(mi |ei ) mention/entity similarity + n i=1 n j=1;j=i ψ(ei |ej ) collective coherence score ) CEL = φ() + ψ() + Optimisation Optimisation = Graph search or Learning to rank
  • 13. 10/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Candidate Entities Generation + Local score Candidate Entities Generation : Dictionary : Cross-Wiki cross-wiki["stevejobs"] => [[’7412236’, 0.99],[’5042765’,0.01]] Search engine : Wikipedia search Local score (φ) : Cosine similarity : cosine(Vmention, Vcandidate entity) Wikipedia popularity(mention, candidate entity) pop(m, e) = n(m, e) e ∈W n(m, e ) (1) n(m, e) = number of time m occurs as anchor of e
  • 14. 11/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Word2vec computes φ() Learning word representation based on its context. Two models Continuous Bag-of-Words (CBOW) : predicting word from its context Skip-Gram : predicting for a given word, its context Example dataset : the cat sits on the mat. half-window : 1 CBOW : ([the,sits],cat),([cat,on],sits),([sits,the],on),([on,mat],the) Skip-Gram : (cat,[the,sits]),(sits,[cat,on]),(on,[sits,the]),(the,[on,mat]) Reflects semantic proximity Computes cosine similarity
  • 15. 12/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie WSRM : Weighted Semantic Relatedness Measure Contribution : WSRM Generic definition over RDF KBs Take advantages of RDF KBs semantics More relations = strong similarity WSRM(ei , ej ) = n(ei , ej ) e ∈E n(ei , e ) (2) ψ(ei , ej ) = 1 2 (WSRM(ei , ej ) + WSRM(ej , ei )) . (3) Pre-computed from the RDF KB Similar to Wikipedia popularity Eq.1
  • 16. 13/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie WSRM computes ψ() : Example Facts (Q19837, τ, Person) (Q19837, name, ”Steve Jobs”) (Q19837, τ, Employee) (Q312, τ, Organization) ∗(Q19837, emplBy, Q312) ∗(Q19837, workFor, Q312) ∗(Q19837, workWith, Q483382) ∗(Q483382, workFor, Q312) Knowledges (Employee, subC, Person) (emplBy, d, Employee) (emplBy, r, Organization) (emplBy, subP, workFor) (workFor, r, Organization) (workFor, d, Employee) WSRM(Q312, Q19837) = 0 WSRM(Q19837, Q312) = 2/3 WSRM(Q312, Q483382) = 0 WSRM(Q483382, Q312) = 1 ψ(Q312, Q19837) = 1/3 ψ(Q312, Q483382) = 1/2
  • 17. 14/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Local classifications : problem solved Learning a matching mention/candidate entity (0 or 1) Training logistic regression with 6 features : cos, pop, sum, max@1, max@2, max@3 sum(eij ; m1, .., mn) = n l=1,k=i e∈C(ml ) ψ(eij , e) , (4) maxk(eij ; m1, .., mn) = n max @k l=1,j=i max e ∈C(ml ) ψ(ei , e) (5) Global optimization (argmax) ⇔ local classifications Rank with Posterior probability
  • 18. 15/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Datasets AIDA [4] : entity annotated corpus of Reuters news documents Reuters128 [9] : Economic news articles RSS500 [9] : RSS feeds including all major worldwide newspapers TAC-KBP 2016-2017 datasets [5, 6] : Newswire and forum-discussion documents Dataset Nb. docs Nb. mentions Avg nb. mentions/doc TAC-KBP 2016 eval 169 9231 54.6 TAC-KBP 2017 eval 167 6915 41.4 AIDA-train 846 18519 21.9 AIDA-valid 216 4784 22.1 AIDA-test 231 4479 19.4 Reuters128 128 881 6.9 RSS-500 500 1000 2 Table: Statistics on the used datasets.
  • 19. 16/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Results (1/2) : Features Study Features F1-score popularity 72.3 popularity + cosine 72.9 popularity + cosine + sum 73.2 popularity + cosine + max1,2,3 75.7 popularity + cosine + sum + max1,2,3 75.9 Table: Linking accuracy (F1 score) on the TAC KBP-2017 dataset Collective coherence improves Entity-by-Entity linking SUM and MAX complementary Local classification successfully aggregate local and coherence scores
  • 20. 17/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Results (2/2) : Comparison to State-of-the-Art Approaches Approach AIDA-A AIDA-B Reuters128 RSS-500 CNN [2] (Entity-by-Entity) - 85.5 - - End-to-End [7] 89.4 82.4 54.6 42.2 NCEL [1] 79.0 80.0 - - AGDISTIS [10] 57.5 57.8 68.9 54.2 Babelfy [8] 71.9 75.5 54.8 64.1 AIDA [4] 74.3 76.5 56.6 65.5 PBoH [3] 79.4 80.0 68.3 55.3 WSRM 90.6 87.7 79.9 79.3 Table: Micro-averaged F1 score for different methods on the four datasets WSRM stable for different datasets
  • 21. 18/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Conclusion and perspectives CEL without Wikipedia Improving CEL with WSRM = Binary indicator Opening the door to semantic reasoning : paths of size m, m > 1
  • 22. 18/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Yixin Cao, Lei Hou, Juanzi Li, and Zhiyuan Liu. Neural collective entity linking. In Proceedings of the 27th International Conference on Computational Linguistics, pages 675–686, 2018. Matthew Francis-Landau, Greg Durrett, and Dan Klein. Capturing semantic similarity for entity linking with convolutional neural networks. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1256–1261, 2016. Octavian-Eugen Ganea, Marina Ganea, Aurelien Lucchi, Carsten Eickhoff, and Thomas Hofmann. Probabilistic bag-of-hyperlinks model for entity linking. In Proceedings of the 25th International Conference on World Wide Web, pages 927–938, 2016.
  • 23. 18/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Johannes Hoffart, Mohamed Amir Yosef, Ilaria Bordino, Hagen Fürstenau, Manfred Pinkal, Marc Spaniol, Bilyana Taneva, Stefan Thater, and Gerhard Weikum. Robust disambiguation of named entities in text. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 782–792, 2011. Heng Ji and Nothman. Overview of tac-kbp2016 tri-lingual edl and its impact on end-to-end cold-start kbp. Proceedings of the 2016 Text Analysis Conference, 2016. Heng Ji, Xiaoman Pan, Boliang Zhang, Joel Nothman, James Mayfield, Paul McNamee, and Cash Costello. Overview of tac-kbp2017 13 languages entity discovery and linking. In Proceedings of the 2017 Text Analysis Conference, 2017. Nikolaos Kolitsas, Octavian-Eugen Ganea, and Thomas Hofmann.
  • 24. 18/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie End-to-end neural entity linking. In Proceedings of the 22nd Conference on Computational Natural Language Learning, pages 519–529, 2018. Andrea Moro, Alessandro Raganato, and Roberto Navigli. Entity linking meets word sense disambiguation: a unified approach. Transactions of the Association for Computational Linguistics, 2:231–244, 2014. Michael Röder, Ricardo Usbeck, Sebastian Hellmann, Daniel Gerber, and Andreas Both. N3- a collection of datasets for named entity recognition and disambiguation in the nlp interchange format. In Proceedings of the 9th International Conference on Language Resources and Evaluation, pages 3529–3533, 2014. Ricardo Usbeck, Axel-Cyrille Ngonga Ngomo, Michael Röder, Daniel Gerber, Sandro Athaide Coelho, Sören Auer, and Andreas Both.
  • 25. 18/18 Introduction Collective Entity Linking Conclusion and perspectives Bibliographie Agdistis - graph-based disambiguation of named entities using linked data. In Proceedings of the International Semantic Web Conference, pages 457–471, 2014.