Using Linked Disambiguated Distributional
Networks for Word Sense Disambiguation
Chris Biemann
biemann@informatik.uni-hamburg.de
Alexander Panchenko
panchenko@informatik.uni-hamburg.de
Stefano Faralli
stefano@informatik.uni-mannheim.de
Simone Paolo Ponzetto
simone@informatik.uni-mannheim.de
Dmitry Ustalov
dmitry.ustalov@urfu.ru
Presented by:
Universität Hamburg – Panchenko et al.: Using Linked Disambiguated Distributional Networks for WSD (04.04.2017) – Slide 2
Contribution
An unsupervised knowledge-based approach to WSD based on
the Hybrid Aligned Resource (HAR) by Faralli et al. (2016):
• Learning sparse distributional sense representations from text;
• Linking them to the language resource (LR);
• Expanding sense representations of the LR.
Combines distributional and knowledge-based sense
representations.
The method requires no linking of texts to a sense inventory and
thus can be applied to large text collections.
Faralli S., Panchenko A., Biemann C., and Ponzetto S.P. (2016). Linked disambiguated distributional semantic networks.
In International Semantic Web Conference (ISWC’2016), pages 56–64, Kobe, Japan. Springer.
Linked Disambiguated Distributional Networks for WSD
Distributional corpus-derived information
- Hybrid Aligned Resource (HAR) by Faralli et al. (2016)
- Distributional sense representations linked to a lexical resource (WordNet, ...)
- Sample entries of the HAR for the words “mouse” and “keyboard”.
Information from the knowledge base
Construction of the Hybrid Aligned Resource (HAR):
1. Building a Distributional Thesaurus (DT).
2. Word Sense Induction.
3. Labeling Word Senses with Hypernyms.
4. Disambiguation of Related Terms and Hypernyms.
5. Retrieval of Context Clues.
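Steps 1 and 2 can be illustrated with a toy sketch. It assumes the distributional thesaurus is a mapping from each word to its most similar words, and it induces senses by clustering the ego network of the target word (the graph of its neighbours with the target itself removed). Connected components stand in here for a real graph clustering algorithm such as Chinese Whispers [2]; the toy thesaurus `DT` and the function `induce_senses` are hypothetical and not taken from the HAR code.

```python
from collections import defaultdict

# Toy distributional thesaurus (hypothetical data): word -> similar words.
DT = {
    "mouse": ["rat", "hamster", "keyboard", "joystick"],
    "rat": ["hamster", "mouse"],
    "hamster": ["rat", "mouse"],
    "keyboard": ["joystick", "mouse"],
    "joystick": ["keyboard", "mouse"],
}

def induce_senses(target, dt):
    """Cluster the ego network of `target`; each cluster is one induced sense."""
    neighbours = set(dt.get(target, []))
    # Edges between neighbours only: the target itself is left out of the graph.
    graph = defaultdict(set)
    for word in neighbours:
        for other in dt.get(word, []):
            if other in neighbours:
                graph[word].add(other)
                graph[other].add(word)
    senses, seen = [], set()
    for word in sorted(neighbours):
        if word in seen:
            continue
        # Depth-first search collects one connected component.
        cluster, stack = set(), [word]
        while stack:
            node = stack.pop()
            if node in cluster:
                continue
            cluster.add(node)
            stack.extend(graph[node] - cluster)
        seen |= cluster
        senses.append(sorted(cluster))
    return senses

senses = induce_senses("mouse", DT)
print(senses)
```

On this toy data the neighbours of "mouse" split into an animal cluster and a device cluster, which is the kind of sense distinction the later steps label with hypernyms and link to the lexical resource.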
HAR Datasets used in our experiment (Faralli et al., 2016):
− news:
• a 100 million sentence news corpus
• average polysemy of 2.3
− wiki:
• a 35 million sentence Wikipedia corpus
• average polysemy of 1.8
Using the Hybrid Aligned Resource in Word Sense Disambiguation:
− WordNet: this baseline model relies solely on WordNet:
• Synonyms
• Glosses
• Target synset + synsets directly connected to it
− WordNet + Related: augments the WordNet-based representation
with related terms from the corpus-induced word senses.
− WordNet + Related + Context: all features of the previous model
plus context clues obtained by aggregating features of the sense
cluster words.
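The three models differ only in which words are pooled into the representation of each sense. A minimal sketch, assuming each sense is a bag of words and the sense whose bag shares the most words with the context is chosen; the overlap score, the sense identifiers and the toy data are illustrative assumptions, not the scoring function or data used in the experiments.

```python
# Hypothetical sense representations for "disk" (toy data, not taken from WordNet or the HAR).
WORDNET = {
    "disk#storage": {"magnetic", "memory", "device"},
    "disk#shape": {"flat", "circular", "plate"},
}
RELATED = {  # related terms from the linked corpus-induced sense
    "disk#storage": {"floppy", "drive", "cd-rom"},
    "disk#shape": {"wheel", "saucer"},
}
CONTEXT = {  # context clues aggregated over the sense cluster words
    "disk#storage": {"format", "install", "boot"},
    "disk#shape": {"spin", "throw"},
}

def build(*layers):
    """Union the given feature layers per sense, e.g. WordNet + Related."""
    return {s: set().union(*(layer.get(s, set()) for layer in layers)) for s in layers[0]}

def disambiguate(tokens, senses):
    """Return the sense with the largest word overlap, or None if nothing matches."""
    scores = {s: len(words & set(tokens)) for s, words in senses.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None

sentence = "boot the system from the floppy".split()
baseline = disambiguate(sentence, build(WORDNET))
enriched = disambiguate(sentence, build(WORDNET, RELATED, CONTEXT))
print(baseline, enriched)
```

In this toy case the baseline abstains because the sentence shares no word with the WordNet-only bags, while the enriched representation matches "floppy" and "boot".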
- The third sense of the word “disk” in WordNet:
  - the initial WordNet-based sense representation vs.
  - the sense representation enriched via linking to the HAR
  - enriched with related words from the HAR
Evaluation: Research Questions
RQ 1:
Does the linked sense representation improve WSD performance
compared to the original WordNet-based sense representation?
RQ 2:
What is the quality of our approach compared to the SOTA
unsupervised knowledge-based WSD systems?
Evaluation: Dataset and Evaluation Metrics
SemEval-2007 Task 16 “Evaluation of wide-coverage knowledge
resources” (Cuadros and Rigau, 2007):
- specifically designed for evaluating the impact of lexical
resources on the WSD performance
- the task dataset is based on WordNet-labeled sentences from:
  - Senseval-3 (Mihalcea et al., 2004)
  - SemEval-2007 Task 17 (Pradhan et al., 2007)
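Knowledge-based systems may leave some instances unlabelled, so WSD evaluations of this kind usually report precision, recall and F1 rather than a single accuracy figure. A minimal sketch, assuming gold and predicted labels are keyed by instance id and that a missing prediction counts as an abstention; the function name and toy labels are hypothetical.

```python
def precision_recall_f1(gold, predicted):
    """Score sense labels; instances absent from `predicted` count as abstentions."""
    attempted = [i for i in gold if predicted.get(i) is not None]
    correct = sum(1 for i in attempted if predicted[i] == gold[i])
    precision = correct / len(attempted) if attempted else 0.0
    recall = correct / len(gold) if gold else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1

gold = {"d1": "disk#3", "d2": "disk#1", "d3": "mouse#4", "d4": "mouse#1"}
predicted = {"d1": "disk#3", "d2": "disk#2", "d3": "mouse#4"}  # d4 left unlabelled
p, r, f1 = precision_recall_f1(gold, predicted)
print(f"P={p:.3f} R={r:.3f} F1={f1:.3f}")
```

Precision is computed over attempted instances only, while recall is computed over all gold instances, so a resource with broader coverage can raise recall without changing precision.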
RQ1: Results
Does the linked sense representation improve WSD performance
compared to the original WordNet-based sense representation?
RQ2: Baselines
What is the quality of our approach compared to the SOTA
unsupervised knowledge-based WSD systems?
State-of-the-art unsupervised knowledge-based methods:
- WN+XWN (Cuadros and Rigau, 2007)
  - WordNet + eXtended WordNet (parsing WordNet glosses)
- KnowNet (Cuadros and Rigau, 2008)
  - based on snippets retrieved with a web search engine
- BabelNet (Navigli and Ponzetto, 2012)
  - Wikipedia articles + WordNet synsets
- NASARI (Camacho-Collados et al., 2015)
  - vector representations of senses based on Wikipedia and WordNet
  - lexical or sense-based feature spaces
  - the links between WordNet and Wikipedia are retrieved from BabelNet
RQ2: Results
What is the quality of our approach compared to the SOTA
unsupervised knowledge-based WSD systems?
Conclusions
− We presented a novel approach to knowledge-based WSD:
• Learning sparse distributional sense representations from text;
• Linking them to the language resource (LR);
• Expanding sense representations of the LR.
− A possibility to use large corpora: the method is not limited to
Wikipedia-linked texts as in BabelNet, NASARI.
− RQ1: Distributional sense representations let us substantially
outperform the model based solely on the lexical resource.
− RQ2: Comparable performance to the state-of-the-art hybrid
approaches leveraging corpus-based features.
References
[1] Chris Biemann and Martin Riedl. 2013. Text: Now in 2D! A Framework for Lexical Expansion with
Contextual Similarity. Journal of Language Modelling, 1(1):55–95.
[2] Chris Biemann. 2006. Chinese whispers - an efficient graph clustering algorithm and its application to
natural language processing problems. In Proceedings of TextGraphs: the First Workshop on Graph
Based Methods for Natural Language Processing, pages 73–80, New York City. Association for
Computational Linguistics.
[3] Jose Camacho-Collados, Mohammad Taher Pilehvar, and Roberto Navigli. 2015. NASARI: a novel
approach to a semantically-aware representation of items. In Proceedings of the 2015 Conference of
the North American Chapter of the Association for Computational Linguistics: Human Language
Technologies, pages 567–577, Denver, Colorado. Association for Computational Linguistics.
[4] Montse Cuadros and German Rigau. 2007. SemEval-2007 Task 16: Evaluation of wide coverage
knowledge resources. In Proceedings of the Fourth International Workshop on Semantic Evaluations
(SemEval-2007), pages 81–86, Prague, Czech Republic. Association for Computational Linguistics.
[5] Montse Cuadros and German Rigau. 2008. KnowNet: Building a large net of knowledge from the web.
In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008),
pages 161–168, Manchester, UK, August. Coling 2008 Organizing Committee.
[6] Stefano Faralli, Alexander Panchenko, Chris Biemann, and Simone P. Ponzetto. 2016. Linked
disambiguated distributional semantic networks. In International Semantic Web Conference
(ISWC’2016), pages 56–64, Kobe, Japan. Springer.
[7] Roberto Navigli and Simone Paolo Ponzetto. 2012. BabelNet: The automatic construction, evaluation
and application of a wide-coverage multilingual semantic network. Artificial Intelligence, 193:217–250.

Más contenido relacionado

La actualidad más candente

Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
Isabelle Augenstein
 
Web services for supporting the interactions of learners in the social web - ...
Web services for supporting the interactions of learners in the social web - ...Web services for supporting the interactions of learners in the social web - ...
Web services for supporting the interactions of learners in the social web - ...
Traian Rebedea
 

La actualidad más candente (16)

Computational Rhetoric for Serbian - Resources and Implementation
Computational Rhetoric for Serbian - Resources and ImplementationComputational Rhetoric for Serbian - Resources and Implementation
Computational Rhetoric for Serbian - Resources and Implementation
 
Learning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyondLearning with limited labelled data in NLP: multi-task learning and beyond
Learning with limited labelled data in NLP: multi-task learning and beyond
 
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate ...
 
Neural Information Retrieval: In search of meaningful progress
Neural Information Retrieval: In search of meaningful progressNeural Information Retrieval: In search of meaningful progress
Neural Information Retrieval: In search of meaningful progress
 
How useful are semantic links for the detection of implicit references in csc...
How useful are semantic links for the detection of implicit references in csc...How useful are semantic links for the detection of implicit references in csc...
How useful are semantic links for the detection of implicit references in csc...
 
Interactive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector EmbeddingsInteractive Analysis of Word Vector Embeddings
Interactive Analysis of Word Vector Embeddings
 
ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3ESWC 2014 Tutorial part 3
ESWC 2014 Tutorial part 3
 
A Low Dimensionality Representation for Language Variety Identification (CICL...
A Low Dimensionality Representation for Language Variety Identification (CICL...A Low Dimensionality Representation for Language Variety Identification (CICL...
A Low Dimensionality Representation for Language Variety Identification (CICL...
 
Language Variety Identification using Distributed Representations of Words an...
Language Variety Identification using Distributed Representations of Words an...Language Variety Identification using Distributed Representations of Words an...
Language Variety Identification using Distributed Representations of Words an...
 
Detecting and Describing Historical Periods in a Large Corpora
Detecting and Describing Historical Periods in a Large CorporaDetecting and Describing Historical Periods in a Large Corpora
Detecting and Describing Historical Periods in a Large Corpora
 
Web services for supporting the interactions of learners in the social web - ...
Web services for supporting the interactions of learners in the social web - ...Web services for supporting the interactions of learners in the social web - ...
Web services for supporting the interactions of learners in the social web - ...
 
Towards a Distributional Semantic Web Stack
Towards a Distributional Semantic Web StackTowards a Distributional Semantic Web Stack
Towards a Distributional Semantic Web Stack
 
Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene)
Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene)Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene)
Dynamic Topic Modeling via Non-negative Matrix Factorization (Dr. Derek Greene)
 
Benchmarking for Neural Information Retrieval: MS MARCO, TREC, and Beyond
Benchmarking for Neural Information Retrieval: MS MARCO, TREC, and BeyondBenchmarking for Neural Information Retrieval: MS MARCO, TREC, and Beyond
Benchmarking for Neural Information Retrieval: MS MARCO, TREC, and Beyond
 
Neural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain ShiftNeural Semi-supervised Learning under Domain Shift
Neural Semi-supervised Learning under Domain Shift
 
A Comparison of Supervised Learning Classifiers for Link Discovery
A Comparison of Supervised Learning Classifiers for Link DiscoveryA Comparison of Supervised Learning Classifiers for Link Discovery
A Comparison of Supervised Learning Classifiers for Link Discovery
 

Similar a Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation

A Semantic-enhanced Inference Framework for Heterogeneous Resources Management
A Semantic-enhanced Inference Framework for Heterogeneous Resources ManagementA Semantic-enhanced Inference Framework for Heterogeneous Resources Management
A Semantic-enhanced Inference Framework for Heterogeneous Resources Management
Silvia Giannini
 
Open Data Dialog 2013 - Linked Data in Education
Open Data Dialog 2013 - Linked Data in EducationOpen Data Dialog 2013 - Linked Data in Education
Open Data Dialog 2013 - Linked Data in Education
Stefan Dietze
 
SATANJEEV BANERJEE
SATANJEEV BANERJEESATANJEEV BANERJEE
SATANJEEV BANERJEE
butest
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
Sören Auer
 

Similar a Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation (20)

Text Mining for Lexicography
Text Mining for LexicographyText Mining for Lexicography
Text Mining for Lexicography
 
Embedding for fun fumarola Meetup Milano DLI luglio
Embedding for fun fumarola Meetup Milano DLI luglioEmbedding for fun fumarola Meetup Milano DLI luglio
Embedding for fun fumarola Meetup Milano DLI luglio
 
A Semantic-enhanced Inference Framework for Heterogeneous Resources Management
A Semantic-enhanced Inference Framework for Heterogeneous Resources ManagementA Semantic-enhanced Inference Framework for Heterogeneous Resources Management
A Semantic-enhanced Inference Framework for Heterogeneous Resources Management
 
20140506 edrene athens_winer
20140506 edrene athens_winer20140506 edrene athens_winer
20140506 edrene athens_winer
 
QALD-7 Question Answering over Linked Data Challenge
QALD-7 Question Answering over Linked Data ChallengeQALD-7 Question Answering over Linked Data Challenge
QALD-7 Question Answering over Linked Data Challenge
 
Qald 7 at ESWC2017
Qald 7 at ESWC2017Qald 7 at ESWC2017
Qald 7 at ESWC2017
 
Open Data Dialog 2013 - Linked Data in Education
Open Data Dialog 2013 - Linked Data in EducationOpen Data Dialog 2013 - Linked Data in Education
Open Data Dialog 2013 - Linked Data in Education
 
SATANJEEV BANERJEE
SATANJEEV BANERJEESATANJEEV BANERJEE
SATANJEEV BANERJEE
 
Neural Models for Information Retrieval
Neural Models for Information RetrievalNeural Models for Information Retrieval
Neural Models for Information Retrieval
 
Approach to leverage Websites to APIs through Semantics
Approach to leverage Websites to APIs through SemanticsApproach to leverage Websites to APIs through Semantics
Approach to leverage Websites to APIs through Semantics
 
Dabbagh pbl2016-presentation
Dabbagh pbl2016-presentationDabbagh pbl2016-presentation
Dabbagh pbl2016-presentation
 
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp SchollSemantic Relatedness of Web Resources by XESA - Philipp Scholl
Semantic Relatedness of Web Resources by XESA - Philipp Scholl
 
Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...
Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...
Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...
 
2010-04-29-swnj-pcls-presentation
2010-04-29-swnj-pcls-presentation2010-04-29-swnj-pcls-presentation
2010-04-29-swnj-pcls-presentation
 
An Entity-Driven Recursive Neural Network Model for Chinese Discourse Coheren...
An Entity-Driven Recursive Neural Network Model for Chinese Discourse Coheren...An Entity-Driven Recursive Neural Network Model for Chinese Discourse Coheren...
An Entity-Driven Recursive Neural Network Model for Chinese Discourse Coheren...
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect matchLinked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
 
Semantics-aware Content-based Recommender Systems
Semantics-aware Content-based Recommender SystemsSemantics-aware Content-based Recommender Systems
Semantics-aware Content-based Recommender Systems
 
Interpreting Embeddings with Comparison
Interpreting Embeddings with ComparisonInterpreting Embeddings with Comparison
Interpreting Embeddings with Comparison
 
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIESTHE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
THE ABILITY OF WORD EMBEDDINGS TO CAPTURE WORD SIMILARITIES
 

Más de Alexander Panchenko

The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
Alexander Panchenko
 
Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK Data
Alexander Panchenko
 
Неологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукНеологизмы в социальной сети Фейсбук
Неологизмы в социальной сети Фейсбук
Alexander Panchenko
 
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Alexander Panchenko
 
Semantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionSemantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation Extraction
Alexander Panchenko
 

Más de Alexander Panchenko (18)

Graph's not dead: from unsupervised induction of linguistic structures from t...
Graph's not dead: from unsupervised induction of linguistic structures from t...Graph's not dead: from unsupervised induction of linguistic structures from t...
Graph's not dead: from unsupervised induction of linguistic structures from t...
 
Building a Web-Scale Dependency-Parsed Corpus from Common Crawl
Building a Web-Scale Dependency-Parsed Corpus from Common CrawlBuilding a Web-Scale Dependency-Parsed Corpus from Common Crawl
Building a Web-Scale Dependency-Parsed Corpus from Common Crawl
 
Improving Hypernymy Extraction with Distributional Semantic Classes
Improving Hypernymy Extraction with Distributional Semantic ClassesImproving Hypernymy Extraction with Distributional Semantic Classes
Improving Hypernymy Extraction with Distributional Semantic Classes
 
Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources
Inducing Interpretable Word Senses for WSD and Enrichment of Lexical ResourcesInducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources
Inducing Interpretable Word Senses for WSD and Enrichment of Lexical Resources
 
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
IIT-UHH at SemEval-2017 Task 3: Exploring Multiple Features for Community Que...
 
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
Fighting with Sparsity of the Synonymy Dictionaries for Automatic Synset Indu...
 
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...The 6th Conference on Analysis of Images, Social Networks, and Texts  (AIST 2...
The 6th Conference on Analysis of Images, Social Networks, and Texts (AIST 2...
 
Getting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part IIGetting started in Apache Spark and Flink (with Scala) - Part II
Getting started in Apache Spark and Flink (with Scala) - Part II
 
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain ...
 
Text Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK DataText Analysis of Social Networks: Working with FB and VK Data
Text Analysis of Social Networks: Working with FB and VK Data
 
Неологизмы в социальной сети Фейсбук
Неологизмы в социальной сети ФейсбукНеологизмы в социальной сети Фейсбук
Неологизмы в социальной сети Фейсбук
 
Sentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking FacebookSentiment Index of the Russian Speaking Facebook
Sentiment Index of the Russian Speaking Facebook
 
Similarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation ExtractionSimilarity Measures for Semantic Relation Extraction
Similarity Measures for Semantic Relation Extraction
 
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
Dmitry Gubanov. An Approach to the Study of Formal and Informal Relations of ...
 
Detecting Gender by Full Name: Experiments with the Russian Language
Detecting Gender by Full Name:  Experiments with the Russian LanguageDetecting Gender by Full Name:  Experiments with the Russian Language
Detecting Gender by Full Name: Experiments with the Russian Language
 
Document
DocumentDocument
Document
 
Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...Вычислительная лексическая семантика: метрики семантической близости и их при...
Вычислительная лексическая семантика: метрики семантической близости и их при...
 
Semantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation ExtractionSemantic Similarity Measures for Semantic Relation Extraction
Semantic Similarity Measures for Semantic Relation Extraction
 

Último

THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Silpa
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Sérgio Sacani
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
Silpa
 

Último (20)

300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation

  • 1. Universität Hamburg, – Name of Presenter: Titel of Talk/Slideset (Version: 27.6.2014) – Slide 1
    Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation
    Chris Biemann, biemann@informatik.uni-hamburg.de
    Alexander Panchenko, panchenko@informatik.uni-hamburg.de
    Stefano Faralli, stefano@informatik.uni-mannheim.de
    Simone Paolo Ponzetto, simone@informatik.uni-mannheim.de
    Dmitry Ustalov, dmitry.ustalov@urfu.ru
    Presented by:
  • 2. Universität Hamburg – Panchenko et al.: Using Linked Disambiguated Distributional Networks for WSD (04.04.2017) – Slide 2
    Contribution
    An unsupervised knowledge-based approach to WSD based on the Hybrid Aligned Resource (HAR) by Faralli et al. (2016):
      • Learning sparse distributional sense representations from text;
      • Linking them to the language resource (LR);
      • Expanding sense representations of the LR.
    Combines distributional and knowledge-based sense representations.
    Faralli S., Panchenko A., Biemann C., and Ponzetto S.P. (2016). Linked disambiguated distributional semantic networks. In International Semantic Web Conference (ISWC’2016), pages 56–64, Kobe, Japan. Springer.
  • 3. Contribution
    An unsupervised knowledge-based approach to WSD based on the Hybrid Aligned Resource (HAR) by Faralli et al. (2016):
      • Learning sparse distributional sense representations from text;
      • Linking them to the language resource (LR);
      • Expanding sense representations of the LR.
    Combines distributional and knowledge-based sense representations.
    The method requires no linking of texts to a sense inventory and thus can be applied to large text collections.
    Faralli S., Panchenko A., Biemann C., and Ponzetto S.P. (2016). Linked disambiguated distributional semantic networks. In International Semantic Web Conference (ISWC’2016), pages 56–64, Kobe, Japan. Springer.
  • 4. Linked Disambiguated Distributional Networks for WSD
      - Hybrid Aligned Resource (HAR) by Faralli et al. (2016)
      - Distributional sense representations linked to a lexical resource (WordNet, ...)
      - Sample entries of the HAR for the words “mouse” and “keyboard”.
    [Figure annotation: distributional corpus-derived information]
  • 5. Linked Disambiguated Distributional Networks for WSD
      - Hybrid Aligned Resource (HAR) by Faralli et al. (2016)
      - Distributional sense representations linked to a lexical resource (WordNet, ...)
      - Sample entries of the HAR for the words “mouse” and “keyboard”.
    [Figure annotation: information from the knowledge base]
  • 6. Linked Disambiguated Distributional Networks for WSD
    Construction of the Hybrid Aligned Resource (HAR):
      1. Building a Distributional Thesaurus (DT).
      2. Word Sense Induction.
      3. Labeling Word Senses with Hypernyms.
      4. Disambiguation of Related Terms and Hypernyms.
      5. Retrieval of Context Clues.
  • 7. Linked Disambiguated Distributional Networks for WSD
    Construction of the Hybrid Aligned Resource (HAR):
      1. Building a Distributional Thesaurus (DT).
      2. Word Sense Induction.
      3. Labeling Word Senses with Hypernyms.
      4. Disambiguation of Related Terms and Hypernyms.
      5. Retrieval of Context Clues.
    HAR datasets used in our experiment (Faralli et al., 2016):
      − news: a 100 million sentence news corpus; average polysemy of 2.3
      − wiki: a 35 million sentence Wikipedia corpus; average polysemy of 1.8
  • 8. Linked Disambiguated Distributional Networks for WSD
    Using the Hybrid Aligned Resource in Word Sense Disambiguation:
      − WordNet: this baseline model relies solely on WordNet:
          • Synonyms
          • Glosses
          • Target synset + synsets directly connected to it
      − WordNet + Related: augments the WordNet-based representation with related terms from the corpus-induced word senses.
      − WordNet + Related + Context: all features of the previous model plus context clues obtained by aggregating features of the sense cluster words.
  • 9. Linked Disambiguated Distributional Networks for WSD
    The third sense of the word “disk” in WordNet:
      - The initial WordNet-based sense representation
        vs.
      - The sense representation enriched via linking to the HAR (enriched with related words from the HAR)
  • 10. Evaluation: Research Questions
    RQ 1: Does the enriched sense representation improve WSD performance compared to the original WordNet-based representations?
  • 11. Evaluation: Research Questions
    RQ 1: Does the linked sense representation improve WSD performance compared to the original WordNet-based sense representation?
    RQ 2: What is the quality of our approach compared to the SOTA unsupervised knowledge-based WSD systems?
  • 12. Evaluation: Dataset and Evaluation Metrics
    SemEval-2007 Task 16 “Evaluation of wide-coverage knowledge resources” (Cuadros and Rigau, 2007):
      - specifically designed for evaluating the impact of lexical resources on the WSD performance
  • 13. Evaluation: Dataset and Evaluation Metrics
    SemEval-2007 Task 16 “Evaluation of wide-coverage knowledge resources” (Cuadros and Rigau, 2007):
      - specifically designed for evaluating the impact of lexical resources on the WSD performance
      - the task dataset is based on the WordNet-labeled sentences from:
          - Senseval-3 (Mihalcea et al., 2004)
          - SemEval-2007 Task 17 (Pradhan et al., 2007)
  • 14. RQ1: Results
    Does the linked sense representation improve WSD performance compared to the original WordNet-based sense representation?
  • 15. RQ1: Dataset and Evaluation Metrics
    Does the linked sense representation improve WSD performance compared to the original WordNet-based sense representation?
  • 16. RQ2: Baselines
    What is the quality of our approach compared to the SOTA unsupervised knowledge-based WSD systems?
    The state-of-the-art unsupervised knowledge-based methods:
      - WN+XWN (Cuadros and Rigau, 2007): WordNet + eXtended WordNet (parsing WordNet glosses)
      - KnowNet (Cuadros and Rigau, 2008): based on snippets retrieved with a web search engine
      - BabelNet (Navigli and Ponzetto, 2012): Wikipedia articles + WordNet synsets
      - NASARI (Camacho-Collados et al., 2015):
          - vector representations of senses based on Wikipedia and WordNet
          - lexical or sense-based feature spaces
          - the links between WordNet and Wikipedia are retrieved from BabelNet
  • 17. RQ2: Results
    What is the quality of our approach compared to the SOTA unsupervised knowledge-based WSD systems?
  • 18. RQ2: Results
    What is the quality of our approach compared to the SOTA unsupervised knowledge-based WSD systems?
  • 19. Conclusions
      − We presented a novel approach to knowledge-based WSD:
          • Learning sparse distributional sense representations from text;
          • Linking them to the language resource (LR);
          • Expanding sense representations of the LR.
      − Possibility to use large corpora: not limited to Wikipedia-linked texts as in BabelNet, NASARI.
  • 20. Conclusions
      − We presented a novel approach to knowledge-based WSD:
          • Learning sparse distributional sense representations from text;
          • Linking them to the language resource (LR);
          • Expanding sense representations of the LR.
      − A possibility to use large corpora: the method is not limited to Wikipedia-linked texts as in BabelNet, NASARI.
      − RQ1: Distributional sense representations let us substantially outperform the model based solely on the lexical resource.
      − RQ2: Comparable performance to the state-of-the-art hybrid approaches leveraging corpus-based features.
  • 21. We acknowledge the support of:
  • 22. References
    [1] Chris Biemann and Martin Riedl. 2013. Text: Now in 2D! A Framework for Lexical Expansion with Contextual Similarity. Journal of Language Modelling, 1(1):55–95.
    [2] Chris Biemann. 2006. Chinese Whispers - an efficient graph clustering algorithm and its application to natural language processing problems. In Proceedings of TextGraphs: the First Workshop on Graph Based Methods for Natural Language Processing, pages 73–80, New York City. Association for Computational Linguistics.
    [3] José Camacho-Collados, Mohammad Taher Pilehvar, and Roberto Navigli. 2015. NASARI: a novel approach to a semantically-aware representation of items. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 567–577, Denver, Colorado. Association for Computational Linguistics.
    [4] Montse Cuadros and German Rigau. 2007. SemEval-2007 Task 16: Evaluation of wide coverage knowledge resources. In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pages 81–86, Prague, Czech Republic. Association for Computational Linguistics.
    [5] Montse Cuadros and German Rigau. 2008. KnowNet: Building a large net of knowledge from the web. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 161–168, Manchester, UK, August. Coling 2008 Organizing Committee.
    [6] Stefano Faralli, Alexander Panchenko, Chris Biemann, and Simone P. Ponzetto. 2016. Linked disambiguated distributional semantic networks. In International Semantic Web Conference (ISWC’2016), pages 56–64, Kobe, Japan. Springer.
    [7] Roberto Navigli and Simone Paolo Ponzetto. 2012. BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artificial Intelligence, 193:217–250.
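
The three sense representations listed on slide 8 (WordNet, WordNet + Related, WordNet + Related + Context) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the word lists are hand-made toy data (not actual WordNet or HAR entries), the names `build_representation` and `disambiguate` are hypothetical, and cosine similarity over bag-of-words counts is one plausible scoring choice, since the slides do not state the scoring function that was used.

```python
import math
from collections import Counter

# Toy, hand-made entries (hypothetical; NOT taken from WordNet or the HAR).
WORDNET = {  # synonyms, gloss words, words of directly connected synsets
    "disk#1": ["disk", "disc", "flat", "circular", "plate"],
    "disk#3": ["disk", "magnetic", "memory", "device", "storage"],
}
RELATED = {  # related terms of the linked corpus-induced sense
    "disk#1": ["wheel", "rotor", "brake"],
    "disk#3": ["floppy", "drive", "cd-rom", "partition"],
}
CONTEXT = {  # context clues aggregated over the sense cluster words
    "disk#1": ["spin", "metal"],
    "disk#3": ["format", "boot", "install"],
}

MODELS = ("wordnet", "wordnet+related", "wordnet+related+context")


def build_representation(sense, model):
    """Return a sparse bag-of-words vector (a Counter) for one sense."""
    vec = Counter(WORDNET[sense])
    if model in ("wordnet+related", "wordnet+related+context"):
        vec.update(RELATED.get(sense, []))
    if model == "wordnet+related+context":
        vec.update(CONTEXT.get(sense, []))
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def disambiguate(context_tokens, senses, model):
    """Pick the sense most similar to the context; None if nothing overlaps."""
    ctx = Counter(context_tokens)
    scores = {s: cosine(ctx, build_representation(s, model)) for s in senses}
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores


# Context words around the target word "disk" (the target itself is left out).
tokens = ["format", "the", "floppy", "before", "you", "install"]
senses = ["disk#1", "disk#3"]
for model in MODELS:
    print(model, disambiguate(tokens, senses, model)[0])
```

On this toy input the WordNet-only representation shares no word with the context, so the sketch abstains, while both enriched representations select `disk#3`. This mirrors the motivation for the enrichment in a deliberately simplified form and says nothing about the actual evaluation results.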