SlideShare una empresa de Scribd logo
1 de 1
Descargar para leer sin conexión
Perceptually Grounded Selectional Preferences
Katia Shutova es407@cam.ac.uk
https://www.cl.cam.ac.uk/~es407/
Niket Tandon ntandon@mpi-inf.mpg.de
https://www.mpi-inf.mpg.de/~ntandon/
Gerard de Melo gdm@demelo.org
http://gerard.demelo.org
Contact
1. Philip Resnik (1993). Selection and information: A class-based approach to lexical relationships. Technical report, Univ. of Pennsylvania.
2. Frank Keller & Mirella Lapata (2003). Using the Web to obtain frequencies for unseen bigrams. Comp. Ling. 29(3):459–484.
3. Mats Rooth, Stefan Riezler, Detlef Prescher, Glenn Carroll, Franz Beil (1999). Inducing a semantically annotated lexicon via
EM-based clustering. Proc. ACL 1999.
4. Sebastian Pado, Ulrike Pado, Katrin Erk (2007). Flexible, corpus-based modelling of human plausibility judgements.
Proc. EMNLP-CoNLL 2007.
5. Diarmuid O ́Seaghdha (2010). Latent variable models of selectional preference. Proc. ACL 2010.
6. Ekaterina Shutova (2010) . Automatic metaphor interpretation as a paraphrasing task. Proc. NAACL 2010.
References
Selectional Preferences are semantic constraints of a predicate
on its arguments
The authors wrote a new paper. ✔ high plausibility
The paper wrote a new author. ✘ Very low plausibility
The cat is eating your sausage. ✔ high plausibility
The carrot is eating your keys. ✘ Very low plausibility
Knowledge of selectional preferences is useful in many NLP tasks:
●
Word Sense Disambiguation
●
Parsing (resolving attachments)
●
Semantic Role Labelling
●
Natural Language Inference
●
Detecting multi-word expressions
●
Etc.
What are Selectional Preferences?
Previous work uses purely text-based methods:
●
Problem of topic bias / figurative uses of words: E.g. “cut” mainly occurs with
“cost” and “price” as arguments in the BNC.
●
→ Skew towards abstract uses, different from our daily life experience of cutting
Our Approach: Use Multimodal Data
●
BNC for text (parsed using RASP parser)
●
100 million Flickr images/videos from Yahoo! Webscope Flickr-100M dataset
Challenge: From a set of Flickr Tags to noun–verb pairs
Collecting Multimodal Correlations
Step 1: Acquisition of Argument Classes
Observed data is sparse → Need to generalize
Spectral Clustering of nouns using Jensen-Shannon divergence as sim. measure
Step 2: Quantifying Selectional Preferences
Selectional Preference Model
Shutova (2010) approach: metaphor interpretation as paraphrasing
“a carelessly leaked report” → “a carelessly disclosed report”
1) Take maximum likelihood candidate verbs
2) Filter by semantic similarity to target verb
3) Filter for a strong selectional preference fit (assuming it indicates literalness or
conventionality) so as to remove figurative uses
Application to Metaphor Interpretation
Multimodal selectional preferences outperform
●
purely linguistic and visual models, and
●
previous state-of-the-art models
Conclusions
Method
Seen
Dataset
Unseen
Dataset
Rooth et al. (1999) EM 0.487 0.520
Pado et al. (2007)
VSM
0.490 0.430
O'Seaghda (2010) LDA 0.548 0.605
Visual Model 0.126 0.132
Linguistic Model 0.688 0.559
Interpolated Model 0.728 0.430
Direct Evaluation
mother
sitting
baby
lap
rachel lind
wristwatch
pajamas
Clothes
etc.
Ekaterina Shutova Niket Tandon Gerard de Melo
University of Cambridge Max Planck Institute
for Informatics
Tsinghua University
Shutova (2010) LSP ISP
Mean Avg. Prec. (MAP) on
Shutova (2010) gold data 0.62 0.62 0.65
Results on Keller & Lapata (2003)
Datasets (Spearman Rho)
Visual Features: verb lemmas
co-occurring with nouns
Linguistic Features:
grammatical relations
Approach
1) Stemming
2) Filtering:
Remove rare words
and named entities
3) POS tagging:
by jointly disambiguating
tags to WordNet synsets
so as to maximize
coherence
WordNet
priors
similarities
https://www.flickr.com/photos/seandreilinger/465827703/
canon
rebel
400D
ball
portfolio
yellow
serve
website
racket
roland
garros
etc.
https://www.flickr.com/photos/pysanchis/2521372121/

Más contenido relacionado

Más de Gerard de Melo

Information Extraction from Web-Scale N-Gram Data
Information Extraction from Web-Scale N-Gram DataInformation Extraction from Web-Scale N-Gram Data
Information Extraction from Web-Scale N-Gram DataGerard de Melo
 
UWN: A Large Multilingual Lexical Knowledge Base
UWN: A Large Multilingual Lexical Knowledge BaseUWN: A Large Multilingual Lexical Knowledge Base
UWN: A Large Multilingual Lexical Knowledge BaseGerard de Melo
 
Multilingual Text Classification using Ontologies
Multilingual Text Classification using OntologiesMultilingual Text Classification using Ontologies
Multilingual Text Classification using OntologiesGerard de Melo
 
Extracting Sense-Disambiguated Example Sentences From Parallel Corpora
Extracting Sense-Disambiguated Example Sentences From Parallel CorporaExtracting Sense-Disambiguated Example Sentences From Parallel Corpora
Extracting Sense-Disambiguated Example Sentences From Parallel CorporaGerard de Melo
 
Towards a Universal Wordnet by Learning from Combined Evidence
Towards a Universal Wordnet by Learning from Combined EvidenceTowards a Universal Wordnet by Learning from Combined Evidence
Towards a Universal Wordnet by Learning from Combined EvidenceGerard de Melo
 
Not Quite the Same: Identity Constraints for the Web of Linked Data
Not Quite the Same: Identity Constraints for the Web of Linked DataNot Quite the Same: Identity Constraints for the Web of Linked Data
Not Quite the Same: Identity Constraints for the Web of Linked DataGerard de Melo
 
Good, Great, Excellent: Global Inference of Semantic Intensities
Good, Great, Excellent: Global Inference of Semantic IntensitiesGood, Great, Excellent: Global Inference of Semantic Intensities
Good, Great, Excellent: Global Inference of Semantic IntensitiesGerard de Melo
 
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged OntologyYAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged OntologyGerard de Melo
 

Más de Gerard de Melo (8)

Information Extraction from Web-Scale N-Gram Data
Information Extraction from Web-Scale N-Gram DataInformation Extraction from Web-Scale N-Gram Data
Information Extraction from Web-Scale N-Gram Data
 
UWN: A Large Multilingual Lexical Knowledge Base
UWN: A Large Multilingual Lexical Knowledge BaseUWN: A Large Multilingual Lexical Knowledge Base
UWN: A Large Multilingual Lexical Knowledge Base
 
Multilingual Text Classification using Ontologies
Multilingual Text Classification using OntologiesMultilingual Text Classification using Ontologies
Multilingual Text Classification using Ontologies
 
Extracting Sense-Disambiguated Example Sentences From Parallel Corpora
Extracting Sense-Disambiguated Example Sentences From Parallel CorporaExtracting Sense-Disambiguated Example Sentences From Parallel Corpora
Extracting Sense-Disambiguated Example Sentences From Parallel Corpora
 
Towards a Universal Wordnet by Learning from Combined Evidence
Towards a Universal Wordnet by Learning from Combined EvidenceTowards a Universal Wordnet by Learning from Combined Evidence
Towards a Universal Wordnet by Learning from Combined Evidence
 
Not Quite the Same: Identity Constraints for the Web of Linked Data
Not Quite the Same: Identity Constraints for the Web of Linked DataNot Quite the Same: Identity Constraints for the Web of Linked Data
Not Quite the Same: Identity Constraints for the Web of Linked Data
 
Good, Great, Excellent: Global Inference of Semantic Intensities
Good, Great, Excellent: Global Inference of Semantic IntensitiesGood, Great, Excellent: Global Inference of Semantic Intensities
Good, Great, Excellent: Global Inference of Semantic Intensities
 
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged OntologyYAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
YAGO-SUMO: Integrating YAGO into the Suggested Upper Merged Ontology
 

Último

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2
 

Último (20)

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 

Perceptually Grounded Selectional Preferences – Using Flickr Image and Video tags for Natural Language Semantics

  • 1. Perceptually Grounded Selectional Preferences Katia Shutova es407@cam.ac.uk https://www.cl.cam.ac.uk/~es407/ Niket Tandon ntandon@mpi-inf.mpg.de https://www.mpi-inf.mpg.de/~ntandon/ Gerard de Melo gdm@demelo.org http://gerard.demelo.org Contact 1. Philip Resnik (1993). Selection and information: A class-based approach to lexical relationships. Technical report, Univ. of Pennsylvania. 2. Frank Keller & Mirella Lapata (2003). Using the Web to obtain frequencies for unseen bigrams. Comp. Ling. 29(3):459–484. 3. Mats Rooth, Stefan Riezler, Detlef Prescher, Glenn Carroll, Franz Beil (1999). Inducing a semantically annotated lexicon via EM-based clustering. Proc. ACL 1999. 4. Sebastian Pado, Ulrike Pado, Katrin Erk (2007). Flexible, corpus-based modelling of human plausibility judgements. Proc. EMNLP-CoNLL 2007. 5. Diarmuid O ́Seaghdha (2010). Latent variable models of selectional preference. Proc. ACL 2010. 6. Ekaterina Shutova (2010) . Automatic metaphor interpretation as a paraphrasing task. Proc. NAACL 2010. References Selectional Preferences are semantic constraints of a predicate on its arguments The authors wrote a new paper. ✔ high plausibility The paper wrote a new author. ✘ Very low plausibility The cat is eating your sausage. ✔ high plausibility The carrot is eating your keys. ✘ Very low plausibility Knowledge of selectional preferences is useful in many NLP tasks: ● Word Sense Disambiguation ● Parsing (resolving attachments) ● Semantic Role Labelling ● Natural Language Inference ● Detecting multi-word expressions ● Etc. What are Selectional Preferences? Previous work uses purely text-based methods: ● Problem of topic bias / figurative uses of words: E.g. “cut” mainly occurs with “cost” and “price” as arguments in the BNC. ● → Skew towards abstract uses, different from our daily life experience of cutting Our Approach: Use Multimodal Data ● BNC for text (parsed using RASP parser) ● 100 million Flickr images/videos from Yahoo! Webscope Flickr-100M dataset Challenge: From a set of Flickr Tags to noun–verb pairs Collecting Multimodal Correlations Step 1: Acquisition of Argument Classes Observed data is sparse → Need to generalize Spectral Clustering of nouns using Jensen-Shannon divergence as sim. measure Step 2: Quantifying Selectional Preferences Selectional Preference Model Shutova (2010) approach: metaphor interpretation as paraphrasing “a carelessly leaked report” → “a carelessly disclosed report” 1) Take maximum likelihood candidate verbs 2) Filter by semantic similarity to target verb 3) Filter for a strong selectional preference fit (assuming it indicates literalness or conventionality) so as to remove figurative uses Application to Metaphor Interpretation Multimodal selectional preferences outperform ● purely linguistic and visual models, and ● previous state-of-the-art models Conclusions Method Seen Dataset Unseen Dataset Rooth et al. (1999) EM 0.487 0.520 Pado et al. (2007) VSM 0.490 0.430 O'Seaghda (2010) LDA 0.548 0.605 Visual Model 0.126 0.132 Linguistic Model 0.688 0.559 Interpolated Model 0.728 0.430 Direct Evaluation mother sitting baby lap rachel lind wristwatch pajamas Clothes etc. Ekaterina Shutova Niket Tandon Gerard de Melo University of Cambridge Max Planck Institute for Informatics Tsinghua University Shutova (2010) LSP ISP Mean Avg. Prec. (MAP) on Shutova (2010) gold data 0.62 0.62 0.65 Results on Keller & Lapata (2003) Datasets (Spearman Rho) Visual Features: verb lemmas co-occurring with nouns Linguistic Features: grammatical relations Approach 1) Stemming 2) Filtering: Remove rare words and named entities 3) POS tagging: by jointly disambiguating tags to WordNet synsets so as to maximize coherence WordNet priors similarities https://www.flickr.com/photos/seandreilinger/465827703/ canon rebel 400D ball portfolio yellow serve website racket roland garros etc. https://www.flickr.com/photos/pysanchis/2521372121/