SlideShare una empresa de Scribd logo
1 de 31
Descargar para leer sin conexión
ACMMM2013 reading
@ Kanto CV 2014.2.23
Akisato Kimura (@_akisato)
NTT Communication Science Labs
Paper to read
Sentiment analysis of images
Basic strategy
• Adjective noun pairs (ANPs)
– Adjectives play a significant role in conveying
sentiments, but visually inconsistent.
– Combined phrases make the concepts more
detectable than single adj. & n.
• cf. Recognition using visual phrases
[CVPR11]
Contributions
• Automatically construct a large-scale Visual
Sentiment Ontology (VSO) with 3000 ANPs
– With the help of psychological theories and web
mining techniques

• Propose SentiBank: a visual concept detector
library to detect the presence of 1200 ANPs
– Useful for sentiment analysis of visual contents as
attributes
Framework
1. Select 24 fundamental words representing emotion
2. Retrieve images with every of the words as a query
3. Tags associated with the images are extracted to
build ANPs ( = strong sentiment ADJs + all Ns)
4. Train ANP detectors and keep only detectors with
reasonable performance to form SentiBank
Framework
1. Select 24 fundamental words representing emotion
2. Retrieve images with every of the words as a query
3. Tags associated with the images are extracted to
build ANPs ( = strong sentiment ADJs + all Ns)
4. Train ANP detectors and keep only detectors with
reasonable performance to form SentiBank
24 basic words for emotions
• Founded on Plutchik’s Wheel on Emotions
1
1
2
3
4
1
2
3
4
1
2
3
4

1
2
3
4

4
4
3

1

2
2
3
3

4
3
1
2
4
2
3
1
2
1
4
1
2
3 http://en.wikipedia.org/wiki/Plutchik%27s_Wheel_
4 of_Emotions#Plutchik.27s_wheel_of_emotions
24 basic words for emotions (cont.)
• 8 basic emotions
x 3 degrees

1
4

2

3
1
2
3
4
1
2
3
4

3

2

4
1
Framework
1. Select 24 fundamental words representing emotion
2. Retrieve images with every of the words as a query
3. Tags associated with the images are extracted to
build ANPs ( = strong sentiment ADJs + all Ns)
4. Train ANP detectors and keep only detectors with
reasonable performance to form SentiBank
Sentiment word discovery
• Web mining strategy
– Retrieve images & videos from Flickr & YouTube
with each of 24 basic words as a query
– Extract their associated tags by Lookapp tool
[Borth+ ICMR11]
Sentiment word discovery (cont.)
• Exploits various NLP techniques & resources
– Post-processings
• Remove stop words, perform stemming
• Top 100 tags are selected for each emotion

– Sentiment value computation (-1 neg  +1 pos)
• SentiWordNet [Esuli+ 2006] SentiStrength [Thelwall+ 2010]
Framework
1. Select 24 fundamental words representing emotion
2. Retrieve images with every of the words as a query
3. Tags associated with the images are extracted to
build ANPs ( = strong sentiment ADJs + all Ns)
4. Train ANP detectors and keep only detectors with
reasonable performance to form SentiBank
ANP construction
• Take all the pairs of (ADJ, N)s into consideration
– Remove named entities with meaning changed
(e.g. “hot” + “dog”  generic named entity)

• Fuse sentiment values
– Simple sum-up model : s(ANP) = s(ADJ) + s(N)
• If sgn(s(ADJ)) != sgn(s(N)), then s(ANP) = S(ADJ).

• Rank ANPs by their frequency
– Remove all ANPs with no images
– Resulting in 47K ANP candidates
ANP construction (cont.)
• Ontology sampling
– Partition candidates into individual ADJ sets
– Sample a subset from each ADJ set
– Take ANPs with sufficient (>125) images

• Linking back to emotions
– For each ANP, count images with 24 basic words & the ANP
in their meta, create a 24-dim histogram
How reliable ANP labels are?
• Web annotation may not be reliable
– Using Flickr tags as pseudo ANP labels might incur
false positive

• Manual (=AMT) validation
– Randomly sample images of 200 ANPs
– Each image is validated by 3 Turkers, treated as
correct only if >= 2 Turkers agree
– Results: 97% correct
http://visual-sentiment-ontology.appspot.com
Framework
1. Select 24 fundamental words representing emotion
2. Retrieve images with every of the words as a query
3. Tags associated with the images are extracted to
build ANPs ( = strong sentiment ADJs + all Ns)
4. Train ANP detectors and keep only detectors with
reasonable performance to form SentiBank
Training ANP detectors
• Various visual features
– Color histogram (3 colors x 256 dim), GIST (512 dim),
LBP (53 dim), BoW with spatial pyramid and max
pooling (1000 dim x 2 layers), attributes [Yu+ CVPR13]
(2000 dim)

• Training a linear SVM for every ANP
– Parameter tuning by cross validation (AP@20-based)
– Measure performance by AP@20, AUC & F-score.

• Several feature fusions
– Early fusion, late fusion, weighted early/late fusion
Detector performance
• Comparing visual features (left)
– 1st: attributes, 2nd: BoWs

• Comparing feature fusions (right)
– 1st: Weighted late fusion, but not dominant
– Adopt early fusion for implementation simplicity
Examples
Detectability issues
• Select only ANPs with good detection accuracy
– 1200 ANPs with AP@20>0 & F-score>0.6

• No correlation bwt detectability & occurrence
– Difficulty in detecting ANPs depends on the
content diversity and the abstract level
Other issues
• Special visual features improve detectors
– ObjectBank [Li+ NIPS2010], facial features, aesthetic
features [Bhattacharya+ ACMMM13]

• Ontology structure
– Interactive process to combine 1200 ANPs into distinct
groups  6 levels, 15 nodes at the top
• N: standard “is-a” relations
• ADJ: exclusive (“sad” vs “happy”) & strength (“nice”, “great”,
“awesome”)

– 41% nouns uncovered by ImageNet
• Related to abstract concepts (e.g. “violence”, “religion”)
Framework
1. Select 24 fundamental words representing emotion
2. Retrieve images with every of the words as a query
3. Tags associated with the images are extracted to
build ANPs ( = strong sentiment ADJs + all Ns)
4. Train ANP detectors and keep only detectors with
reasonable performance to form SentiBank
SentiBank applications
• Sentiment prediction in image tweets
– Sentiment analysis rely on text-based tools
– 140 characters (in ENG) are too short
– Use SentiBank to complement and augment texts

• Emotion classification
– Demonstrate the performance against an emotion
dataset of art photos [Machajdik+ ACMMM10]
Sentiment prediction in tweets
• Data collection
– Gather tweets with images & popular hashtags
• #nuclearpower, #election, #championsleague, #cairo …

– AMT to obtain sentiment ground-truth
• 3 Turkers for every tweets: almost agreed (below)

http://www.ee.columbia.ed
u/ln/dvmm/vso/download/t
witter_dataset.html
Sentiment prediction in tweets (cont.)
• Visual-based classifier
– Serve SentiBank as a mid-level representation
• Use ANP responses as an input feature
• Employ a linear classifier for the final output

– Compare SentiBank with low-level features
Sentiment prediction in tweets (cont.)
• Text-based classifier
– Naïve Bayes + SentiStrength

• Overall performance
Sentiment prediction in tweets (cont.)
• Detailed performance
Emotion classification
• Dataset
– 807 art photos, 8 emotion categories retrieved
from DeviantArt.com
Takeaway messages
• To appear in Tomorrow’s meeting

Más contenido relacionado

Similar a ACMMM 2013 reading: Large-scale visual sentiment ontology and detectors using adjective noun pairs

Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on TwitterSubarno Pal
 
A Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection SupervisorA Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection SupervisorIOSR Journals
 
Elderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detectionElderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detectionTanvi Mittal
 
Traffic_Sign_Recognition_Using_CNN_-_PPT.pptx
Traffic_Sign_Recognition_Using_CNN_-_PPT.pptxTraffic_Sign_Recognition_Using_CNN_-_PPT.pptx
Traffic_Sign_Recognition_Using_CNN_-_PPT.pptxpritisharma1970
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Jonathon Hare
 
Facial emotion detection on babies' emotional face using Deep Learning.
Facial emotion detection on babies' emotional face using Deep Learning.Facial emotion detection on babies' emotional face using Deep Learning.
Facial emotion detection on babies' emotional face using Deep Learning.Takrim Ul Islam Laskar
 
Predicting Emotions through Facial Expressions
Predicting Emotions through Facial Expressions  Predicting Emotions through Facial Expressions
Predicting Emotions through Facial Expressions twinkle singh
 
B8_Mini project_Final review ppt.pptx
B8_Mini project_Final review ppt.pptxB8_Mini project_Final review ppt.pptx
B8_Mini project_Final review ppt.pptxEgguIqbal
 
REVIEW PPT.pptx
REVIEW PPT.pptxREVIEW PPT.pptx
REVIEW PPT.pptxSaravanaD2
 
Emotionally Intelligent Music Player.pptx
Emotionally Intelligent Music Player.pptxEmotionally Intelligent Music Player.pptx
Emotionally Intelligent Music Player.pptxxelihom699
 
EMOTION RECOGNITION SYSTEMS: A REVIEW
EMOTION RECOGNITION SYSTEMS: A REVIEWEMOTION RECOGNITION SYSTEMS: A REVIEW
EMOTION RECOGNITION SYSTEMS: A REVIEWIRJET Journal
 
Facial emotion recognition
Facial emotion recognitionFacial emotion recognition
Facial emotion recognitionRahin Patel
 
SPEECH BASED EMOTION RECOGNITION USING VOICE
SPEECH BASED  EMOTION RECOGNITION USING VOICESPEECH BASED  EMOTION RECOGNITION USING VOICE
SPEECH BASED EMOTION RECOGNITION USING VOICEVamshidharSingh
 

Similar a ACMMM 2013 reading: Large-scale visual sentiment ontology and detectors using adjective noun pairs (20)

Sentiment Analysis on Twitter
Sentiment Analysis on TwitterSentiment Analysis on Twitter
Sentiment Analysis on Twitter
 
A Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection SupervisorA Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection Supervisor
 
Elderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detectionElderly Assistance- Deep Learning Theme detection
Elderly Assistance- Deep Learning Theme detection
 
Traffic_Sign_Recognition_Using_CNN_-_PPT.pptx
Traffic_Sign_Recognition_Using_CNN_-_PPT.pptxTraffic_Sign_Recognition_Using_CNN_-_PPT.pptx
Traffic_Sign_Recognition_Using_CNN_-_PPT.pptx
 
InternshipReport
InternshipReportInternshipReport
InternshipReport
 
Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...Saliency-based Models of Image Content and their Application to Auto-Annotati...
Saliency-based Models of Image Content and their Application to Auto-Annotati...
 
Face recognition
Face recognition Face recognition
Face recognition
 
Facial emotion detection on babies' emotional face using Deep Learning.
Facial emotion detection on babies' emotional face using Deep Learning.Facial emotion detection on babies' emotional face using Deep Learning.
Facial emotion detection on babies' emotional face using Deep Learning.
 
Final year ppt
Final year pptFinal year ppt
Final year ppt
 
AINL 2016: Moskvichev
AINL 2016: MoskvichevAINL 2016: Moskvichev
AINL 2016: Moskvichev
 
Predicting Emotions through Facial Expressions
Predicting Emotions through Facial Expressions  Predicting Emotions through Facial Expressions
Predicting Emotions through Facial Expressions
 
B8_Mini project_Final review ppt.pptx
B8_Mini project_Final review ppt.pptxB8_Mini project_Final review ppt.pptx
B8_Mini project_Final review ppt.pptx
 
REVIEW PPT.pptx
REVIEW PPT.pptxREVIEW PPT.pptx
REVIEW PPT.pptx
 
Emotionally Intelligent Music Player.pptx
Emotionally Intelligent Music Player.pptxEmotionally Intelligent Music Player.pptx
Emotionally Intelligent Music Player.pptx
 
Ai use cases
Ai use casesAi use cases
Ai use cases
 
inam
inaminam
inam
 
EMOTION RECOGNITION SYSTEMS: A REVIEW
EMOTION RECOGNITION SYSTEMS: A REVIEWEMOTION RECOGNITION SYSTEMS: A REVIEW
EMOTION RECOGNITION SYSTEMS: A REVIEW
 
Facial emotion recognition
Facial emotion recognitionFacial emotion recognition
Facial emotion recognition
 
Emotion intelligence
Emotion intelligenceEmotion intelligence
Emotion intelligence
 
SPEECH BASED EMOTION RECOGNITION USING VOICE
SPEECH BASED  EMOTION RECOGNITION USING VOICESPEECH BASED  EMOTION RECOGNITION USING VOICE
SPEECH BASED EMOTION RECOGNITION USING VOICE
 

Más de Akisato Kimura

Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...Akisato Kimura
 
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...Akisato Kimura
 
多変量解析の一般化
多変量解析の一般化多変量解析の一般化
多変量解析の一般化Akisato Kimura
 
CVPR2016 reading - 特徴量学習とクロスモーダル転移について
CVPR2016 reading - 特徴量学習とクロスモーダル転移についてCVPR2016 reading - 特徴量学習とクロスモーダル転移について
CVPR2016 reading - 特徴量学習とクロスモーダル転移についてAkisato Kimura
 
NIPS2015 reading - Learning visual biases from human imagination
NIPS2015 reading - Learning visual biases from human imaginationNIPS2015 reading - Learning visual biases from human imagination
NIPS2015 reading - Learning visual biases from human imaginationAkisato Kimura
 
CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Global refinement of random forest"CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Global refinement of random forest"Akisato Kimura
 
CVPR2015 reading "Understainding image virality" (in Japanese)
CVPR2015 reading "Understainding image virality" (in Japanese)CVPR2015 reading "Understainding image virality" (in Japanese)
CVPR2015 reading "Understainding image virality" (in Japanese)Akisato Kimura
 
Computational models of human visual attention driven by auditory cues
Computational models of human visual attention driven by auditory cuesComputational models of human visual attention driven by auditory cues
Computational models of human visual attention driven by auditory cuesAkisato Kimura
 
NIPS2014 reading - Top rank optimization in linear time
NIPS2014 reading - Top rank optimization in linear timeNIPS2014 reading - Top rank optimization in linear time
NIPS2014 reading - Top rank optimization in linear timeAkisato Kimura
 
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...Akisato Kimura
 
ICCV2013 reading: Learning to rank using privileged information
ICCV2013 reading: Learning to rank using privileged informationICCV2013 reading: Learning to rank using privileged information
ICCV2013 reading: Learning to rank using privileged informationAkisato Kimura
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsAkisato Kimura
 
関西CVPR勉強会 2012.10.28
関西CVPR勉強会 2012.10.28関西CVPR勉強会 2012.10.28
関西CVPR勉強会 2012.10.28Akisato Kimura
 
関西CVPR勉強会 2012.7.29
関西CVPR勉強会 2012.7.29関西CVPR勉強会 2012.7.29
関西CVPR勉強会 2012.7.29Akisato Kimura
 
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)Akisato Kimura
 
関西CVPRML勉強会(特定物体認識) 2012.1.14
関西CVPRML勉強会(特定物体認識) 2012.1.14関西CVPRML勉強会(特定物体認識) 2012.1.14
関西CVPRML勉強会(特定物体認識) 2012.1.14Akisato Kimura
 
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -Akisato Kimura
 
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明Akisato Kimura
 
立命館大学 AMLコロキウム 2011.10.20
立命館大学 AMLコロキウム 2011.10.20立命館大学 AMLコロキウム 2011.10.20
立命館大学 AMLコロキウム 2011.10.20Akisato Kimura
 

Más de Akisato Kimura (20)

Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
 
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
 
多変量解析の一般化
多変量解析の一般化多変量解析の一般化
多変量解析の一般化
 
CVPR2016 reading - 特徴量学習とクロスモーダル転移について
CVPR2016 reading - 特徴量学習とクロスモーダル転移についてCVPR2016 reading - 特徴量学習とクロスモーダル転移について
CVPR2016 reading - 特徴量学習とクロスモーダル転移について
 
NIPS2015 reading - Learning visual biases from human imagination
NIPS2015 reading - Learning visual biases from human imaginationNIPS2015 reading - Learning visual biases from human imagination
NIPS2015 reading - Learning visual biases from human imagination
 
CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Global refinement of random forest"CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Global refinement of random forest"
 
CVPR2015 reading "Understainding image virality" (in Japanese)
CVPR2015 reading "Understainding image virality" (in Japanese)CVPR2015 reading "Understainding image virality" (in Japanese)
CVPR2015 reading "Understainding image virality" (in Japanese)
 
Computational models of human visual attention driven by auditory cues
Computational models of human visual attention driven by auditory cuesComputational models of human visual attention driven by auditory cues
Computational models of human visual attention driven by auditory cues
 
NIPS2014 reading - Top rank optimization in linear time
NIPS2014 reading - Top rank optimization in linear timeNIPS2014 reading - Top rank optimization in linear time
NIPS2014 reading - Top rank optimization in linear time
 
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
 
ICCV2013 reading: Learning to rank using privileged information
ICCV2013 reading: Learning to rank using privileged informationICCV2013 reading: Learning to rank using privileged information
ICCV2013 reading: Learning to rank using privileged information
 
IJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphsIJCAI13 Paper review: Large-scale spectral clustering on graphs
IJCAI13 Paper review: Large-scale spectral clustering on graphs
 
関西CVPR勉強会 2012.10.28
関西CVPR勉強会 2012.10.28関西CVPR勉強会 2012.10.28
関西CVPR勉強会 2012.10.28
 
関西CVPR勉強会 2012.7.29
関西CVPR勉強会 2012.7.29関西CVPR勉強会 2012.7.29
関西CVPR勉強会 2012.7.29
 
ICWSM12 Brief Review
ICWSM12 Brief ReviewICWSM12 Brief Review
ICWSM12 Brief Review
 
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
 
関西CVPRML勉強会(特定物体認識) 2012.1.14
関西CVPRML勉強会(特定物体認識) 2012.1.14関西CVPRML勉強会(特定物体認識) 2012.1.14
関西CVPRML勉強会(特定物体認識) 2012.1.14
 
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
 
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
 
立命館大学 AMLコロキウム 2011.10.20
立命館大学 AMLコロキウム 2011.10.20立命館大学 AMLコロキウム 2011.10.20
立命館大学 AMLコロキウム 2011.10.20
 

Último

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 

Último (20)

From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 

ACMMM 2013 reading: Large-scale visual sentiment ontology and detectors using adjective noun pairs

  • 1. ACMMM2013 reading @ Kanto CV 2014.2.23 Akisato Kimura (@_akisato) NTT Communication Science Labs
  • 4. Basic strategy • Adjective noun pairs (ANPs) – Adjectives play a significant role in conveying sentiments, but visually inconsistent. – Combined phrases make the concepts more detectable than single adj. & n. • cf. Recognition using visual phrases [CVPR11]
  • 5. Contributions • Automatically construct a large-scale Visual Sentiment Ontology (VSO) with 3000 ANPs – With the help of psychological theories and web mining techniques • Propose SentiBank: a visual concept detector library to detect the presence of 1200 ANPs – Useful for sentiment analysis of visual contents as attributes
  • 6. Framework 1. Select 24 fundamental words representing emotion 2. Retrieve images with every of the words as a query 3. Tags associated with the images are extracted to build ANPs ( = strong sentiment ADJs + all Ns) 4. Train ANP detectors and keep only detectors with reasonable performance to form SentiBank
  • 7. Framework 1. Select 24 fundamental words representing emotion 2. Retrieve images with every of the words as a query 3. Tags associated with the images are extracted to build ANPs ( = strong sentiment ADJs + all Ns) 4. Train ANP detectors and keep only detectors with reasonable performance to form SentiBank
  • 8. 24 basic words for emotions • Founded on Plutchik’s Wheel on Emotions 1 1 2 3 4 1 2 3 4 1 2 3 4 1 2 3 4 4 4 3 1 2 2 3 3 4 3 1 2 4 2 3 1 2 1 4 1 2 3 http://en.wikipedia.org/wiki/Plutchik%27s_Wheel_ 4 of_Emotions#Plutchik.27s_wheel_of_emotions
  • 9. 24 basic words for emotions (cont.) • 8 basic emotions x 3 degrees 1 4 2 3 1 2 3 4 1 2 3 4 3 2 4 1
  • 10. Framework 1. Select 24 fundamental words representing emotion 2. Retrieve images with every of the words as a query 3. Tags associated with the images are extracted to build ANPs ( = strong sentiment ADJs + all Ns) 4. Train ANP detectors and keep only detectors with reasonable performance to form SentiBank
  • 11. Sentiment word discovery • Web mining strategy – Retrieve images & videos from Flickr & YouTube with each of 24 basic words as a query – Extract their associated tags by Lookapp tool [Borth+ ICMR11]
  • 12. Sentiment word discovery (cont.) • Exploits various NLP techniques & resources – Post-processings • Remove stop words, perform stemming • Top 100 tags are selected for each emotion – Sentiment value computation (-1 neg  +1 pos) • SentiWordNet [Esuli+ 2006] SentiStrength [Thelwall+ 2010]
  • 13. Framework 1. Select 24 fundamental words representing emotion 2. Retrieve images with every of the words as a query 3. Tags associated with the images are extracted to build ANPs ( = strong sentiment ADJs + all Ns) 4. Train ANP detectors and keep only detectors with reasonable performance to form SentiBank
  • 14. ANP construction • Take all the pairs of (ADJ, N)s into consideration – Remove named entities with meaning changed (e.g. “hot” + “dog”  generic named entity) • Fuse sentiment values – Simple sum-up model : s(ANP) = s(ADJ) + s(N) • If sgn(s(ADJ)) != sgn(s(N)), then s(ANP) = S(ADJ). • Rank ANPs by their frequency – Remove all ANPs with no images – Resulting in 47K ANP candidates
  • 15. ANP construction (cont.) • Ontology sampling – Partition candidates into individual ADJ sets – Sample a subset from each ADJ set – Take ANPs with sufficient (>125) images • Linking back to emotions – For each ANP, count images with 24 basic words & the ANP in their meta, create a 24-dim histogram
  • 16. How reliable ANP labels are? • Web annotation may not be reliable – Using Flickr tags as pseudo ANP labels might incur false positive • Manual (=AMT) validation – Randomly sample images of 200 ANPs – Each image is validated by 3 Turkers, treated as correct only if >= 2 Turkers agree – Results: 97% correct
  • 18. Framework 1. Select 24 fundamental words representing emotion 2. Retrieve images with every of the words as a query 3. Tags associated with the images are extracted to build ANPs ( = strong sentiment ADJs + all Ns) 4. Train ANP detectors and keep only detectors with reasonable performance to form SentiBank
  • 19. Training ANP detectors • Various visual features – Color histogram (3 colors x 256 dim), GIST (512 dim), LBP (53 dim), BoW with spatial pyramid and max pooling (1000 dim x 2 layers), attributes [Yu+ CVPR13] (2000 dim) • Training a linear SVM for every ANP – Parameter tuning by cross validation (AP@20-based) – Measure performance by AP@20, AUC & F-score. • Several feature fusions – Early fusion, late fusion, weighted early/late fusion
  • 20. Detector performance • Comparing visual features (left) – 1st: attributes, 2nd: BoWs • Comparing feature fusions (right) – 1st: Weighted late fusion, but not dominant – Adopt early fusion for implementation simplicity
  • 22. Detectability issues • Select only ANPs with good detection accuracy – 1200 ANPs with AP@20>0 & F-score>0.6 • No correlation bwt detectability & occurrence – Difficulty in detecting ANPs depends on the content diversity and the abstract level
  • 23. Other issues • Special visual features improve detectors – ObjectBank [Li+ NIPS2010], facial features, aesthetic features [Bhattacharya+ ACMMM13] • Ontology structure – Interactive process to combine 1200 ANPs into distinct groups  6 levels, 15 nodes at the top • N: standard “is-a” relations • ADJ: exclusive (“sad” vs “happy”) & strength (“nice”, “great”, “awesome”) – 41% nouns uncovered by ImageNet • Related to abstract concepts (e.g. “violence”, “religion”)
  • 24. Framework 1. Select 24 fundamental words representing emotion 2. Retrieve images with every of the words as a query 3. Tags associated with the images are extracted to build ANPs ( = strong sentiment ADJs + all Ns) 4. Train ANP detectors and keep only detectors with reasonable performance to form SentiBank
  • 25. SentiBank applications • Sentiment prediction in image tweets – Sentiment analysis rely on text-based tools – 140 characters (in ENG) are too short – Use SentiBank to complement and augment texts • Emotion classification – Demonstrate the performance against an emotion dataset of art photos [Machajdik+ ACMMM10]
  • 26. Sentiment prediction in tweets • Data collection – Gather tweets with images & popular hashtags • #nuclearpower, #election, #championsleague, #cairo … – AMT to obtain sentiment ground-truth • 3 Turkers for every tweets: almost agreed (below) http://www.ee.columbia.ed u/ln/dvmm/vso/download/t witter_dataset.html
  • 27. Sentiment prediction in tweets (cont.) • Visual-based classifier – Serve SentiBank as a mid-level representation • Use ANP responses as an input feature • Employ a linear classifier for the final output – Compare SentiBank with low-level features
  • 28. Sentiment prediction in tweets (cont.) • Text-based classifier – Naïve Bayes + SentiStrength • Overall performance
  • 29. Sentiment prediction in tweets (cont.) • Detailed performance
  • 30. Emotion classification • Dataset – 807 art photos, 8 emotion categories retrieved from DeviantArt.com
  • 31. Takeaway messages • To appear in Tomorrow’s meeting