Adapting Sentiment Lexicons using
Contextual Semantics for Sentiment
Analysis of Twitter
Hassan Saif, Yulan He, Miriam Fernandez and Harith Alani
Knowledge Media Institute, The Open University,
Milton Keynes, United Kingdom
1st Workshop on Semantic Sentiment Analysis
Greece, Crete 2014
• Sentiment Analysis
• Sentiment Analysis Approaches
• Sentiment Lexicons on Twitter
• Sentiment Lexicon Adaptation
Approach
• Evaluation
• Conclusion
Outline
“Sentiment analysis is the task of identifying
positive and negative opinions, emotions and
evaluations in text”
Opinion: "yes, It is sunny, but also very humid :("
Opinion: "The weather is great today :)"
Fact: "I think its almost 30 degrees today"
I had nightmares all night long last night :(
Negative
Sentiment Lexicon
Text Processing
Algorithm
Sentiment Analysis
The Lexicon-based Approach
great
sad
down
wrong
horrible
love
Sentiment Analysis
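The lexicon-based approach on this slide (text processing, then a lexicon lookup, then a scoring algorithm) can be sketched in a few lines. This is only a minimal illustration: the words come from the slide, but the +1/-1 scores, the tokenizer and the function name `lexicon_sentiment` are assumptions, not the lexicon or algorithm used by the authors.

```python
import re

# Toy lexicon: the words appear on the slide, the scores are illustrative assumptions.
TOY_LEXICON = {
    "great": 1, "love": 1,
    "sad": -1, "down": -1, "wrong": -1, "horrible": -1,
}

def lexicon_sentiment(text, lexicon=TOY_LEXICON):
    """Label a text by summing the prior scores of the lexicon words it contains."""
    # Text processing step: lowercase and keep alphabetic tokens only.
    tokens = re.findall(r"[a-z']+", text.lower())
    # Algorithm step: add up the prior sentiment of every known token.
    score = sum(lexicon.get(token, 0) for token in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"
```

Because every word carries one fixed score, the sketch also shows the limitation the later slides address: a word such as "great" is always scored as positive, whatever its context.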
Sentiment Lexicons
- Lists of Opinionated:
- Words and Phrases (MPQA, SentiWordNet, etc)
- Common Sense Concepts (SenticNet)
- Built:
- Manually
- Dictionary-based Approach
- Corpus-based Approach
- Applied to Conventional Text
- Movie Reviews, News, Blogs, Open Forums, etc.
Sentiment Lexicons on Twitter
Twitter Data
- Language Variations
- New Words
- Noisy Nature
- lol, gr8, :), :P
Traditional Lexicons
- Not tailored to Twitter
noisy data
- Fixed number of words
Twitter-specific Sentiment Lexicons
- Such as: Thelwall-Lexicon
- Built to specifically work on social data
- Contain lists of emoticons, slangs, abbreviations, etc.
- Coupled with rule-based method, SentiStrength
- Apply text pre-processing routine on tweets
Twitter-specific Sentiment Lexicons
Offer Context-Insensitive Prior Sentiment Orientations and Strength of words
..and Traditional Lexicons
Great
Problem Smile
Sentiment Lexicon
great
sad
down
wrong
horrible
love
Positive
Lexicons Adaptation Approaches
Supervised: Require Training from Labeled Corpora
Unsupervised: Use General Textual Corpora (e.g., Web) or Static lexical knowledge sources (e.g., WordNet)
Contextual Semantic Adaptation Approach
• Unsupervised Approach
• Captures the Contextual Semantics of words
• To assign Contextual Sentiment
Contextual Semantics of Words
“Words that occur in similar context tend to have similar meaning”
Wittgenstein (1953)
Great
Problem
Look Smile
Concert
Song
Weather
Loss
Game
Taylor Swift
Amazing
Great
Capturing Contextual Semantics
(1) Context-Term Vector: Term (m) with context terms C1, C2, …, Cn and their Degree of Correlation
(2) Prior Sentiment of the context terms, taken from the Sentiment Lexicon
(3) SentiCircles Model (example: term "Great" with context terms "Smile" and "Look")
- Contextual Sentiment Orientation: Positive, Negative, Neutral
- Contextual Sentiment Strength: [-1 (very negative), +1 (very positive)]
Capturing Contextual Semantics
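Term (m): Great, context term C1: Smile
Degree of Correlation, Prior Sentiment
SentiCircles Model
ri = TDOC(Ci)
θi = Prior_Sentiment (Ci) * π
xi = ri * cos(θi)
yi = ri * sin(θi)
[Figure: SentiCircle of "Great" with the context term "Smile" plotted at (xi, yi), radius ri and angle θi; axes from -1 to +1; regions: Very Positive, Positive, Negative, Very Negative, Neutral Region]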
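The mapping on this slide (ri = TDOC(Ci), θi = Prior_Sentiment(Ci) * π, then polar to Cartesian) can be sketched as follows. The function names and the example values are hypothetical, and the sketch assumes that the degree of correlation has been scaled to [0, 1] and the prior sentiment to [-1, 1] beforehand; how the authors compute and normalise these values is not given on the slide.

```python
import math

def senticircle_point(tdoc, prior_sentiment):
    """Place one context term on the SentiCircle of a target term.

    tdoc            -- degree of correlation with the target term (assumed in [0, 1])
    prior_sentiment -- prior sentiment of the context term (assumed in [-1, 1])
    """
    theta = prior_sentiment * math.pi   # angle: theta_i = Prior_Sentiment(C_i) * pi
    x = tdoc * math.cos(theta)          # x_i = r_i * cos(theta_i)
    y = tdoc * math.sin(theta)          # y_i = r_i * sin(theta_i)
    return x, y

def build_senticircle(context_terms):
    """Map {context term: (tdoc, prior_sentiment)} to {context term: (x, y)}."""
    return {
        term: senticircle_point(tdoc, prior)
        for term, (tdoc, prior) in context_terms.items()
    }

# Hypothetical context terms for the target term "great".
circle = build_senticircle({"smile": (0.8, 0.5), "problem": (0.4, -0.5)})
```

With this mapping a context term with positive prior sentiment lands in the upper half of the circle (y > 0) and one with negative prior sentiment in the lower half, while the radius reflects how strongly it co-occurs with the target term.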
SentiCircles (Example)
Overall Contextual Sentiment
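[Figure: SentiCircle of term m with a context term Ci plotted at (xi, yi), radius ri and angle θi; axes from -1 to +1; regions: Very Positive, Positive, Negative, Very Negative, Neutral Region]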
[Excerpt from the paper, clipped at the left margin:]
"… in which each term is used. To compute the new sentiment of … SentiCircle we use the Senti-Median metric. We now have the … which is composed by the set of (x, y) Cartesian coordinates of … where the y value represents the sentiment and the x value … strength. An effective way to approximate the overall sentiment … by calculating the geometric median of all its points. Formally, … (p1, p2, ..., pn) in a SentiCircle Ω, the 2D geometric median …"
g = \arg\min_{g \in \mathbb{R}^2} \sum_{i=1}^{n} \lVert p_i - g \rVert_2    (5)
Senti-Median of SentiCircle
Sentiment Function
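The Senti-Median is the geometric median of the SentiCircle points, i.e. the point that minimises the summed Euclidean distance to all of them. The slide does not say how the minimisation is carried out; the sketch below uses Weiszfeld's iterative algorithm as one common choice, which is an assumption and not necessarily the authors' method. The function name, the iteration limit and the tolerance are hypothetical.

```python
import math

def senti_median(points, iterations=100, eps=1e-9):
    """Approximate the 2D geometric median of (x, y) points with Weiszfeld's algorithm."""
    if not points:
        raise ValueError("need at least one point")
    # Start from the centroid of the points.
    gx = sum(x for x, _ in points) / len(points)
    gy = sum(y for _, y in points) / len(points)
    for _ in range(iterations):
        wx = wy = wsum = 0.0
        for x, y in points:
            d = math.hypot(x - gx, y - gy)
            if d < eps:
                # Simplification: skip a point that coincides with the current
                # estimate to avoid dividing by zero.
                continue
            w = 1.0 / d
            wx += w * x
            wy += w * y
            wsum += w
        if wsum == 0.0:
            break  # every point coincides with the estimate
        nx, ny = wx / wsum, wy / wsum
        moved = math.hypot(nx - gx, ny - gy)
        gx, gy = nx, ny
        if moved < eps:
            break  # converged
    return gx, gy
```

Following the excerpt above, the y value of the returned point would then be read as the contextual sentiment of the term.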
Lexicon Adaptation Method
• A set of Antecedent-Consequent Rules
• Decides on the new sentiment of a term
based on:
– How Weak/Strong its Prior Sentiment
– How Weak/Strong its Contextual Sentiment
• Based on the Position of the term’s SentiMedian
Thelwall-Lexicon
Case Study
Sample entries (term, prior sentiment strength): fiery -2, fiery* -2, vex* -3, witch -1, inspir* 3, trite* -3, cunt* -4, intelligent* 2, joll* 3, suffers -4, loved 4, insidious* -3, despis* -4, hehe* 2
[Bar chart: number of terms per class] Positive: 398, Negative: 1919, Neutral: 229
• Consists of 2546 terms
• Coupled with prior sentiment strength between |1| and |5|
[-2, -5] negative term
[2, 5] positive term
[-1, 1] neutral term
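The three strength ranges above translate directly into a small lookup. The function name `prior_orientation` is hypothetical, and integer strengths are assumed; the ranges are the ones listed on the slide.

```python
def prior_orientation(strength):
    """Map a Thelwall-Lexicon prior strength to its orientation class."""
    if not -5 <= strength <= 5:
        raise ValueError("prior strength must lie between -5 and 5")
    if strength >= 2:
        return "positive"   # [2, 5]
    if strength <= -2:
        return "negative"   # [-5, -2]
    return "neutral"        # [-1, 1]
```

Applied to sample entries from the slide, "loved 4" is positive, "fiery -2" is negative and "witch -1" is neutral.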
Adaptation Rules on
Thelwall-Lexicon
Prior Sentiment < -3 (weak negative)
Revolution
Contextual Sentiment = Neutral
Change to Neutral
Rule 10
Experiments
• Sentiment Lexicon
– Thelwall-Lexicon
• Settings:
– Update Setting
– Expand Setting
– Update + Expand Setting
• Datasets
• Binary Sentiment Classification
– SentiStrength
• Lexicon-based Method
• Work on Thelwall-Lexicon
Results
Adaptation Impact on Thelwall-Lexicon
Results
Cross comparison results of the original and the adapted lexicons
Adapted Lexicons on HCR
[Chart: Performance of Positive Sentiment Detection (Precision, Recall, F1) for the Original, Updated and Updated+Expanded lexicons]
[Chart: Sentiment Class Distribution, Positive to Negative Ratio for OMD, HCR and STS-Gold]
[Chart: Impact on Thelwall-Lexicon, New Words Added To Thelwall-Lexicon for OMD, HCR and STS-Gold]
Conclusion
• We proposed an unsupervised approach for sentiment
lexicon adaptation from Twitter data.
• It updates the words’ prior sentiment orientations and/or
strength based on their contextual semantics in tweets.
• The evaluation was done on Thelwall-Lexicon using three
Twitter datasets.
• Results showed that lexicons adapted by our approach
improved the sentiment classification performance in both
accuracy and F1 in two out of three datasets.
Thank You
Email: hassan.saif@open.ac.uk
Twitter: hrsaif
Website: tweenator.com

Editor's notes

  1. Early work on sentiment analysis focused mainly on extracting sentiment from conventional text such as movie reviews, blogs, news articles and open forums. Textual content in these types of media sources is linguistically rich, consists of well-structured and formal sentences, and discusses a specific topic or domain (e.g., movie reviews).
  2. However, with the emergence of social media networks and microblogging platforms, especially Twitter, research interest shifted to analyzing and extracting sentiment from these new sources. Nevertheless, one of the key challenges that Twitter sentiment analysis methods have to confront is the noisy nature of Twitter-generated data. Twitter allows only 140 characters per post, which encourages the use of abbreviations, irregular expressions and infrequent words. This phenomenon increases the level of data sparsity, affecting the performance of Twitter sentiment classifiers.
  3. There are several approaches to sentiment analysis. One common approach is the lexicon-based approach. This approach assumes that the sentiment orientations of a given
  4. Words in the lexicons have fixed prior sentiment orientations, i.e. each term has always the same associated sentiment orientation independently of the context in which the term is used.
  5. SentiCircles
  6. SentiCircles
  7. To build rules we need to look at the characteristics of the sentiment lexicon that we want to adapt.
  8. In our work we use Thelwall-Lexicon as a case study and therefore we built our adaptation rules based on the characteristics of this lexicon.
  9. As a case study
  10. To build rules we need to look at the characteristics of the sentiment lexicon that we want to adapt.
  11. To build rules we need to look at the characteristics of the sentiment lexicon that we want to adapt.