SlideShare una empresa de Scribd logo
1 de 18
Xiaowen Ding, Bing Liu and Philip Yu




Presenter: Quang Nguyen
Date: 2010.10.18
Saltlux Vietnam Development Center
   Featured-based Opinion Mining Tasks
    Task 1: Identify and extract object features F that have been
      commented on by an opinion holder (e.g., a reviewer).
    Task 2: Determine whether the opinions on the features F are
      positive, negative or neutral.
    Task 3: Group feature synonyms.
    • Produce a feature-based opinion summary of multiple reviews.


   This paper focuses on Task 2 assuming that features
    have been discovered


                                                                     2
 Opinion Words
  • Positive: beautiful, wonderful, good, amazing,
  • Negative: bad, poor, terrible, cost someone an
    arm and a leg (idiom).

 One effective approach is to use opinion lexicon,
 opinion words.
  • Identify all opinion words in a sentence
  • Aggregate these words to give the final opinion to
    each feature.



                                                         3
 Dictionary-based    approaches
  • Start from a seed opinion words
  • Use Wordnet’s hierarchy and synsets to acquire
   more opinion words
 Corpus-based approaches: extract opinion
 words from large corpora using syntactic
 rules and co-occurrence patterns

  Do not deal well with context dependent
  words!
                                                     4
 Improve lexicon-based approaches using
 context dependent opinion words
  • Negative: “The bedroom is very small”
  • Positive: “The Nokia N3100 is so small as to be
   put in any pockets”
 Propose  a function for aggregating multiple
  opinion words in the same sentence
 Consider explicit and implicit opinions



                                                      5
 Intra-sentence conjunction rule
 Pseudo intra-sentence conjunction
 Inter-sentence conjunction rule




                                      6
 Opinion
        on both sides of “and” should be
 the same
  • E.g., “This camera takes great pictures and has a
   long battery life”.

 Not   likely to say:
  • “This camera takes great pictures and has a short
   battery life.”



                                                        7
 Sometimes,  one may not use an explicit
 conjunction “and”.
  • Same opinion in same sentence, unless there is a
    “but”-like clause
  • E.g., “The camera has a long battery life, which is
    great”




                                                          8
 Peopleusually express the same opinion
 across sentences
  • unless there is an indication of opinion change
    using words such as “but” and “however”
  • E.g., “The picture quality is amazing. The battery life is
   long”
 Not   so natural to say:
  • “The picture quality is amazing. The battery life is
   short”



                                                                 9
 Opinion lexicon is far from sufficient. It needs
 special handling:
  • Negation/But Rule




  • Non-negation contains negative word, e.g., “I like this camera
    not just because it is beautiful”
  • Not contrary, but has a “but”, e.g., ““I not only like the picture
    quality of this camera, but also its size”
  • …



                                                                         10
 Implicit
         Feature is determined through
 adjectives (implicit feature indicator)
  • E.g., “This camera is very small”
  “small” is indicator for “size”
  • E.g., “This camera is very heavy”
  • “heavy” is indicator for “weight”




                                           11
 An object O is an entity which can be a product,
  person, event, organization, or topic
 An object O is represented with a finite set of features,
  F = {f1, f2, …, fn}.
    • Each feature fi in F can be expressed with a finite set of words
      or phrases Wi, which are synonyms.

   Model of a review: An opinion holder j comments on a
    subset of the features Sj F of object O.
    • For each feature fk   Sj that j comments on, he/she
       chooses a word or phrase from Wk to describe the
        feature, and
       expresses a positive, negative or neutral opinion on fk.

                                                                         12
   Input: a pair (f, s), where f is a product feature and s is a
    sentence that contains f.
   Output: whether the opinion on f in s is pos, neg, or neut.



                 wi: opinion word
                 V: set of all opinion words
                 dis(wi, f): distance between wi and f
                 SO: semantic orientation of wi (+1, -1, 0)




                                                                    13
14
15
Precision Recall F-Score
FBS
(M. Hu and B. Liu. Mining and         0.93    0.76    0.83
summarizing customer
reviews. KDD’04, 2004)

OPINE
(A-M. Popescu and O. Etzioni.
Extracting Product Features
                                      0.86    0.89    0.87
and Opinions from Reviews. EMNLP-
05, 2005.)
Opinion Observer                      0.92    0.91    0.91
(this paper)




                                                               16
   Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic
    Lexicon-Based Approach to Opinion Mining, Proceedings
    of the international conference on Web search and web
    data mining, USA, 2008




                                                            17
18

Más contenido relacionado

Similar a Xiaowen Ding, Bing Liu and Philip Yu: A Holistic Lexicon-Based Approach to Opinion Mining

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1Eddie Hollon
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2ddelargy
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assessiain bruce
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5Dave Eddy
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Machel Sweeney
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyDr Nur Suhaili Ramli
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Kira
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5kimpalmore
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringCS, NcState
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWholeeducation
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012Regina Navejar
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion miningSumit Sony
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
Reading Street
Reading StreetReading Street
Reading Streetcavalcic
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summaryjschultz
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012Regina Navejar
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviewsfirzhan naqash
 

Similar a Xiaowen Ding, Bing Liu and Philip Yu: A Holistic Lexicon-Based Approach to Opinion Mining (20)

Technical Style Workshop Part 1
Technical Style Workshop Part 1Technical Style Workshop Part 1
Technical Style Workshop Part 1
 
Evaluation of research and planning 2
Evaluation of research and planning 2Evaluation of research and planning 2
Evaluation of research and planning 2
 
Fmp peer assess
Fmp peer assessFmp peer assess
Fmp peer assess
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Lens unit 2010 5
Lens unit 2010 5Lens unit 2010 5
Lens unit 2010 5
 
Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2Assignment revision edu 697 week 4 2
Assignment revision edu 697 week 4 2
 
BAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research MethodologyBAEB601 Chapter 1: Introduction to Research Methodology
BAEB601 Chapter 1: Introduction to Research Methodology
 
Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)Tutorial 13 (explicit ugc + sentiment analysis)
Tutorial 13 (explicit ugc + sentiment analysis)
 
Ewrt 211 class 5
Ewrt 211 class 5Ewrt 211 class 5
Ewrt 211 class 5
 
Learning Local Lessons in Software Engineering
Learning Local Lessons in Software EngineeringLearning Local Lessons in Software Engineering
Learning Local Lessons in Software Engineering
 
Whole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacyWhole education feb 7th 2013 literacy
Whole education feb 7th 2013 literacy
 
Bagwords
BagwordsBagwords
Bagwords
 
Tuesday, march 20, 2012
Tuesday, march 20, 2012Tuesday, march 20, 2012
Tuesday, march 20, 2012
 
Assignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransmanAssignment writing at university 2017 ginofransman
Assignment writing at university 2017 ginofransman
 
Sentiment analysis and opinion mining
Sentiment analysis and opinion miningSentiment analysis and opinion mining
Sentiment analysis and opinion mining
 
Reading Street
Reading StreetReading Street
Reading Street
 
Reading Street
Reading StreetReading Street
Reading Street
 
How to write the summary
How to write the summaryHow to write the summary
How to write the summary
 
Monday, march 19, 2012
Monday, march 19, 2012Monday, march 19, 2012
Monday, march 19, 2012
 
Ontology based opinion mining for book reviews
Ontology based opinion mining for book reviewsOntology based opinion mining for book reviews
Ontology based opinion mining for book reviews
 

Más de Nguyen Quang

Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement LearningNguyen Quang
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System ReviewNguyen Quang
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksNguyen Quang
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandraNguyen Quang
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architectureNguyen Quang
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginnerNguyen Quang
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrumNguyen Quang
 
Text categorization
Text categorizationText categorization
Text categorizationNguyen Quang
 

Más de Nguyen Quang (13)

Apache Zookeeper
Apache ZookeeperApache Zookeeper
Apache Zookeeper
 
Apache Storm
Apache StormApache Storm
Apache Storm
 
Deep Reinforcement Learning
Deep Reinforcement LearningDeep Reinforcement Learning
Deep Reinforcement Learning
 
Deep Dialog System Review
Deep Dialog System ReviewDeep Dialog System Review
Deep Dialog System Review
 
Sequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural NetworksSequence to Sequence Learning with Neural Networks
Sequence to Sequence Learning with Neural Networks
 
Introduction to cassandra
Introduction to cassandraIntroduction to cassandra
Introduction to cassandra
 
Web browser architecture
Web browser architectureWeb browser architecture
Web browser architecture
 
Eclipse orion
Eclipse orionEclipse orion
Eclipse orion
 
X Query for beginner
X Query for beginnerX Query for beginner
X Query for beginner
 
Html 5
Html 5Html 5
Html 5
 
Redistributable introtoscrum
Redistributable introtoscrumRedistributable introtoscrum
Redistributable introtoscrum
 
Text categorization
Text categorizationText categorization
Text categorization
 
Overview of NoSQL
Overview of NoSQLOverview of NoSQL
Overview of NoSQL
 

Último

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 

Último (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 

Xiaowen Ding, Bing Liu and Philip Yu: A Holistic Lexicon-Based Approach to Opinion Mining

  • 1. Xiaowen Ding, Bing Liu and Philip Yu Presenter: Quang Nguyen Date: 2010.10.18 Saltlux Vietnam Development Center
  • 2. Featured-based Opinion Mining Tasks Task 1: Identify and extract object features F that have been commented on by an opinion holder (e.g., a reviewer). Task 2: Determine whether the opinions on the features F are positive, negative or neutral. Task 3: Group feature synonyms. • Produce a feature-based opinion summary of multiple reviews.  This paper focuses on Task 2 assuming that features have been discovered 2
  • 3.  Opinion Words • Positive: beautiful, wonderful, good, amazing, • Negative: bad, poor, terrible, cost someone an arm and a leg (idiom).  One effective approach is to use opinion lexicon, opinion words. • Identify all opinion words in a sentence • Aggregate these words to give the final opinion to each feature. 3
  • 4.  Dictionary-based approaches • Start from a seed opinion words • Use Wordnet’s hierarchy and synsets to acquire more opinion words  Corpus-based approaches: extract opinion words from large corpora using syntactic rules and co-occurrence patterns Do not deal well with context dependent words! 4
  • 5.  Improve lexicon-based approaches using context dependent opinion words • Negative: “The bedroom is very small” • Positive: “The Nokia N3100 is so small as to be put in any pockets”  Propose a function for aggregating multiple opinion words in the same sentence  Consider explicit and implicit opinions 5
  • 6.  Intra-sentence conjunction rule  Pseudo intra-sentence conjunction  Inter-sentence conjunction rule 6
  • 7.  Opinion on both sides of “and” should be the same • E.g., “This camera takes great pictures and has a long battery life”.  Not likely to say: • “This camera takes great pictures and has a short battery life.” 7
  • 8.  Sometimes, one may not use an explicit conjunction “and”. • Same opinion in same sentence, unless there is a “but”-like clause • E.g., “The camera has a long battery life, which is great” 8
  • 9.  Peopleusually express the same opinion across sentences • unless there is an indication of opinion change using words such as “but” and “however” • E.g., “The picture quality is amazing. The battery life is long”  Not so natural to say: • “The picture quality is amazing. The battery life is short” 9
  • 10.  Opinion lexicon is far from sufficient. It needs special handling: • Negation/But Rule • Non-negation contains negative word, e.g., “I like this camera not just because it is beautiful” • Not contrary, but has a “but”, e.g., ““I not only like the picture quality of this camera, but also its size” • … 10
  • 11.  Implicit Feature is determined through adjectives (implicit feature indicator) • E.g., “This camera is very small” “small” is indicator for “size” • E.g., “This camera is very heavy” • “heavy” is indicator for “weight” 11
  • 12.  An object O is an entity which can be a product, person, event, organization, or topic  An object O is represented with a finite set of features, F = {f1, f2, …, fn}. • Each feature fi in F can be expressed with a finite set of words or phrases Wi, which are synonyms.  Model of a review: An opinion holder j comments on a subset of the features Sj F of object O. • For each feature fk Sj that j comments on, he/she  chooses a word or phrase from Wk to describe the feature, and  expresses a positive, negative or neutral opinion on fk. 12
  • 13. Input: a pair (f, s), where f is a product feature and s is a sentence that contains f.  Output: whether the opinion on f in s is pos, neg, or neut. wi: opinion word V: set of all opinion words dis(wi, f): distance between wi and f SO: semantic orientation of wi (+1, -1, 0) 13
  • 14. 14
  • 15. 15
  • 16. Precision Recall F-Score FBS (M. Hu and B. Liu. Mining and 0.93 0.76 0.83 summarizing customer reviews. KDD’04, 2004) OPINE (A-M. Popescu and O. Etzioni. Extracting Product Features 0.86 0.89 0.87 and Opinions from Reviews. EMNLP- 05, 2005.) Opinion Observer 0.92 0.91 0.91 (this paper) 16
  • 17. Xiaowen Ding, Bing Liu, and Philip S. Yu, A Holistic Lexicon-Based Approach to Opinion Mining, Proceedings of the international conference on Web search and web data mining, USA, 2008 17
  • 18. 18