SlideShare una empresa de Scribd logo
1 de 16
Descargar para leer sin conexión
On sentiment analysis tools for
software engineering research
Robbert Jongeling Subhajit Datta Alexander Serebrenik
Eindhoven U of
Technology (NL)
Singapore U of Technology
and Design (SG)
Eindhoven U of
Technology (NL)
@jongeling_r @datta_subhajit @aserebrenik
E. Guzman, D. Azócar, and Y. Li,
“Sentiment analysis of commit
comments in GitHub: An empirical
study,” MSR 2014
A.-I. Rousinopoulos, G. Robles, and
J. M. González-Barahona, “Sentiment
analysis of Free/Open Source
developers: preliminary findings from
a case study,” Revista Eletrônica de
Sistemas de Informação, 2014
E. Guzman and B. Bruegge, “Towards
emotional awareness in software
development teams,” in Joint Meeting on
Foundations of Software Engineering, 2013
D. Pletea, B. Vasilescu, and A. Serebrenik,
“Security and emotion: Sentiment analysis
of security discussions on GitHub”, MSR
2014
M. Ortu, B. Adams, G. Destefanis, P. Tourani,
M. Marchesi, and R. Tonelli, “Are bullies
more productive? empirical study of
affectiveness vs. issue fixing time,” in MSR
2015
D. Garcia, M. S. Zanetti, and F. Schweitzer,
“The role of emotions in contributors
activity: A case study on the Gentoo
community,” in International Conference on
Cloud and Green Computing, 2013
E. Guzman, D. Azócar, and Y. Li,
“Sentiment analysis of commit
comments in GitHub: An empirical
study,” MSR 2014
A.-I. Rousinopoulos, G. Robles, and
J. M. González-Barahona, “Sentiment
analysis of Free/Open Source
developers: preliminary findings from
a case study,” Revista Eletrônica de
Sistemas de Informação, 2014
E. Guzman and B. Bruegge, “Towards
emotional awareness in software
development teams,” in Joint Meeting on
Foundations of Software Engineering, 2013
D. Pletea, B. Vasilescu, and A. Serebrenik,
“Security and emotion: Sentiment analysis
of security discussions on GitHub”, MSR
2014
M. Ortu, B. Adams, G. Destefanis, P. Tourani,
M. Marchesi, and R. Tonelli, “Are bullies
more productive? empirical study of
affectiveness vs. issue fixing time,” in MSR
2015
D. Garcia, M. S. Zanetti, and F. Schweitzer,
“The role of emotions in contributors
activity: A case study on the Gentoo
community,” in International Conference on
Cloud and Green Computing, 2013
NLTK SentiStrength
E. Guzman, D. Azócar, and Y. Li,
“Sentiment analysis of commit
comments in GitHub: An empirical
study,” MSR 2014
A.-I. Rousinopoulos, G. Robles, and
J. M. González-Barahona, “Sentiment
analysis of Free/Open Source
developers: preliminary findings from
a case study,” Revista Eletrônica de
Sistemas de Informação, 2014
E. Guzman and B. Bruegge, “Towards
emotional awareness in software
development teams,” in Joint Meeting on
Foundations of Software Engineering, 2013
D. Pletea, B. Vasilescu, and A. Serebrenik,
“Security and emotion: Sentiment analysis
of security discussions on GitHub”, MSR
2014
M. Ortu, B. Adams, G. Destefanis, P. Tourani,
M. Marchesi, and R. Tonelli, “Are bullies
more productive? empirical study of
affectiveness vs. issue fixing time,” in MSR
2015
D. Garcia, M. S. Zanetti, and F. Schweitzer,
“The role of emotions in contributors
activity: A case study on the Gentoo
community,” in International Conference on
Cloud and Green Computing, 2013
NLTK SentiStrength
Trained on movie/product reviews.
Threat: might misidentify (or fail to identify) a
sentiment in a software engineering artefact
• RQ1: To what extent do different sentiment analysis
tools agree with emotions of software developers?
• RQ2: To what extent do different sentiment analysis
tools agree with each other?
• RQ3: Do different sentiment analysis tools lead to
contradictory results in a software engineering
study?
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
RQ1
RQ2
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
Consistent:
positive: 3 positive, none negative
negative: 3 negative, none positive
neutral: ≥3 without emotion indication
Alchemy
Stanford NLP
NLTK
SentiStrength
RQ1
Manual
neg neu pos
Tool
neg
neu
pos
RQ2
Tool A
neg neu pos
Tool
B
neg
neu
pos
RQ1
RQ2
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
Consistent:
positive: 3 positive, none negative
negative: 3 negative, none positive
neutral: ≥3 without emotion indication
Alchemy
Stanford NLP
NLTK
SentiStrength
RQ1
Manual
neg neu pos
Tool
neg
neu
pos
54
24
217
0 ≤ Adjusted Rand Index ≤ 1
[Santos, Embrechts, ICANN 2009]
RQ2
Tool A
neg neu pos
Tool
B
neg
neu
pos
RQ1
RQ2
Murgia et al.
MSR 2014
392 comments x 4 evaluators
joy love surprise anger fearsadness
positive negative
{
{
Consistent:
positive: 3 positive, none negative
negative: 3 negative, none positive
neutral: ≥3 without emotion indication
Alchemy
Stanford NLP
NLTK
SentiStrength
RQ1
Manual
neg neu pos
Tool
neg
neu
pos
54
24
217
0 ≤ Adjusted Rand Index ≤ 1
[Santos, Embrechts, ICANN 2009]
RQ2
Tool A
neg neu pos
Tool
B
neg
neu
pos
RQ1
RQ2
RQ1: To what extent do different sentiment analysis tools
agree with emotions of software developers?
RQ1
Manual
neg neu pos
NLTK
neg 19 51 11
neu 0 138 7
pos 5 28 36
Tool ARI
NLTK 0.239
SentiStrength 0.113
Stanford NLP 0.108
Alchemy 0.079
Tools do not agree with manual evaluation
RQ1
RQ2
RQ2: To what extent do different sentiment analysis tools
agree with each other?
RQ2
SentiStrength
neg neu pos
NLTK
neg 17 39 25
neu 15 96 34
pos 6 20 43
Tool A Tool B ARI
NLTK Alchemy 0.104
NLTK SentiStrength 0.090
Tools do not agree with each other
RQ1
RQ2
RQ3
issue tracker
over
text
response
time
Sentiment
Analysis Tool
compare times for
neg, neu, pos
issues/questions
q & a site
NLTK
issue tracker
over
text
response
time
Sentiment
Anal. Tool
compare times for
neg, neu, pos
issues/questions
q & a site
NLTK ∩
SentiStrength
issue tracker
over
text
response
time
Sentiment
Anal. Tool
compare times for
neg, neu, pos
issues/questions
q & a site
SentiStrength
RQ3
issue tracker
over
text
response
time
Sentiment
Analysis Tool
compare times for
neg, neu, pos
issues/questions
q & a site
NLTK
Are the results the same?
NLTK SentiStrength NLTK ∩ SentiStrength
ASF
descr
neg > neu*** neg > neu***
pos > neu*** pos > neu*** pos > neu***
pos > neg*** pos > neg***
ASF title
neg > neu**
pos > neu*** pos > neu**
pos > neg* pos > neg**
GNOME
descr
neg > neu*** neg > neu*** neg > neu***
pos > neu*** pos > neu*** pos > neu***
pos > neg***
neg > pos***
SO
descr
ø neg > pos* ø
RQ3 RQ3: Do different sentiment analysis tools lead to
contradictory results in a software engineering study?
Choice of the sentiment analysis tool affects results of the
software engineering study
Tools do not agree with manual evaluation
Tools do not agree with each other
Choice of the sentiment analysis tool affects results of the
software engineering study
Summary
Sentiment analysis tools are trained on movie/
product reviews.
Threat: might misidentify (or fail to identify) a
sentiment in a software engineering artefact
Next steps?
• Train sentiment analysis tools on software
engineering data
• Data of Murgia et al.: first step
• More and better-suited data is needed

Más contenido relacionado

Destacado

actionreserchpbl
actionreserchpblactionreserchpbl
actionreserchpblpuniga
 
Hh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvelaHh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvelaAtte Järvelä
 
ไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉันไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉันSchool in Phatthalung
 
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Alexander Serebrenik
 
Mock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap CarMock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap CarMelissa Anne Lim
 
Icsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliarIcsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliarAlexander Serebrenik
 
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information SitesEnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information SitesAlexander Serebrenik
 
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"paulinehalazoune
 
865 social capital
865 social capital865 social capital
865 social capitalIan Pollock
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Colt
 
Riverbend Market Cooperative
Riverbend Market CooperativeRiverbend Market Cooperative
Riverbend Market CooperativeColt
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Colt
 
Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++Alexander Serebrenik
 
PresentacióN Pcpi M
PresentacióN Pcpi MPresentacióN Pcpi M
PresentacióN Pcpi Mguestae4440
 
Benevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgradesBenevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgradesAlexander Serebrenik
 
Pbl_action_reserch
Pbl_action_reserchPbl_action_reserch
Pbl_action_reserchsojikarns
 
Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013edumallol
 

Destacado (20)

actionreserchpbl
actionreserchpblactionreserchpbl
actionreserchpbl
 
Hh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvelaHh kehittamistyo esitys_atte_jarvela
Hh kehittamistyo esitys_atte_jarvela
 
ไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉันไตร่ตรองงานวิจัยของฉัน
ไตร่ตรองงานวิจัยของฉัน
 
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
Invited Talk MESOCA 2014: Evolving software systems: emerging trends and chal...
 
Mock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap CarMock Advertisement for Amphibious Mousetrap Car
Mock Advertisement for Amphibious Mousetrap Car
 
Icsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliarIcsm 2011 you can't control the unfamiliar
Icsm 2011 you can't control the unfamiliar
 
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information SitesEnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
EnTagRec: An Enhanced Tag Recommendation System for Software Information Sites
 
Roman Vorobyev
Roman VorobyevRoman Vorobyev
Roman Vorobyev
 
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
Global Entrepreneurship Final Oral Presentation for "Les macarons de Pauline"
 
Saxony Germany
Saxony GermanySaxony Germany
Saxony Germany
 
865 social capital
865 social capital865 social capital
865 social capital
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!
 
Databases Part 3: Searching
Databases Part 3: SearchingDatabases Part 3: Searching
Databases Part 3: Searching
 
Riverbend Market Cooperative
Riverbend Market CooperativeRiverbend Market Cooperative
Riverbend Market Cooperative
 
Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!Sneak peak at door prizes and silent auction items for Grand Opening reception!
Sneak peak at door prizes and silent auction items for Grand Opening reception!
 
Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++Flowgen: Flowchart-Based Documentation Framework for C++
Flowgen: Flowchart-Based Documentation Framework for C++
 
PresentacióN Pcpi M
PresentacióN Pcpi MPresentacióN Pcpi M
PresentacióN Pcpi M
 
Benevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgradesBenevol 2013: Visualizing the complexity of software module upgrades
Benevol 2013: Visualizing the complexity of software module upgrades
 
Pbl_action_reserch
Pbl_action_reserchPbl_action_reserch
Pbl_action_reserch
 
Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013Reunió famílies setembre p3 2012 2013
Reunió famílies setembre p3 2012 2013
 

Similar a Sentiment analysis tools for software engineering research cannot be used out of the box

A Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering ResearchA Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering ResearchNicole Novielli
 
Some examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchersSome examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchersAshish Sharma
 
Emotion Analysis in Software Ecosystems
Emotion Analysis in Software EcosystemsEmotion Analysis in Software Ecosystems
Emotion Analysis in Software EcosystemsAlexander Serebrenik
 
Intro to sentiment analysis
Intro to sentiment analysisIntro to sentiment analysis
Intro to sentiment analysisTimea Turdean
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET Journal
 
A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow Fabio Calefato
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisNicole Novielli
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsKimberly Pulley
 
R programming for psychometrics
R programming for psychometricsR programming for psychometrics
R programming for psychometricsDiane Talley
 
The Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer EcosystemThe Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer EcosystemNicole Novielli
 
Analyzing Big Data's Weakest Link (hint: it might be you)
Analyzing Big Data's Weakest Link  (hint: it might be you)Analyzing Big Data's Weakest Link  (hint: it might be you)
Analyzing Big Data's Weakest Link (hint: it might be you)HPCC Systems
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest linkCS, NcState
 
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open ChallengesKeynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open ChallengesNicole Novielli
 
White paper loren k schwappach
White paper   loren k schwappachWhite paper   loren k schwappach
White paper loren k schwappachLoren Schwappach
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterToine Bogers
 
How does Social Software support Global Software Development?
How does Social Software support Global Software Development?How does Social Software support Global Software Development?
How does Social Software support Global Software Development?Rosalba Giuffrida
 
A General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender SystemA General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender SystemLucio Narducci
 
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageA Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageJeff Nelson
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisEditor IJCATR
 

Similar a Sentiment analysis tools for software engineering research cannot be used out of the box (20)

A Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering ResearchA Benchmark Study on Sentiment Analysis for Software Engineering Research
A Benchmark Study on Sentiment Analysis for Software Engineering Research
 
Some examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchersSome examples of Open Source computational tools for researchers
Some examples of Open Source computational tools for researchers
 
Emotion Analysis in Software Ecosystems
Emotion Analysis in Software EcosystemsEmotion Analysis in Software Ecosystems
Emotion Analysis in Software Ecosystems
 
Intro to sentiment analysis
Intro to sentiment analysisIntro to sentiment analysis
Intro to sentiment analysis
 
IRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression IdentificationIRJET- BDI using NLP for Efficient Depression Identification
IRJET- BDI using NLP for Efficient Depression Identification
 
A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow A Gold Standard for Emotion Annotation in Stack Overflow
A Gold Standard for Emotion Annotation in Stack Overflow
 
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment AnalysisTo Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
To Label or Not? Advances and Open Challenges in SE-specific Sentiment Analysis
 
Aspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel ReviewsAspect-Level Sentiment Analysis On Hotel Reviews
Aspect-Level Sentiment Analysis On Hotel Reviews
 
R programming for psychometrics
R programming for psychometricsR programming for psychometrics
R programming for psychometrics
 
The Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer EcosystemThe Challenges of Affect Detection in the Social Programmer Ecosystem
The Challenges of Affect Detection in the Social Programmer Ecosystem
 
Analyzing Big Data's Weakest Link (hint: it might be you)
Analyzing Big Data's Weakest Link  (hint: it might be you)Analyzing Big Data's Weakest Link  (hint: it might be you)
Analyzing Big Data's Weakest Link (hint: it might be you)
 
Big Data: the weakest link
Big Data: the weakest linkBig Data: the weakest link
Big Data: the weakest link
 
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open ChallengesKeynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
Keynote@QUATIC - Recognizing Developer's Emotions: Advances and Open Challenges
 
White paper loren k schwappach
White paper   loren k schwappachWhite paper   loren k schwappach
White paper loren k schwappach
 
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on TwitterMicro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
Micro-Serendipity: Meaningful Coincidences in Everyday Life Shared on Twitter
 
How does Social Software support Global Software Development?
How does Social Software support Global Software Development?How does Social Software support Global Software Development?
How does Social Software support Global Software Development?
 
Empathic Mixed Reality
Empathic Mixed RealityEmpathic Mixed Reality
Empathic Mixed Reality
 
A General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender SystemA General Architecture for an Emotion-aware Content-based Recommender System
A General Architecture for an Emotion-aware Content-based Recommender System
 
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam LanguageA Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
A Subjective Feature Extraction For Sentiment Analysis In Malayalam Language
 
Neural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment AnalysisNeural Network Based Context Sensitive Sentiment Analysis
Neural Network Based Context Sensitive Sentiment Analysis
 

Más de Alexander Serebrenik

Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...Alexander Serebrenik
 
Towards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBotTowards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBotAlexander Serebrenik
 
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...Alexander Serebrenik
 
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...Alexander Serebrenik
 
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...Alexander Serebrenik
 
Gender and Age in Software Engineering
Gender and Age in Software EngineeringGender and Age in Software Engineering
Gender and Age in Software EngineeringAlexander Serebrenik
 
Diversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroomDiversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroomAlexander Serebrenik
 
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis AlarmsAn Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis AlarmsAlexander Serebrenik
 
Classification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis AlarmsClassification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis AlarmsAlexander Serebrenik
 
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The NetherlandsWhat Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The NetherlandsAlexander Serebrenik
 
From team organisation to software quality
From team organisation to software qualityFrom team organisation to software quality
From team organisation to software qualityAlexander Serebrenik
 
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...Alexander Serebrenik
 
My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)Alexander Serebrenik
 
Opinion Mining for Software Engineering
Opinion Mining for Software EngineeringOpinion Mining for Software Engineering
Opinion Mining for Software EngineeringAlexander Serebrenik
 
Removing Self Admitted Technical Debt
Removing Self Admitted Technical DebtRemoving Self Admitted Technical Debt
Removing Self Admitted Technical DebtAlexander Serebrenik
 
Gender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software EngineeringGender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software EngineeringAlexander Serebrenik
 
Identifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the ArtIdentifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the ArtAlexander Serebrenik
 

Más de Alexander Serebrenik (20)

Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...Software development is a human activity: understanding software requires und...
Software development is a human activity: understanding software requires und...
 
Towards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBotTowards Continuous Performance Assessment of Java Applications With PerfBot
Towards Continuous Performance Assessment of Java Applications With PerfBot
 
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
“STILL AROUND”: Experiences and Survival Strategies of Veteran Women Software...
 
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
A Qualitative Study of Developers’ Discussions of Their Problems and Joys Dur...
 
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
Investigating the Resolution of Vulnerable Dependencies with Dependabot Secur...
 
Gender and Age in Software Engineering
Gender and Age in Software EngineeringGender and Age in Software Engineering
Gender and Age in Software Engineering
 
Alexander - intro
Alexander - introAlexander - intro
Alexander - intro
 
Diversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroomDiversity and inclusion in a CS classroom
Diversity and inclusion in a CS classroom
 
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis AlarmsAn Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
An Empirical Assessment on Merging and Repositioning of Static Analysis Alarms
 
Classification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis AlarmsClassification and Ranking of Delta Static Analysis Alarms
Classification and Ranking of Delta Static Analysis Alarms
 
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The NetherlandsWhat Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
What Is an AI Engineer? An Empirical Analysis of Job Ads in The Netherlands
 
Gender and Community Smells
Gender and Community SmellsGender and Community Smells
Gender and Community Smells
 
Bias in MSR Research
Bias in MSR ResearchBias in MSR Research
Bias in MSR Research
 
From team organisation to software quality
From team organisation to software qualityFrom team organisation to software quality
From team organisation to software quality
 
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
Women in Dutch Computer Science: Best Practices for Recruitment, Onboarding a...
 
My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)My research story (presentation at ICSE 2021 New Faculty Symposium)
My research story (presentation at ICSE 2021 New Faculty Symposium)
 
Opinion Mining for Software Engineering
Opinion Mining for Software EngineeringOpinion Mining for Software Engineering
Opinion Mining for Software Engineering
 
Removing Self Admitted Technical Debt
Removing Self Admitted Technical DebtRemoving Self Admitted Technical Debt
Removing Self Admitted Technical Debt
 
Gender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software EngineeringGender Diversity and Inclusion and Software Engineering
Gender Diversity and Inclusion and Software Engineering
 
Identifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the ArtIdentifying Developers’ Gender: State of the Art
Identifying Developers’ Gender: State of the Art
 

Último

The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptxThe Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptxRTS corp
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...OnePlan Solutions
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingShane Coughlan
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesVictoriaMetrics
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?Alexandre Beguel
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogueitservices996
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonApplitools
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolsosttopstonverter
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Rob Geurden
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptxVinzoCenzo
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecturerahul_net
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...OnePlan Solutions
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxAndreas Kunz
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shardsChristopher Curtin
 
Zer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfZer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfmaor17
 
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorTier1 app
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZABSYZ Inc
 
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...Bert Jan Schrijver
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identityteam-WIBU
 

Último (20)

The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptxThe Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
The Role of IoT and Sensor Technology in Cargo Cloud Solutions.pptx
 
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
Revolutionizing the Digital Transformation Office - Leveraging OnePlan’s AI a...
 
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full RecordingOpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
 
What’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 UpdatesWhat’s New in VictoriaMetrics: Q1 2024 Updates
What’s New in VictoriaMetrics: Q1 2024 Updates
 
SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?SAM Training Session - How to use EXCEL ?
SAM Training Session - How to use EXCEL ?
 
Ronisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited CatalogueRonisha Informatics Private Limited Catalogue
Ronisha Informatics Private Limited Catalogue
 
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + KobitonLeveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
Leveraging AI for Mobile App Testing on Real Devices | Applitools + Kobiton
 
eSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration toolseSoftTools IMAP Backup Software and migration tools
eSoftTools IMAP Backup Software and migration tools
 
Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...Simplifying Microservices & Apps - The art of effortless development - Meetup...
Simplifying Microservices & Apps - The art of effortless development - Meetup...
 
Osi security architecture in network.pptx
Osi security architecture in network.pptxOsi security architecture in network.pptx
Osi security architecture in network.pptx
 
Understanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM ArchitectureUnderstanding Flamingo - DeepMind's VLM Architecture
Understanding Flamingo - DeepMind's VLM Architecture
 
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
Tech Tuesday Slides - Introduction to Project Management with OnePlan's Work ...
 
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptxUI5ers live - Custom Controls wrapping 3rd-party libs.pptx
UI5ers live - Custom Controls wrapping 3rd-party libs.pptx
 
2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards2024 DevNexus Patterns for Resiliency: Shuffle shards
2024 DevNexus Patterns for Resiliency: Shuffle shards
 
Zer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdfZer0con 2024 final share short version.pdf
Zer0con 2024 final share short version.pdf
 
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News UpdateVictoriaMetrics Q1 Meet Up '24 - Community & News Update
VictoriaMetrics Q1 Meet Up '24 - Community & News Update
 
Effectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryErrorEffectively Troubleshoot 9 Types of OutOfMemoryError
Effectively Troubleshoot 9 Types of OutOfMemoryError
 
Salesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZSalesforce Implementation Services PPT By ABSYZ
Salesforce Implementation Services PPT By ABSYZ
 
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
JavaLand 2024 - Going serverless with Quarkus GraalVM native images and AWS L...
 
Post Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on IdentityPost Quantum Cryptography – The Impact on Identity
Post Quantum Cryptography – The Impact on Identity
 

Sentiment analysis tools for software engineering research cannot be used out of the box

  • 1. On sentiment analysis tools for software engineering research Robbert Jongeling Subhajit Datta Alexander Serebrenik Eindhoven U of Technology (NL) Singapore U of Technology and Design (SG) Eindhoven U of Technology (NL) @jongeling_r @datta_subhajit @aserebrenik
  • 2. E. Guzman, D. Azócar, and Y. Li, “Sentiment analysis of commit comments in GitHub: An empirical study,” MSR 2014 A.-I. Rousinopoulos, G. Robles, and J. M. González-Barahona, “Sentiment analysis of Free/Open Source developers: preliminary findings from a case study,” Revista Eletrônica de Sistemas de Informação, 2014 E. Guzman and B. Bruegge, “Towards emotional awareness in software development teams,” in Joint Meeting on Foundations of Software Engineering, 2013 D. Pletea, B. Vasilescu, and A. Serebrenik, “Security and emotion: Sentiment analysis of security discussions on GitHub”, MSR 2014 M. Ortu, B. Adams, G. Destefanis, P. Tourani, M. Marchesi, and R. Tonelli, “Are bullies more productive? empirical study of affectiveness vs. issue fixing time,” in MSR 2015 D. Garcia, M. S. Zanetti, and F. Schweitzer, “The role of emotions in contributors activity: A case study on the Gentoo community,” in International Conference on Cloud and Green Computing, 2013
  • 3. E. Guzman, D. Azócar, and Y. Li, “Sentiment analysis of commit comments in GitHub: An empirical study,” MSR 2014 A.-I. Rousinopoulos, G. Robles, and J. M. González-Barahona, “Sentiment analysis of Free/Open Source developers: preliminary findings from a case study,” Revista Eletrônica de Sistemas de Informação, 2014 E. Guzman and B. Bruegge, “Towards emotional awareness in software development teams,” in Joint Meeting on Foundations of Software Engineering, 2013 D. Pletea, B. Vasilescu, and A. Serebrenik, “Security and emotion: Sentiment analysis of security discussions on GitHub”, MSR 2014 M. Ortu, B. Adams, G. Destefanis, P. Tourani, M. Marchesi, and R. Tonelli, “Are bullies more productive? empirical study of affectiveness vs. issue fixing time,” in MSR 2015 D. Garcia, M. S. Zanetti, and F. Schweitzer, “The role of emotions in contributors activity: A case study on the Gentoo community,” in International Conference on Cloud and Green Computing, 2013 NLTK SentiStrength
  • 4. E. Guzman, D. Azócar, and Y. Li, “Sentiment analysis of commit comments in GitHub: An empirical study,” MSR 2014 A.-I. Rousinopoulos, G. Robles, and J. M. González-Barahona, “Sentiment analysis of Free/Open Source developers: preliminary findings from a case study,” Revista Eletrônica de Sistemas de Informação, 2014 E. Guzman and B. Bruegge, “Towards emotional awareness in software development teams,” in Joint Meeting on Foundations of Software Engineering, 2013 D. Pletea, B. Vasilescu, and A. Serebrenik, “Security and emotion: Sentiment analysis of security discussions on GitHub”, MSR 2014 M. Ortu, B. Adams, G. Destefanis, P. Tourani, M. Marchesi, and R. Tonelli, “Are bullies more productive? empirical study of affectiveness vs. issue fixing time,” in MSR 2015 D. Garcia, M. S. Zanetti, and F. Schweitzer, “The role of emotions in contributors activity: A case study on the Gentoo community,” in International Conference on Cloud and Green Computing, 2013 NLTK SentiStrength Trained on movie/product reviews. Threat: might misidentify (or fail to identify) a sentiment in a software engineering artefact
  • 5. • RQ1: To what extent do different sentiment analysis tools agree with emotions of software developers? • RQ2: To what extent do different sentiment analysis tools agree with each other? • RQ3: Do different sentiment analysis tools lead to contradictory results in a software engineering study?
  • 6. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { RQ1 RQ2
  • 7. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { Consistent: positive: 3 positive, none negative negative: 3 negative, none positive neutral: ≥3 without emotion indication Alchemy Stanford NLP NLTK SentiStrength RQ1 Manual neg neu pos Tool neg neu pos RQ2 Tool A neg neu pos Tool B neg neu pos RQ1 RQ2
  • 8. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { Consistent: positive: 3 positive, none negative negative: 3 negative, none positive neutral: ≥3 without emotion indication Alchemy Stanford NLP NLTK SentiStrength RQ1 Manual neg neu pos Tool neg neu pos 54 24 217 0 ≤ Adjusted Rand Index ≤ 1 [Santos, Embrechts, ICANN 2009] RQ2 Tool A neg neu pos Tool B neg neu pos RQ1 RQ2
  • 9. Murgia et al. MSR 2014 392 comments x 4 evaluators joy love surprise anger fearsadness positive negative { { Consistent: positive: 3 positive, none negative negative: 3 negative, none positive neutral: ≥3 without emotion indication Alchemy Stanford NLP NLTK SentiStrength RQ1 Manual neg neu pos Tool neg neu pos 54 24 217 0 ≤ Adjusted Rand Index ≤ 1 [Santos, Embrechts, ICANN 2009] RQ2 Tool A neg neu pos Tool B neg neu pos RQ1 RQ2
  • 10. RQ1: To what extent do different sentiment analysis tools agree with emotions of software developers? RQ1 Manual neg neu pos NLTK neg 19 51 11 neu 0 138 7 pos 5 28 36 Tool ARI NLTK 0.239 SentiStrength 0.113 Stanford NLP 0.108 Alchemy 0.079 Tools do not agree with manual evaluation RQ1 RQ2
  • 11. RQ2: To what extent do different sentiment analysis tools agree with each other? RQ2 SentiStrength neg neu pos NLTK neg 17 39 25 neu 15 96 34 pos 6 20 43 Tool A Tool B ARI NLTK Alchemy 0.104 NLTK SentiStrength 0.090 Tools do not agree with each other RQ1 RQ2
  • 12. RQ3 issue tracker over text response time Sentiment Analysis Tool compare times for neg, neu, pos issues/questions q & a site NLTK
  • 13. issue tracker over text response time Sentiment Anal. Tool compare times for neg, neu, pos issues/questions q & a site NLTK ∩ SentiStrength issue tracker over text response time Sentiment Anal. Tool compare times for neg, neu, pos issues/questions q & a site SentiStrength RQ3 issue tracker over text response time Sentiment Analysis Tool compare times for neg, neu, pos issues/questions q & a site NLTK Are the results the same?
  • 14. NLTK SentiStrength NLTK ∩ SentiStrength ASF descr neg > neu*** neg > neu*** pos > neu*** pos > neu*** pos > neu*** pos > neg*** pos > neg*** ASF title neg > neu** pos > neu*** pos > neu** pos > neg* pos > neg** GNOME descr neg > neu*** neg > neu*** neg > neu*** pos > neu*** pos > neu*** pos > neu*** pos > neg*** neg > pos*** SO descr ø neg > pos* ø RQ3 RQ3: Do different sentiment analysis tools lead to contradictory results in a software engineering study? Choice of the sentiment analysis tool affects results of the software engineering study
  • 15. Tools do not agree with manual evaluation Tools do not agree with each other Choice of the sentiment analysis tool affects results of the software engineering study Summary Sentiment analysis tools are trained on movie/ product reviews. Threat: might misidentify (or fail to identify) a sentiment in a software engineering artefact
  • 16. Next steps? • Train sentiment analysis tools on software engineering data • Data of Murgia et al.: first step • More and better-suited data is needed