Multimodal Stance Detection in Tweets on
Catalan #1Oct Referendum
@Ibereval 2018
Mariona Taulé, M. Antònia Martí, Universitat de Barcelona
Francisco Rangel, Autoritas Consulting &
Universitat Politècnica de València
Paolo Rosso, Universitat Politècnica de València
MultiStanceCat
• Introduction
• MultiStanceCat: Task Description
• TW-CaSe corpus
• Evaluation Framework
• Overview of the submitted approaches
• Conclusions
http://www.autoritas.net/MultiStanceCat-IberEval2018/
→ SemEval-2016 Task 6: Detecting stance in tweets → English
(Mohammad et al., 2016)
→ IberEval-2017 StanceCat task 7 → Catalan and Spanish
(Taulé et al., 2017)
IberEval-2018: MultiModal Stance Detection in tweets on
Catalan #1Oct Referendum task (MultiStanceCat )
To detect the authors' stances with respect to the 1 October
Referendum (2017) in tweets written in Catalan and Spanish from a
multimodal perspective
Multimodality: images from author’s timeline
Contextual information: tweet before and after
Text of the tweet + link
MultiStanceCat: Introduction
MultiModal Stance Detection in tweets on Catalan
#1Oct Referendum task (MultiStanceCat )
Task related to Sentiment Analysis, in which systems detect the positive, negative
or neutral polarity of the text, BUT
in stance detection the systems detect whether a text message is
favorable or unfavorable to a topic of discussion, usually controversial,
which may or may not be explicitly mentioned in the text message
1Oct Referendum: heated debate
→ Legitimate referendum (favor)
→ Illegal referendum (against)
MultiStanceCat: Introduction
MultiStanceCat: Task Description
MultiStanceCat Task
Deciding whether each message is neutral, in favor or
against the target: ‘Catalan first of October
Referendum’ from a multimodal perspective
Languages: Catalan and Spanish
MultiStanceCat: Corpus
• TW-1O Referendum corpus → 11,398 tweets
Hashtags/keywords: #1oct, 1O, #oct2017, 1oct16
Period: [20/09/2017-30/09/2017]
Cosmos tool (by Autoritas)

TW-1OReferendum    Total     Training (80%)    Test (20%)
Catalan            5,853     4,684             1,169
Spanish            5,545     4,437             1,108
Total              11,398    9,121             2,277

TW-1OReferendum    Tweets
Catalan            87,449
Spanish            132,699
Total              220,148
MultiStanceCat: Corpus
•Annotation Scheme:
MultiStance Tags
–AGAINST: Negative stance
–FAVOR: Positive stance
–NEUTRAL: Neutral stance (informative/reporting tweets, or
stance cannot be inferred)
MultiStanceCat: Corpus
Tweet: Res ni ningú, ens aturarà #Votarem #DretaDecidir #1Oct
#CatalunyaLliure #defensemlademocracia http://t.co/PgVLYH8AgN
Stance: FAVOR
'Nothing and nobody will stop us #Votarem #DretaDecidir #1Oct
#CatalunyaLliure #defensemlademocracia http://t.co/PgVLYH8AgN'
Tweet: Más q votos creo q estais usando personas jugando con sus
sentimientos SABIAIS q el #1Oct ES ILEGAL https://t.co/1SJcwn7LHd
Stance: AGAINST
'More than votes, I think you are using people, playing with their feelings. YOU KNEW that the
#1Oct IS ILLEGAL https://t.co/1SJcwn7LHd'
Tweet: Voteu! #1Oct ¿Crees que la respuesta del Estado al desafio
independentista catalán está siendo adecuada? https://t.co/LlZrkd20gh via
@20
Stance: NEUTRAL
'Vote! #1Oct Do you think that the State’s response to the Catalan pro-independence challenge is
appropriate? https://t.co/LlZrkd20gh vía @20m'
MultiStanceCat: Corpus
• Annotation procedure
– 1st stage: Automatic annotation
List of preselected authors (0.32% of the total annotated tweets)
– 2nd stage: Manual annotation
1) 2 annotators tagged the stance in 500 Catalan tweets and in
500 Spanish tweets in parallel
2) 1st Interannotator Agreement Test (IAT)
3) annotators tagged 1,300 tweets in each language
4) 2nd IAT
•Annotation of the whole corpus individually
Annotators: 2 trained annotators + 3 senior researchers
Meetings: once a week → problematic cases solved by common consensus
MultiStanceCat: Corpus
• Criteria:
– Written text: emoticons, @mentions and #hashtags ✓
– Links (webpages, photographs, videos…) ✓
– Images on the author's timeline ✓
+Pragmatic information (knowledge about this topic)
MultiStanceCat: Corpus
• Interannotator Agreement Test: Results

1st IAT, Stance (N = 500)          Text     Text+Link
TW-1OReferendum-CA   %Agreement    81.8%    86.2%
                     Kappa         0.63     0.76
TW-1OReferendum-ES   %Agreement    67.3%    81.2%
                     Kappa         0.54     0.68

2nd IAT, Stance (N = 1,300)        Text     Text+Link
TW-1OReferendum-CA   %Agreement    86.9%    89.4%
                     Kappa         0.73     0.82
TW-1OReferendum-ES   %Agreement    68.1%    83.3%
                     Kappa         0.57     0.65
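The two agreement figures reported in the IAT tables can be illustrated with a small sketch. It assumes that "Kappa" means Cohen's kappa for two annotators, which the slides do not state explicitly; the label lists below are invented toy data, not corpus annotations, and the function names are hypothetical.

```python
from collections import Counter


def percent_agreement(labels_a, labels_b):
    """Share of items on which both annotators chose the same tag."""
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(labels_a)
    p_observed = percent_agreement(labels_a, labels_b)
    counts_a, counts_b = Counter(labels_a), Counter(labels_b)
    # Chance agreement: probability that both annotators pick the same tag
    # if each tagged at random following their own tag frequencies.
    p_expected = sum(
        counts_a[tag] * counts_b[tag] for tag in set(labels_a) | set(labels_b)
    ) / (n * n)
    if p_expected == 1.0:
        return 1.0  # both annotators used a single identical tag throughout
    return (p_observed - p_expected) / (1 - p_expected)


# Toy data (assumption): six tweets tagged by annotators A and B.
annotator_a = ["FAVOR", "FAVOR", "AGAINST", "NEUTRAL", "FAVOR", "AGAINST"]
annotator_b = ["FAVOR", "FAVOR", "AGAINST", "AGAINST", "FAVOR", "NEUTRAL"]

print(f"agreement = {percent_agreement(annotator_a, annotator_b):.3f}")
print(f"kappa     = {cohen_kappa(annotator_a, annotator_b):.3f}")
```

Kappa is lower than raw agreement because part of the observed agreement is expected by chance, which matters in a corpus where one tag dominates.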
MultiStanceCat: Corpus
• Disagreements: Assignment of NEUTRAL tag unclear
Tweet: Coscubielibers! El nostre idol esta La Sexta! Parlara del Daniel?#1octL6
Stance: NEUTRAL
'Coscubielibers! Our idol is on La Sexta (TV Channel). Will he talk about
Daniel? #1octL6’
A= NEUTRAL B=AGAINST
MultiStanceCat: Corpus
• Disagreements: Irony
Tweet: Els RADIKALS abduits i antidemocratics que provoquen el TUMULTO
certament fan bastanta por... #referendumCAt #1O…https://t.co/nlEa8rkXTT
Stance: FAVOR
'These brainwashed,anti-democratic RADIKALS who caused this TUMULT
certainly generate fear...'#referendumCAt #1O…https://t.co/nlEa8rkXTT
A= FAVOR B=AGAINST
MultiStanceCat: Corpus
• Format and distribution: xml files
Training set: 80% of TW-1OReferendum
– The ID of the tweet
– The text of the tweet to be evaluated
– the contextual information: the tweet before and after the
tweet under evaluation
– the name of the image (up to 10 images) obtained from the
author's timeline.
Test set: 20% of TW-1OReferendum
– Xml files without truth values
• Distribution of stance labels
MultiStanceCat: Corpus
           TW-1OReferendum-CA            TW-1OReferendum-ES
Stance     Training   Test    Total      Training   Test    Total
FAVOR      4,085      1,021   5,106      1,680      419     2,099
AGAINST    120        29      149        1,785      446     2,231
NEUTRAL    479        119     598        972        243     1,215
Total      4,684      1,169   5,853      4,437      1,108   5,545
StanceCat: Evaluation Metrics & baseline
• Macro-average on F-score:
– Favor, Against, Neutral
– Semeval 2016 Task-6 &
StanceCat@IberEval 2017
• Majority-class baseline
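As a rough sketch of how a majority-class baseline scores under a macro-averaged F-score, the snippet below uses the label counts from the distribution table. It assumes the macro-average is the unweighted mean of the per-label F1 over FAVOR, AGAINST and NEUTRAL, as the list above suggests; the resulting numbers are illustrative and are not the official baseline scores of the task. Function and variable names are hypothetical.

```python
def f1(tp, fp, fn):
    """F1 for one label; defined as 0 when the label is never predicted correctly."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def majority_baseline_macro_f1(train_counts, test_counts):
    """Macro-F1 of a system that always predicts the most frequent training label."""
    majority = max(train_counts, key=train_counts.get)
    test_total = sum(test_counts.values())
    per_label = []
    for label, n_gold in test_counts.items():
        if label == majority:
            # Every test tweet is predicted as the majority label.
            per_label.append(f1(tp=n_gold, fp=test_total - n_gold, fn=0))
        else:
            # The label is never predicted, so all of its tweets are missed.
            per_label.append(f1(tp=0, fp=0, fn=n_gold))
    return sum(per_label) / len(per_label)


# Counts copied from the stance distribution table.
catalan_train = {"FAVOR": 4085, "AGAINST": 120, "NEUTRAL": 479}
catalan_test = {"FAVOR": 1021, "AGAINST": 29, "NEUTRAL": 119}
spanish_train = {"FAVOR": 1680, "AGAINST": 1785, "NEUTRAL": 972}
spanish_test = {"FAVOR": 419, "AGAINST": 446, "NEUTRAL": 243}

catalan_baseline = majority_baseline_macro_f1(catalan_train, catalan_test)
spanish_baseline = majority_baseline_macro_f1(spanish_train, spanish_test)
print(f"Catalan: {catalan_baseline:.4f}  Spanish: {spanish_baseline:.4f}")
```

Because two of the three labels get an F1 of zero, the macro-average stays low even when the majority label covers most of the test set, as it does in Catalan.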
StanceCat: Participation
TEAM          CATALAN                SPANISH
Casacufans    T, T + C, T + C + I    T, T + C, T + C + I
CriCa         T, T + C               T, C
ELiRF         -                      T
uc3m          T, T + C               T, T + C
(T = text, C = context, I = images)
StanceCat: Approaches
TEAM          MODE     APPROACH
Casacufans    T & C    HashingVectorizer from scikit-learn + SVM
              I        CNN (the authors did not send a working note)
CriCa         T & C    Bag-of-Words, stemming and TF-IDF + Linear SVM
ELiRF         T        Lowercase, remove accents and dieresis, normalized
                       Twitter elements:
                       ● RUN 1: Word Embeddings + CNN
                       ● RUN 2: Character word n-grams + Linear SVM
uc3m          T & C    Bag-of-Words, TF-IDF + Linear SVM
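Several teams combined Bag-of-Words with TF-IDF weighting before a linear classifier. The sketch below shows only the weighting step, using the textbook form tf × log(N / df) in plain Python. It is not the participants' code and not scikit-learn's exact formula; the tokenizer, the function names and the toy tweets are assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Very rough tokenizer (assumption): lowercase and split on whitespace."""
    return text.lower().split()


def tfidf(tokenized_docs):
    """Return one {term: weight} dict per document using tf * log(N / df)."""
    n_docs = len(tokenized_docs)
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter(term for doc in tokenized_docs for term in set(doc))
    vectors = []
    for doc in tokenized_docs:
        term_counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in term_counts.items()
        })
    return vectors


# Toy tweets (assumption), loosely modelled on the hashtags in the examples.
tweets = ["#1Oct #Votarem", "#1Oct ILEGAL"]
vectors = tfidf([tokenize(t) for t in tweets])
print(vectors)
```

A term that occurs in every document, such as #1Oct here, gets weight zero, while terms specific to one side of the debate keep a positive weight and can drive the classifier.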
StanceCat: Stance Results
StanceCat: Features Analysis
           TEXT     + CONTEXT    + IMAGES
CATALAN    22.47    29.33        29.13
SPANISH    21.94    26.98        27.09
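The figures in the features analysis can be related to the gains quoted in the conclusions with a short calculation. The sketch assumes that the values are F-scores on a 0-100 scale and that the quoted gains are relative improvements over the text-only setting; both readings are assumptions, since the slides do not say so explicitly.

```python
# Scores copied from the features analysis table.
scores = {
    "Catalan": {"text": 22.47, "context": 29.33, "images": 29.13},
    "Spanish": {"text": 21.94, "context": 26.98, "images": 27.09},
}


def relative_gain(baseline, value):
    """Relative change of value with respect to baseline, in percent."""
    return (value - baseline) / baseline * 100


for language, s in scores.items():
    context_gain = relative_gain(s["text"], s["context"])
    image_gain = relative_gain(s["context"], s["images"])
    print(f"{language}: context {context_gain:+.2f}%, images {image_gain:+.2f}%")
```

Under these assumptions, adding context accounts for almost all of the gain, while adding images changes the score by less than one percent in either direction.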
StanceCat: Error Analysis
● In Catalan, more errors from Against to Favor. In Spanish, more errors
from Favor to Against
● In Catalan, errors from Favor to Against are minimal (0.08%)
StanceCat: Error Analysis (CA)
StanceCat: Error Analysis (ES)
StanceCat: Social Network Analysis
STANCE      SEED     NETWORK      %
IN FAVOR    4,510    808,549      51.44%
AGAINST     1,478    393,405      25.03%
BOTH        27       214,411      13.64%
NEUTRAL     1,041    155,522      9.89%
TOTAL       7,056    1,571,887    100%
• Almost disconnected communities (BOTH: 13.64%)
– The pro-independence community is more closed (51.44% vs. 25.03%)
• Few neutral people (9.89%)
StanceCat: Conclusions
• Multimodal Stance Identification task:
– Text only, text + context, text + context + images
– Catalan and Spanish
• Low participation (only one participant used images)
• Challenging task (imbalanced data):
– In Catalan, most systems performed below the baseline
– In Spanish, the best performing system improved on the
baseline by 9%
• The use of context improved results:
– By more than 30% in Catalan
– By more than 20% in Spanish
• Echo chamber effect:
– There is a lack of interest in communicating with the other
community
StanceCat: Credits
Programa I+D: TIN2015-71147
Thank you!
francisco.rangel@autoritas.es
prosso@dsic.upv.es
amarti@ub.edu
mtaule@ub.edu

Multimodal Stance Detection in Tweets on Catalan #1Oct Referendum @Ibereval 2018 @SEPLN 2018
