SlideShare una empresa de Scribd logo
1 de 15
Tweet4act: Using Incident-Specific Profiles for
Classifying Crisis-Related Messages
Soudip Roy Chowdhury, Muhammad Imran, Muhammad Rizwan Asghar,
Sihem Amer-Yahia, Carlos Castillo
Disaster & Social Media
Disaster Strikes, Social Media Responds
Virtual Collaboration, Information Sharing
• Valuable information
• Contribute to situational awareness
• Highly useful, if analyzed timely and
effectively(Starbird et al., 2010; Latonero and Shklovski, 2010)
Social Media Response to Disaster Phases
Before
During
After
Disaster Management, Crisis Informatics
- Caution, warnings
- Alerts etc.
- Damage
- Causalities etc.
- Request for help
- Donations etc.
• The main goals of our research:
1. Identify messages related to an incident.
2. Classify incident-messages with the corresponding period
(PRE, DURING, POST).
Datasets & Examples
1. Joplin Tornado on May 22, 2011
2. Nesat Typhoon in Philipines on Sep 27, 2011
3. Haiti Earthquake on Jan 12, 2010
• [PRE] New #tropical storm forms in the West #Pacific. #Nesat may hit the #Philippines & #China as a #typhoon
next week
• [DURING] @Yahoo News: Powerful #typhoon with winds up to 106 mph makes landfall in #Philippines as
100,000 odered to fless homes
• [POST] News5 Action center is now accepting donations for the victims of Typhoon “pedring. Drop boxes are
located @ TV5 Office :)
Tweet4Act System
• Collection -> Filtering -> Period Classification
1. Filtering Process
• Normalization: remove “RT @username” and “@username”
prefixes and remove duplicate messages
• Apply the k-mediod method with the manhattan distance
between medoids and messages in each cluster
• Discard all cluster having a negative number or zero as
silhouette coefficient
• Select from each cluster the fraction m messages closer to
the mediod
Filtering Process Validation
• Using CrowdFlower crowdsourcing platform
2. Dictionary Based Period Classification
• Most frequent words across datasets
• “warning” & “alert” typically found in the Pre
• “now”, “sweeps” etc. typically found in During
• “aftermath”, “donate” etc. typically found in Post
3. NLP-Based Period Classification
• Tense of verbs can help identify period. (A. Iyengar et al., 2011)
POS tagging
1. Dictionary based verbs get +1 (ignore below)
2. Aux verbs get +1(e.g., could-PRE, are-DURING, did-POST)
3. If a main verb in future/present/past tense, add +0.5 to
pre/during/post period, respectively.
Ties: PRE > DURING > POST
Manual Period Classification
• CrowdFlower crowdsourcing period labeling
Performance of Tweet4Act
Period Tweet4act SVM MaxEnt Tree RF
P R F1 P R F1 P R F1 P R F1 P R F1
Joplin Tornado
PRE 0.33 0.85 0.48 0.00 0.00 0.00 0.43 0.21 0.28 0 0 0 0 0 0
DURIN
G
0.88 0.89 0.89 0.32 0.91 0.47 0.35 0.55 0.43 0.3 0.73 0.43 0.32 0.1 0.48
POST 0.61 0.84 0.71 0.67 0.20 0.31 0.55 0.6 0.57 0.57 0.4 0.47 1.00 0.1 0.18
AVG. 0.61 0.86 0.69 0.33 0.37 0.39 0.44 0.45 0.42 0.29 0.38 0.45 0.66 0.37 0.33
Haiti Earthquake
PRE 0.63 1.00 0.77 1.00 0.67 0.80 1.00 1.00 1.00 1 0.67 0.8 1.00 0.33 0.5
DURIN
G
0.72 0.97 0.83 0.75 0.6 0.67 0.67 0.80 0.73 0.6 0.6 0.6 1.00 0.4 0.57
POST 0.46 0.82 0.59 0.92 0.97 0.94 0.97 0.95 0.96 0.92 0.95 0.93 0.88 1.00 0.94
AVG. 0.60 0.87 0.71 0.89 0.74 0.80 0.88 0.91 0.89 0.84 0.74 0.78 0.96 0.58 0.67
Nesat Typhoon
PRE 0.36 1.00 0.53 1.00 0.50 0.67 1.00 0.50 0.67 0.33 0.25 0.28 1.00 0.5 0.67
DURIN
G
0.94 0.94 0.94 0.79 1.00 0.88 0.81 1.00 0.90 0.71 0.77 0.74 0.79 1 0.88
POST 0.52 0.85 0.65 1.00 0.2 0.33 1.00 0.40 0.57 0 0 0 1.00 0.2 0.33
AVG. 0.61 0.93 0.71 0.93 0.57 0.62 0.94 0.63 0.71 0.35 0.34 0.51 0.93 0.57 0.63
PRE
PRE
PRE
DURING
DURING
DURING
POST
POST
POST
AVG
AVG
AVG
References
• A. Iyengar, T. Finin, and A. Joshi (2011) Content-based prediction of
temporal boundaries for events in Twitter. In Proceedings of the Third IEEE
International Conference on Social Computing.
• K. Starbird, L. Palen, A. Hughes, and S. Vieweg (2010) Chatter on the red:
what hazards threat reveals about the social life of microblogged
information. In Proceedings of the 2010 ACM conference on Computer
supported cooperative work, pages 241–250. ACM.
• Latonero, Mark, and Irina Shklovski. "“Respectfully Yours in Safety and
Service”: Emergency Management & Social Media Evangelism.”
Proceedings of the 7th International ISCRAM Conference– Seattle. Vol. 1.
2010.
Thank you!
Muhammad Imran
mimran@qf.org.qa

Más contenido relacionado

Más de Muhammad Imran

A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...Muhammad Imran
 
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioSummarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioMuhammad Imran
 
The Role of Social Media and Artificial Intelligence for Disaster Response
The Role of Social Media and Artificial Intelligence for Disaster ResponseThe Role of Social Media and Artificial Intelligence for Disaster Response
The Role of Social Media and Artificial Intelligence for Disaster ResponseMuhammad Imran
 
Introduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster ResponseIntroduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster ResponseMuhammad Imran
 
Artificial Intelligence for Disaster Response
Artificial Intelligence for Disaster ResponseArtificial Intelligence for Disaster Response
Artificial Intelligence for Disaster ResponseMuhammad Imran
 
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...Muhammad Imran
 
Domain Specific Mashups
Domain Specific MashupsDomain Specific Mashups
Domain Specific MashupsMuhammad Imran
 
Reseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECOReseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECOMuhammad Imran
 
ResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platformResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platformMuhammad Imran
 

Más de Muhammad Imran (9)

A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
A Robust Framework for Classifying Evolving Document Streams in an Expert-Mac...
 
Summarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis ScenarioSummarizing Situational Tweets in Crisis Scenario
Summarizing Situational Tweets in Crisis Scenario
 
The Role of Social Media and Artificial Intelligence for Disaster Response
The Role of Social Media and Artificial Intelligence for Disaster ResponseThe Role of Social Media and Artificial Intelligence for Disaster Response
The Role of Social Media and Artificial Intelligence for Disaster Response
 
Introduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster ResponseIntroduction to Machine Learning: An Application to Disaster Response
Introduction to Machine Learning: An Application to Disaster Response
 
Artificial Intelligence for Disaster Response
Artificial Intelligence for Disaster ResponseArtificial Intelligence for Disaster Response
Artificial Intelligence for Disaster Response
 
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Di...
 
Domain Specific Mashups
Domain Specific MashupsDomain Specific Mashups
Domain Specific Mashups
 
Reseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECOReseval Mashup Platform Talk at SECO
Reseval Mashup Platform Talk at SECO
 
ResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platformResEval: Resource-oriented Research Impact Evaluation platform
ResEval: Resource-oriented Research Impact Evaluation platform
 

Último

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 

Último (20)

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 

Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

  • 1. Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages Soudip Roy Chowdhury, Muhammad Imran, Muhammad Rizwan Asghar, Sihem Amer-Yahia, Carlos Castillo
  • 2. Disaster & Social Media Disaster Strikes, Social Media Responds
  • 3. Virtual Collaboration, Information Sharing • Valuable information • Contribute to situational awareness • Highly useful, if analyzed timely and effectively(Starbird et al., 2010; Latonero and Shklovski, 2010)
  • 4. Social Media Response to Disaster Phases Before During After
  • 5. Disaster Management, Crisis Informatics - Caution, warnings - Alerts etc. - Damage - Causalities etc. - Request for help - Donations etc. • The main goals of our research: 1. Identify messages related to an incident. 2. Classify incident-messages with the corresponding period (PRE, DURING, POST).
  • 6. Datasets & Examples 1. Joplin Tornado on May 22, 2011 2. Nesat Typhoon in Philipines on Sep 27, 2011 3. Haiti Earthquake on Jan 12, 2010 • [PRE] New #tropical storm forms in the West #Pacific. #Nesat may hit the #Philippines & #China as a #typhoon next week • [DURING] @Yahoo News: Powerful #typhoon with winds up to 106 mph makes landfall in #Philippines as 100,000 odered to fless homes • [POST] News5 Action center is now accepting donations for the victims of Typhoon “pedring. Drop boxes are located @ TV5 Office :)
  • 7. Tweet4Act System • Collection -> Filtering -> Period Classification
  • 8. 1. Filtering Process • Normalization: remove “RT @username” and “@username” prefixes and remove duplicate messages • Apply the k-mediod method with the manhattan distance between medoids and messages in each cluster • Discard all cluster having a negative number or zero as silhouette coefficient • Select from each cluster the fraction m messages closer to the mediod
  • 9. Filtering Process Validation • Using CrowdFlower crowdsourcing platform
  • 10. 2. Dictionary Based Period Classification • Most frequent words across datasets • “warning” & “alert” typically found in the Pre • “now”, “sweeps” etc. typically found in During • “aftermath”, “donate” etc. typically found in Post
  • 11. 3. NLP-Based Period Classification • Tense of verbs can help identify period. (A. Iyengar et al., 2011) POS tagging 1. Dictionary based verbs get +1 (ignore below) 2. Aux verbs get +1(e.g., could-PRE, are-DURING, did-POST) 3. If a main verb in future/present/past tense, add +0.5 to pre/during/post period, respectively. Ties: PRE > DURING > POST
  • 12. Manual Period Classification • CrowdFlower crowdsourcing period labeling
  • 13. Performance of Tweet4Act Period Tweet4act SVM MaxEnt Tree RF P R F1 P R F1 P R F1 P R F1 P R F1 Joplin Tornado PRE 0.33 0.85 0.48 0.00 0.00 0.00 0.43 0.21 0.28 0 0 0 0 0 0 DURIN G 0.88 0.89 0.89 0.32 0.91 0.47 0.35 0.55 0.43 0.3 0.73 0.43 0.32 0.1 0.48 POST 0.61 0.84 0.71 0.67 0.20 0.31 0.55 0.6 0.57 0.57 0.4 0.47 1.00 0.1 0.18 AVG. 0.61 0.86 0.69 0.33 0.37 0.39 0.44 0.45 0.42 0.29 0.38 0.45 0.66 0.37 0.33 Haiti Earthquake PRE 0.63 1.00 0.77 1.00 0.67 0.80 1.00 1.00 1.00 1 0.67 0.8 1.00 0.33 0.5 DURIN G 0.72 0.97 0.83 0.75 0.6 0.67 0.67 0.80 0.73 0.6 0.6 0.6 1.00 0.4 0.57 POST 0.46 0.82 0.59 0.92 0.97 0.94 0.97 0.95 0.96 0.92 0.95 0.93 0.88 1.00 0.94 AVG. 0.60 0.87 0.71 0.89 0.74 0.80 0.88 0.91 0.89 0.84 0.74 0.78 0.96 0.58 0.67 Nesat Typhoon PRE 0.36 1.00 0.53 1.00 0.50 0.67 1.00 0.50 0.67 0.33 0.25 0.28 1.00 0.5 0.67 DURIN G 0.94 0.94 0.94 0.79 1.00 0.88 0.81 1.00 0.90 0.71 0.77 0.74 0.79 1 0.88 POST 0.52 0.85 0.65 1.00 0.2 0.33 1.00 0.40 0.57 0 0 0 1.00 0.2 0.33 AVG. 0.61 0.93 0.71 0.93 0.57 0.62 0.94 0.63 0.71 0.35 0.34 0.51 0.93 0.57 0.63 PRE PRE PRE DURING DURING DURING POST POST POST AVG AVG AVG
  • 14. References • A. Iyengar, T. Finin, and A. Joshi (2011) Content-based prediction of temporal boundaries for events in Twitter. In Proceedings of the Third IEEE International Conference on Social Computing. • K. Starbird, L. Palen, A. Hughes, and S. Vieweg (2010) Chatter on the red: what hazards threat reveals about the social life of microblogged information. In Proceedings of the 2010 ACM conference on Computer supported cooperative work, pages 241–250. ACM. • Latonero, Mark, and Irina Shklovski. "“Respectfully Yours in Safety and Service”: Emergency Management & Social Media Evangelism.” Proceedings of the 7th International ISCRAM Conference– Seattle. Vol. 1. 2010.

Notas del editor

  1. Social media empowers individuals, providing them a platform from which to share opinions, experiences and information from anywhere at any time.