Social Networking: Visualizing Twitter
TEAM BIRCH: Chris, Ruth, Nut, Aminu and Anil
Overview
1. Introduction
2. Background to Twitter and Boston Bombings
3. Big and Dirty Data Issues
4. Process: Capturing the integrated learning process
5. 5 W’s of Twitter Analytics
6. DEMO ‘Visualisation’
7. Further Work
8. Learning Outcomes
Who We Are
• Aminu
• Anil
• Chris
• Nut
• Ruth
Our Data
• Twitter data from 16:00 to 19:00 covering the Boston Marathon bombing
• Approx. 550,000 tweets covering the 3-hour period
• Challenges
– Data format
– Lack of information
– UserIDs vs. UserNames
Big and Dirty Data Issues
1. Each tweet should have a record of its own! (Lines)
2. Formatting Issues
3. No standardisation (only ~10% of tweets had geo-location)
4. Only 5 fields, so we had to create three more
5. Different languages
6. Information overload – many different patterns identified, therefore difficult to focus on a particular visualisation.
Overview of Process
• Acquire
– Python script harvests tweets using the Twitter API
• Parse/Filter/Mine
– MapReduce code processes tweets
– Write out text files relevant to the analytics
• Represent
– Create visualisation in Tableau Public and Google Fusion
• Interact
– Display in web portal on the user's screen
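The "write out text files" step can be illustrated with a minimal sketch. The deck does not show the team's file layout, so the tab-separated format, the `term`/`count` column names and the `write_counts` helper are all assumptions made for illustration.

```python
import csv
import io


def write_counts(counts, handle):
    """Write (term, count) rows as tab-separated text, highest count first.

    The column names and the tab delimiter are assumptions; the original
    deck does not specify the layout of the team's text files.
    """
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["term", "count"])
    # Sort by descending count, then alphabetically so ties are stable.
    for term, count in sorted(counts.items(), key=lambda item: (-item[1], item[0])):
        writer.writerow([term, count])


# Usage: write to an in-memory buffer; a real run would pass an open file.
buffer = io.StringIO()
write_counts({"marathon": 1, "boston": 2}, buffer)
print(buffer.getvalue())
```

A delimited text file with a header row is used here because it is a common interchange format for visualisation tools; the team's actual format may have differed.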
MapReduce
MapReduce code processes tweets
• Parse
– Add information where possible: retweet/hashtag/touser
• Filter
– Remove records with invalid fields
– Split into geocoded and non-geocoded
• Mine
– Word counts
– Hashtag counts: all, and split by location / original vs. retweet
– Sentiment extraction
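The parse, filter and mine steps above can be sketched in plain Python. This is not the team's code: the record fields (`user_id`, `text`, `geo`), the validity rule and every function name are hypothetical, and ordinary functions stand in for a real MapReduce framework.

```python
import re
from collections import Counter

HASHTAG_RE = re.compile(r"#(\w+)")
MENTION_RE = re.compile(r"^@(\w+)")


def parse(record):
    """Parse: add derived fields (retweet flag, hashtags, addressed user)."""
    text = record["text"]
    to_user = MENTION_RE.match(text)
    return {
        **record,
        "is_retweet": text.startswith("RT @"),
        "hashtags": [tag.lower() for tag in HASHTAG_RE.findall(text)],
        "to_user": to_user.group(1) if to_user else None,
    }


def is_valid(record):
    """Filter: keep records with a user ID and non-empty text (assumed rule)."""
    return bool(record.get("user_id")) and bool(record.get("text", "").strip())


def split_by_geocode(records):
    """Filter: split records into (geocoded, non-geocoded) lists."""
    geocoded = [r for r in records if r.get("geo")]
    non_geocoded = [r for r in records if not r.get("geo")]
    return geocoded, non_geocoded


def map_hashtags(record):
    """Map: emit one (hashtag, 1) pair per hashtag in a parsed tweet."""
    return [(tag, 1) for tag in record["hashtags"]]


def reduce_counts(pairs):
    """Reduce: sum the emitted values for each key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return totals


def hashtag_counts(raw_records):
    """Run filter, parse, map and reduce over an iterable of raw records."""
    parsed = [parse(r) for r in raw_records if is_valid(r)]
    pairs = [pair for record in parsed for pair in map_hashtags(record)]
    return reduce_counts(pairs)


# Made-up sample records, for illustration only.
sample = [
    {"user_id": "1", "text": "RT @news: Explosions reported #Boston #marathon", "geo": None},
    {"user_id": "2", "text": "@friend are you safe? #boston", "geo": (42.35, -71.08)},
    {"user_id": "", "text": "no user id, dropped by the filter #boston", "geo": None},
]
print(hashtag_counts(sample))
```

Word counts follow the same map and reduce shape with a word tokeniser in place of the hashtag pattern; sentiment extraction is left out because the deck does not say which method was used.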
Visualisation Tools Used
Created a Real-time Twitter Analytics Portal with
• Tableau Public
• Google Fusion
• Wix Web Portal
• Purpose:
– Insight
– Exploratory
– Confirmation
Twitter Analytics
• 5 W’s of Social Media!
– Who
– What
– Where
– When
– Why
DEMO
Future Work
• Gain a holistic view of the story over time
– Bombing – 15th April
– Shooting – 18th April
– Firefight & Manhunt – 19th April
• Reflect the story as it evolved
– Clustering
– NLP (to move from basic to advanced analytics)
– Explore more visualisation types
Thank you for Listening!
TEAM BIRCH: Chris, Ruth, Nut, Aminu and Anil
