SlideShare una empresa de Scribd logo
1 de 52
Descargar para leer sin conexión
Leveraging Engagement:
Big Data Lesson
Agenda
What is Big Data?
•  Some Definitions
•  Mixed Methods Approach
Champion’s League & World Cup Case Study
•  Process
•  Results and Usage
•  Pitfalls and Learnings
Moving Forward
•  Data Approach Flow
•  Caveats
•  Organization and Communication
What is Big Data?
So many different definitions… nobody quite
agrees….
… except that it’s definitely a buzzword
What is Big Data?
It is just generally agreed upon that it’s messy and complex. This
is an opportunity and challenge for us to innovate.
“an all-encompassing term for any collection of data sets so large and complex
that it becomes difficult to process using on-hand data management tools or
traditional data processing applications.”
“Big data is a buzzword, or catch-phrase, used to describe a massive
volume of both structured and unstructured data that is
so large that it's difficult to process using traditional database and
software techniques. In most enterprise scenarios the data is too big or it moves
too fast or it exceeds current processing capacity. Big data has the
potential to help companies improve operations and
make faster, more intelligent decisions.”
“Volume, Variety, Velocity, Variability, Complexity”
Quotes	
  from:	
  	
  
h-p://www.forbes.com/sites/gilpress/2014/09/03/12-­‐big-­‐data-­‐definiBons-­‐whats-­‐yours/2/	
  
h-p://www.webopedia.com/TERM/B/big_data.html	
  
h-p://en.wikipedia.org/wiki/Big_data	
  
What We Do Need to Solve Big
Data?
… for leveraging engagement at least.
…	
  for	
  leveraging	
  engagement	
  at	
  least.	
  
Determine	
  Right	
  
QuesBons	
  and	
  
Goals	
  for	
  Data	
  
Interdisciplinary	
  
Approach	
  
IteraBve	
  
Refinement	
  
“Combining the what (quantitative) with the why (qualitative) can
be exponentially powerful.  It is also critical to our ability to take all our
clickstream data and truly analyze it, to find insights that drive
meaningful website changes that will improve our customers’
experiences.” – Avinash Kaushik
Answer:
Mixed Methods and Innovation
Quote	
  from:	
  	
  Web	
  AnalyBcs	
  in	
  One	
  Hour	
  a	
  Day	
  by	
  Avinash	
  Kaukshik	
  
CHAMPIONS LEAGUE AND
WORLD CUP BIG DATA
DISCOVERY PROCESS
Annenberg Lab Framework
GOALS
Sports Fan and Engagement
Study Overall Goals for HAVAS
•  to identify and define communities of sports fans
based around passion points(A)
•  to analyze fan interactions with those passions
(B)
•  position HAVAS Sports & Entertainment to more
effectively advise brands on how to meaningfully
engage with sports fans by leveraging passion-
based communities.
(C)
Big Data Research Objectives
•  Discover a mixed
methodology
framework for sports
and entertainment fan
engagement
External
for
Havas
•  Justify our fan logic
topology in relation to
Twitter conversations
through natural
language processing
Internal
for Lab
Initial Data Collection Steps
1) Modify data collection process to fit live
soccer events using Champion’s league as
a test run
2) Establish methodology in seeding initial
pool of users, keywords, and hashtags
3) Analyze tweets and how they fit into
logics of engagement
4) Establish methodology in how to gain
insight from twitter conversations
“Analyzing Big Data is a BIG JOB
with Many People” – Jake
Inputs & Equipment
Keywords,
hashtags, user
clusters file on
txt document
Dedicated
server system
colllecting
information
Engineering
Run and modify
Python script
Register Public
Screening API
Parse for
results
Live Viewing Team
Team to watch game and look for patterns
Data Collection Process
Engineering &
Team: Tech and
Data Set-Up
Engineer: Run
Script with Seed
File
Team: Watch Event
for Patterns and
Additional Seeds
Team: Decide Data
to Analyze
Engineer: Parse
Data into User-
Friendly Format
Team: Look at Data
and prepare for
next event
DATA SEED METHODOLOGY
Initial Keyword Seed Scoping
Keep it simple
Discover through
observations
Soccer Hashtags and Keywords
Official
Hashtags
Sponsors
Team
Names
Key
Terms
Key
Players
Headliners
Official Organization Handles
Official Team Handles
Official	
  
Hashtags	
  
Sponsors	
  
Team	
  
Names	
  
Key	
  Terms	
  
Key	
  
Players	
  
Sponsors
Sponsors will often have official hashtags promoted during
sporting events to cross-promote their brand and the sporting
event.
Official	
  
Hashtags	
  
Sponsors	
  
Team	
  
Names	
  
Key	
  
Terms	
  
Key	
  
Players	
  
Supporting Characters
Superfans
-Fans with
unusual
followings on
Twitter
Sports
Commentators
-ESPN
commentators
and the like
Prominent
Bloggers
-Blogs or
bloggers with
large following
on certain
teams
Initial Data Seed Scoping Caveats
• Twitter caps at couple of
thousand tweets per
second on Public API
• Public API received tweets
do not appear to be
affected by location based
factors the way individual
user feeds are
• Twitter chunks these
tweets in mysterious
algorithm it deems
important
• Number of Tweets
scrapped render these
factors nominal in terms of
large-scale user behavior
ENGAGEMENT HYPOTHESIS &
ANALYSIS
What kind of Tweets or tone in
tweets fit into logics of
engagement?
*Informed by survey and ethnography
Entertainment Immersion
Social
Connection
Identification
Mastery Pride Play Advocacy
Operational Process
Plan for World Cup & Modeling with Beacon Capabilities
See how conservations analyzed from a big data perspective fit and build on the
logics of engagement model
Determine what data frameworks worked in capturing useful information
Initial qualitative look at data
Exercise: Seed Scoping
Questions on Approach Before
We Get Into Analysis?
Big Data
Analysis Process to Dashboard
Big Data
Basic Methods of Analysis
• Text processing of tweets and plotting using algorithms into
agglomerative clusters (aka cool visuals)
• Frequency of terms, associations, and word clouds fall under
here
• Goal: Find texts of what spurred the most conversation
Textual
• A way to visually see social connection data
• Understand forms of bonds and the connections between
individual data points worth exploring
• Goal: Detecting communities (our clusters, brands)
Networks
• Toolkits (such as Hootsuite) that measure “sentiment” using
positive and negative language
• Can be used to see if an initiative performed well
• Goal: Measure success of a campaign at different times
Sentiment
Big Data
Low-Hanging Fruit - Topline
Rt Author Screenname
FIFAWorldCup 76172
9GAG 37459
DFB_Team_EN 21247
BBCSport 19564
FCBayern 14782
FTBpro 13409
_Snape_ 11371
benparr 10616
TheTweetOfGod 9435
espn 7465
Queen_UK 7174
thereaIbanksy 7113
sulsultm3 6646
damnitstrue 6603
asshaaban 6513
SportsCenter 6470
fifaworldcup_es 6365
LicDice_ 6361
FIFAworldcup_e 6241
DFB_Team 6114
Argentina 5964
Big Data
Low-Hanging Fruit – Sentiment Analysis
Fan
Handles1 Game
Data2 Brand
Data3 Integrate insights
with Ethnographic
and Survey Data for
final deliverables
Initial Idealized Approach
•  Survey Twitter Handles
–  See if their online behavior matches survey logics
–  What does the content they’re sharing look like
–  Trends by cluster, gender, other data points
•  Match Data
–  Look for clusters of behavior to events in games
–  See popularity of brand campaigns and behavioral response to brand stories
–  Gain insight from bursts of activity and real-time marketing
–  See what are characteristics of influencers
•  Brand Data
–  Identify how these strategies were executed in online conversations and responses
–  Identify types of interactions/content/other markers around brands on Twitter
–  Do influential brands mean consistent users interacting across brands? Why are people
interacting in this way? How can we categorize these interactions according to our logic
clusters?
–  Was the content agile?
–  See how users responded by the logics to different types of content
–  Look for differences in fan response and fan-initiated behavior to the brands
Questions and Hypothesis
What We Planned To Do
•  Steps
•  Define interesting WC fan moments and brand moments
•  Examine moments in time and certain brand campaigns
•  Investigate possible Natural Language Processing tools
•  Formulated Questions
•  Timeline
•  Created a timeline assigning roles to each person
•  Deliverables
•  TBD, likely looking at clusters of behavior around brand campaigns.
•  Sentiment analysis may tie in here
Ethnographic
Report
-What did people
say about the
brand or the
logics they used?
Survey Data
-Under this brand
logic utilized,
what is the
intensity and who
are the clusters?
Big Data
-How did
audiences
respond online to
actions by the
brand?
Approaching with Mixed
Methods
Exercise: Group Datasets
Figure out what insight you might be able
to get from each piece of data and how
would you apply mixed methods.
Dashboard Process
The Future of Social Media
Analytics
“We will be moving beyond key-word based
queries into machine-learning algorithms.
Influencers whom I have with with echo
similar ideas about the increasing use and
refine of latent semantic indexing (or some
variant of it) and other machine-learning
algorithms in order to improve social
listening, automatic categorization of
content, and the ability to take action on
data” - Marshall Sponder
Key
Learnings
for Mood
Board
Ethnography
Survey Twitter Data
Brazil
Brought Together All Data
Concept Creation
The Dashboard Build Process
Pulled 250
Retweeted
Tweets with
Verification
from
BigSheets
Coded
Tweets
According to
Logic for
Testing Data
Built
Dictionary
According to
Sample
Tweets,
Ethnography,
Survey
Created
Natural
Language
Processing
and Machine
Learning
Algorithms
Fan
Engagement
Dashboard
Prototype
Model
Technology
Collaboration
Innovation Fan Engagement
Dashboard Prototype
jStart Beacon
Custom-Built Twitter Collection Web App
jStart BigSheets
Leveraging Engagement Framework
Annenberg Innovation Lab Fan
Engagement Dashboard built through
collaboration and mixed methods
learning.
67% Accuracy in classifying tweets by
Logic of Engagement leading to
actionable insight and business intelligence
for Leveraging Fan Engagement.
The Process End-to-End
Collecting and
Managing Data
Data Back Up Data Clean Up Run Models
Gain InsightsRefine Models
Learn Actionable
Insights
Communicate
Insights (Reports,
Infographic
Blueprints)
Create Initial
Dictionary for
Natural Language
Processing
Annotate/Code
Tweets for
Training Data for
Machine Learning
Created
Dashboard
Improve on
Design
Now What?
Moving Forward
Your Challenge
•  Your data will be different
client-to-client
•  Twitter is just the beginning
•  Your will get to be creative
and work on collaborative
cross-functional teams to
dive into the data
•  *This will be both rewarding
and potentially difficult
Tasks Ahead
•  Begin thinking about
what you can learn from
data to help our sponsors
reach their goals
•  Start thinking about how
your fans behave in your
approach to figuring out
what questions to ask the
data
Most Basic Steps
Determine Goals Capture Data Curate Data
Merge Datasets
and Bring Together
Methodologies if
Necessary
Additional Data
Processing to
Usable Form
Deliver Insight to
the Client
Thinking About Process
Bumps in the Road Ahead
•  Privacy Issues and
Respecting the Fans
•  Company layers and
politics – releasing data
from companies is
fraught with back and
forth
•  Getting data into a
usable form
•  Assumptions were wrong
or have to be redefined
– it’s ok to fail fast – but
be ready to keep moving
•  Working in cross-
functional groups
Image	
  from:	
  CapGemini	
  h-p://www.capgemini.com/sites/default/files/technology-­‐blog/files/2012/09/big-­‐data-­‐vendors.jpg	
  
Cross-Functional Communication
Goal	
   Timing	
  	
  
Point	
  
People	
  
Resources	
  
Needed	
  	
  
Bring it Together
Draw connections between the data sets
and how could they relate to the eight
logics and situational triggers.
“While social media data are always interesting in
themselves (at least, for an analyst), when business
owners are able to combine data and layer them
efficiently, the information will become more useful
and actionable.” – Marshall Sponder
Thank You
Questions?

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Big data ppt
Big data pptBig data ppt
Big data ppt
 
The Pros and Cons of Big Data in an ePatient World
The Pros and Cons of Big Data in an ePatient WorldThe Pros and Cons of Big Data in an ePatient World
The Pros and Cons of Big Data in an ePatient World
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Chapter 4 what is data and data types
Chapter 4  what is data and data typesChapter 4  what is data and data types
Chapter 4 what is data and data types
 
NewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big DataNewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big Data
 
Big data lecture notes
Big data lecture notesBig data lecture notes
Big data lecture notes
 
Big Data can be fun!
Big Data can be fun!Big Data can be fun!
Big Data can be fun!
 
L18 Big Data and Analytics
L18 Big Data and AnalyticsL18 Big Data and Analytics
L18 Big Data and Analytics
 
Big Data for Beginners
Big Data for BeginnersBig Data for Beginners
Big Data for Beginners
 
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of thingsBig Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
 
Big data unit i
Big data unit iBig data unit i
Big data unit i
 
Big data overview external
Big data overview externalBig data overview external
Big data overview external
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
Big Data vs. Small Data...what's the difference?
Big Data vs. Small Data...what's the difference?Big Data vs. Small Data...what's the difference?
Big Data vs. Small Data...what's the difference?
 
Big Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation SlidesBig Data Characteristics And Process PowerPoint Presentation Slides
Big Data Characteristics And Process PowerPoint Presentation Slides
 
10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies10 Most Effective Big Data Technologies
10 Most Effective Big Data Technologies
 
A Big Data Concept
A Big Data ConceptA Big Data Concept
A Big Data Concept
 
Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data Data-Ed Webinar: Demystifying Big Data
Data-Ed Webinar: Demystifying Big Data
 
Big data
Big dataBig data
Big data
 

Destacado (7)

10. Graph Databases
10. Graph Databases10. Graph Databases
10. Graph Databases
 
LO1 - Lesson 13 - Feedback
LO1 - Lesson 13 - FeedbackLO1 - Lesson 13 - Feedback
LO1 - Lesson 13 - Feedback
 
LO1 - Lesson 8 - Storyboards
LO1 - Lesson 8 - StoryboardsLO1 - Lesson 8 - Storyboards
LO1 - Lesson 8 - Storyboards
 
crimes vocabulary
crimes vocabularycrimes vocabulary
crimes vocabulary
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
4 Best Practices for Analyzing Healthcare Data
4 Best Practices for Analyzing Healthcare Data4 Best Practices for Analyzing Healthcare Data
4 Best Practices for Analyzing Healthcare Data
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

Similar a Approaching Big Data: Lesson Plan

Open Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media AnalysisOpen Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media Analysis
ikanow
 
ScienceOnline impact workshop
ScienceOnline impact workshop ScienceOnline impact workshop
ScienceOnline impact workshop
SpotOnLondon
 

Similar a Approaching Big Data: Lesson Plan (20)

Open Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media AnalysisOpen Analytics: Building Effective Frameworks for Social Media Analysis
Open Analytics: Building Effective Frameworks for Social Media Analysis
 
How to use your data science team: Becoming a data-driven organization
How to use your data science team: Becoming a data-driven organizationHow to use your data science team: Becoming a data-driven organization
How to use your data science team: Becoming a data-driven organization
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
Introducing Data Driven Tech Leadership: Social media, Google Analytics, and ...
Introducing Data Driven Tech Leadership: Social media, Google Analytics, and ...Introducing Data Driven Tech Leadership: Social media, Google Analytics, and ...
Introducing Data Driven Tech Leadership: Social media, Google Analytics, and ...
 
Social Media Strategy
Social Media StrategySocial Media Strategy
Social Media Strategy
 
Information Search
Information SearchInformation Search
Information Search
 
Social media analytics powered by data science
Social media analytics powered by data scienceSocial media analytics powered by data science
Social media analytics powered by data science
 
Data Informed Design - Good Tech Test - May 2018
Data Informed Design - Good Tech Test - May 2018Data Informed Design - Good Tech Test - May 2018
Data Informed Design - Good Tech Test - May 2018
 
Narrative Mind Week 7 H4D Stanford 2016
Narrative Mind Week 7 H4D Stanford 2016Narrative Mind Week 7 H4D Stanford 2016
Narrative Mind Week 7 H4D Stanford 2016
 
Finding Meaning in the Numbers: Making Data-Informed Decisions Across Your Or...
Finding Meaning in the Numbers: Making Data-Informed Decisions Across Your Or...Finding Meaning in the Numbers: Making Data-Informed Decisions Across Your Or...
Finding Meaning in the Numbers: Making Data-Informed Decisions Across Your Or...
 
Big data for sales and marketing people
Big data for sales and marketing peopleBig data for sales and marketing people
Big data for sales and marketing people
 
HighRoad Solution Session at AUC016-Creating the Insight-Driven Content Marke...
HighRoad Solution Session at AUC016-Creating the Insight-Driven Content Marke...HighRoad Solution Session at AUC016-Creating the Insight-Driven Content Marke...
HighRoad Solution Session at AUC016-Creating the Insight-Driven Content Marke...
 
More Than a Feeling: Data-Informed Design
More Than a Feeling: Data-Informed DesignMore Than a Feeling: Data-Informed Design
More Than a Feeling: Data-Informed Design
 
[Slides] Social Data Intelligence Webinar, By Susan Etlinger
[Slides] Social Data Intelligence Webinar, By Susan Etlinger[Slides] Social Data Intelligence Webinar, By Susan Etlinger
[Slides] Social Data Intelligence Webinar, By Susan Etlinger
 
Using Google Analytics To Market Your Software Idea
Using Google Analytics To Market Your Software IdeaUsing Google Analytics To Market Your Software Idea
Using Google Analytics To Market Your Software Idea
 
Social Data Intelligence: Webinar with Susan Etlinger
Social Data Intelligence: Webinar with Susan EtlingerSocial Data Intelligence: Webinar with Susan Etlinger
Social Data Intelligence: Webinar with Susan Etlinger
 
Introduction to enterprise search for intranets and websites
Introduction to enterprise search for intranets and websitesIntroduction to enterprise search for intranets and websites
Introduction to enterprise search for intranets and websites
 
ScienceOnline impact workshop
ScienceOnline impact workshop ScienceOnline impact workshop
ScienceOnline impact workshop
 

Más de Bessie Chu

Más de Bessie Chu (16)

Product Management Impact Mapping
Product Management Impact MappingProduct Management Impact Mapping
Product Management Impact Mapping
 
Brand Building for Start-Ups
Brand Building for Start-UpsBrand Building for Start-Ups
Brand Building for Start-Ups
 
Tweet Tracking App Design Document
Tweet Tracking App Design DocumentTweet Tracking App Design Document
Tweet Tracking App Design Document
 
Simple Slide Design and Data Visualization Crash Course
Simple Slide Design and Data Visualization Crash CourseSimple Slide Design and Data Visualization Crash Course
Simple Slide Design and Data Visualization Crash Course
 
Marketing to Female Sports Fans
Marketing to Female Sports FansMarketing to Female Sports Fans
Marketing to Female Sports Fans
 
Thesis: Marketing to Female Sports Fans
Thesis: Marketing to Female Sports Fans Thesis: Marketing to Female Sports Fans
Thesis: Marketing to Female Sports Fans
 
Video Game Journalism: Content Analysis
Video Game Journalism: Content AnalysisVideo Game Journalism: Content Analysis
Video Game Journalism: Content Analysis
 
Comparative Broadband Policy: Hong Kong, Singapore, and Taiwan
Comparative Broadband Policy: Hong Kong, Singapore, and TaiwanComparative Broadband Policy: Hong Kong, Singapore, and Taiwan
Comparative Broadband Policy: Hong Kong, Singapore, and Taiwan
 
Mad Men and Scandal: Marketing via Fan Tastes
Mad Men and Scandal: Marketing via Fan TastesMad Men and Scandal: Marketing via Fan Tastes
Mad Men and Scandal: Marketing via Fan Tastes
 
Snapbasket Survey Results
Snapbasket Survey ResultsSnapbasket Survey Results
Snapbasket Survey Results
 
Data Visualization as Narrative?
Data Visualization as Narrative?Data Visualization as Narrative?
Data Visualization as Narrative?
 
Design and Material Culture: Taiwan Case Study
Design and Material Culture: Taiwan Case StudyDesign and Material Culture: Taiwan Case Study
Design and Material Culture: Taiwan Case Study
 
Amazon Web Services SWOT & Competitor Analysis
Amazon Web Services SWOT & Competitor AnalysisAmazon Web Services SWOT & Competitor Analysis
Amazon Web Services SWOT & Competitor Analysis
 
Social Good App for EBT Users: SNAPBasket
Social Good App for EBT Users: SNAPBasketSocial Good App for EBT Users: SNAPBasket
Social Good App for EBT Users: SNAPBasket
 
Amazon Web Services SWOT
Amazon Web Services SWOTAmazon Web Services SWOT
Amazon Web Services SWOT
 
Square Payments Class Presentation
Square Payments Class PresentationSquare Payments Class Presentation
Square Payments Class Presentation
 

Último

Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
wsppdmt
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 

Último (20)

Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
如何办理英国诺森比亚大学毕业证(NU毕业证书)成绩单原件一模一样
 
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptxRESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
Gomti Nagar & best call girls in Lucknow | 9548273370 Independent Escorts & D...
 

Approaching Big Data: Lesson Plan

  • 2. Agenda What is Big Data? •  Some Definitions •  Mixed Methods Approach Champion’s League & World Cup Case Study •  Process •  Results and Usage •  Pitfalls and Learnings Moving Forward •  Data Approach Flow •  Caveats •  Organization and Communication
  • 3. What is Big Data? So many different definitions… nobody quite agrees…. … except that it’s definitely a buzzword
  • 4. What is Big Data? It is just generally agreed upon that it’s messy and complex. This is an opportunity and challenge for us to innovate. “an all-encompassing term for any collection of data sets so large and complex that it becomes difficult to process using on-hand data management tools or traditional data processing applications.” “Big data is a buzzword, or catch-phrase, used to describe a massive volume of both structured and unstructured data that is so large that it's difficult to process using traditional database and software techniques. In most enterprise scenarios the data is too big or it moves too fast or it exceeds current processing capacity. Big data has the potential to help companies improve operations and make faster, more intelligent decisions.” “Volume, Variety, Velocity, Variability, Complexity” Quotes  from:     h-p://www.forbes.com/sites/gilpress/2014/09/03/12-­‐big-­‐data-­‐definiBons-­‐whats-­‐yours/2/   h-p://www.webopedia.com/TERM/B/big_data.html   h-p://en.wikipedia.org/wiki/Big_data  
  • 5. What We Do Need to Solve Big Data? … for leveraging engagement at least.
  • 6. …  for  leveraging  engagement  at  least.   Determine  Right   QuesBons  and   Goals  for  Data   Interdisciplinary   Approach   IteraBve   Refinement   “Combining the what (quantitative) with the why (qualitative) can be exponentially powerful.  It is also critical to our ability to take all our clickstream data and truly analyze it, to find insights that drive meaningful website changes that will improve our customers’ experiences.” – Avinash Kaushik Answer: Mixed Methods and Innovation Quote  from:    Web  AnalyBcs  in  One  Hour  a  Day  by  Avinash  Kaukshik  
  • 7. CHAMPIONS LEAGUE AND WORLD CUP BIG DATA DISCOVERY PROCESS Annenberg Lab Framework
  • 9. Sports Fan and Engagement Study Overall Goals for HAVAS •  to identify and define communities of sports fans based around passion points(A) •  to analyze fan interactions with those passions (B) •  position HAVAS Sports & Entertainment to more effectively advise brands on how to meaningfully engage with sports fans by leveraging passion- based communities. (C)
  • 10. Big Data Research Objectives •  Discover a mixed methodology framework for sports and entertainment fan engagement External for Havas •  Justify our fan logic topology in relation to Twitter conversations through natural language processing Internal for Lab
  • 11. Initial Data Collection Steps 1) Modify data collection process to fit live soccer events using Champion’s league as a test run 2) Establish methodology in seeding initial pool of users, keywords, and hashtags 3) Analyze tweets and how they fit into logics of engagement 4) Establish methodology in how to gain insight from twitter conversations
  • 12. “Analyzing Big Data is a BIG JOB with Many People” – Jake Inputs & Equipment Keywords, hashtags, user clusters file on txt document Dedicated server system colllecting information Engineering Run and modify Python script Register Public Screening API Parse for results Live Viewing Team Team to watch game and look for patterns
  • 13. Data Collection Process Engineering & Team: Tech and Data Set-Up Engineer: Run Script with Seed File Team: Watch Event for Patterns and Additional Seeds Team: Decide Data to Analyze Engineer: Parse Data into User- Friendly Format Team: Look at Data and prepare for next event
  • 15. Initial Keyword Seed Scoping Keep it simple Discover through observations
  • 16. Soccer Hashtags and Keywords Official Hashtags Sponsors Team Names Key Terms Key Players
  • 17. Headliners Official Organization Handles Official Team Handles Official   Hashtags   Sponsors   Team   Names   Key  Terms   Key   Players  
  • 18. Sponsors Sponsors will often have official hashtags promoted during sporting events to cross-promote their brand and the sporting event. Official   Hashtags   Sponsors   Team   Names   Key   Terms   Key   Players  
  • 19. Supporting Characters Superfans -Fans with unusual followings on Twitter Sports Commentators -ESPN commentators and the like Prominent Bloggers -Blogs or bloggers with large following on certain teams
  • 20. Initial Data Seed Scoping Caveats • Twitter caps at couple of thousand tweets per second on Public API • Public API received tweets do not appear to be affected by location based factors the way individual user feeds are • Twitter chunks these tweets in mysterious algorithm it deems important • Number of Tweets scrapped render these factors nominal in terms of large-scale user behavior
  • 22. What kind of Tweets or tone in tweets fit into logics of engagement? *Informed by survey and ethnography Entertainment Immersion Social Connection Identification Mastery Pride Play Advocacy
  • 23. Operational Process Plan for World Cup & Modeling with Beacon Capabilities See how conservations analyzed from a big data perspective fit and build on the logics of engagement model Determine what data frameworks worked in capturing useful information Initial qualitative look at data
  • 25. Questions on Approach Before We Get Into Analysis?
  • 27. Big Data Basic Methods of Analysis • Text processing of tweets and plotting using algorithms into agglomerative clusters (aka cool visuals) • Frequency of terms, associations, and word clouds fall under here • Goal: Find texts of what spurred the most conversation Textual • A way to visually see social connection data • Understand forms of bonds and the connections between individual data points worth exploring • Goal: Detecting communities (our clusters, brands) Networks • Toolkits (such as Hootsuite) that measure “sentiment” using positive and negative language • Can be used to see if an initiative performed well • Goal: Measure success of a campaign at different times Sentiment
  • 28. Big Data Low-Hanging Fruit - Topline Rt Author Screenname FIFAWorldCup 76172 9GAG 37459 DFB_Team_EN 21247 BBCSport 19564 FCBayern 14782 FTBpro 13409 _Snape_ 11371 benparr 10616 TheTweetOfGod 9435 espn 7465 Queen_UK 7174 thereaIbanksy 7113 sulsultm3 6646 damnitstrue 6603 asshaaban 6513 SportsCenter 6470 fifaworldcup_es 6365 LicDice_ 6361 FIFAworldcup_e 6241 DFB_Team 6114 Argentina 5964
  • 29. Big Data Low-Hanging Fruit – Sentiment Analysis
  • 30. Fan Handles1 Game Data2 Brand Data3 Integrate insights with Ethnographic and Survey Data for final deliverables Initial Idealized Approach
  • 31. •  Survey Twitter Handles –  See if their online behavior matches survey logics –  What does the content they’re sharing look like –  Trends by cluster, gender, other data points •  Match Data –  Look for clusters of behavior to events in games –  See popularity of brand campaigns and behavioral response to brand stories –  Gain insight from bursts of activity and real-time marketing –  See what are characteristics of influencers •  Brand Data –  Identify how these strategies were executed in online conversations and responses –  Identify types of interactions/content/other markers around brands on Twitter –  Do influential brands mean consistent users interacting across brands? Why are people interacting in this way? How can we categorize these interactions according to our logic clusters? –  Was the content agile? –  See how users responded by the logics to different types of content –  Look for differences in fan response and fan-initiated behavior to the brands Questions and Hypothesis
  • 32. What We Planned To Do •  Steps •  Define interesting WC fan moments and brand moments •  Examine moments in time and certain brand campaigns •  Investigate possible Natural Language Processing tools •  Formulated Questions •  Timeline •  Created a timeline assigning roles to each person •  Deliverables •  TBD, likely looking at clusters of behavior around brand campaigns. •  Sentiment analysis may tie in here
  • 33. Ethnographic Report -What did people say about the brand or the logics they used? Survey Data -Under this brand logic utilized, what is the intensity and who are the clusters? Big Data -How did audiences respond online to actions by the brand? Approaching with Mixed Methods
  • 34. Exercise: Group Datasets Figure out what insight you might be able to get from each piece of data and how would you apply mixed methods.
  • 36. The Future of Social Media Analytics “We will be moving beyond key-word based queries into machine-learning algorithms. Influencers whom I have with with echo similar ideas about the increasing use and refine of latent semantic indexing (or some variant of it) and other machine-learning algorithms in order to improve social listening, automatic categorization of content, and the ability to take action on data” - Marshall Sponder
  • 37. Key Learnings for Mood Board Ethnography Survey Twitter Data Brazil Brought Together All Data
  • 39. The Dashboard Build Process Pulled 250 Retweeted Tweets with Verification from BigSheets Coded Tweets According to Logic for Testing Data Built Dictionary According to Sample Tweets, Ethnography, Survey Created Natural Language Processing and Machine Learning Algorithms Fan Engagement Dashboard Prototype
  • 40. Model Technology Collaboration Innovation Fan Engagement Dashboard Prototype jStart Beacon Custom-Built Twitter Collection Web App jStart BigSheets Leveraging Engagement Framework
  • 41. Annenberg Innovation Lab Fan Engagement Dashboard built through collaboration and mixed methods learning. 67% Accuracy in classifying tweets by Logic of Engagement leading to actionable insight and business intelligence for Leveraging Fan Engagement.
  • 42.
  • 43. The Process End-to-End Collecting and Managing Data Data Back Up Data Clean Up Run Models Gain InsightsRefine Models Learn Actionable Insights Communicate Insights (Reports, Infographic Blueprints) Create Initial Dictionary for Natural Language Processing Annotate/Code Tweets for Training Data for Machine Learning Created Dashboard Improve on Design
  • 45. Moving Forward Your Challenge •  Your data will be different client-to-client •  Twitter is just the beginning •  Your will get to be creative and work on collaborative cross-functional teams to dive into the data •  *This will be both rewarding and potentially difficult Tasks Ahead •  Begin thinking about what you can learn from data to help our sponsors reach their goals •  Start thinking about how your fans behave in your approach to figuring out what questions to ask the data
  • 46. Most Basic Steps Determine Goals Capture Data Curate Data Merge Datasets and Bring Together Methodologies if Necessary Additional Data Processing to Usable Form Deliver Insight to the Client
  • 48. Bumps in the Road Ahead •  Privacy Issues and Respecting the Fans •  Company layers and politics – releasing data from companies is fraught with back and forth •  Getting data into a usable form •  Assumptions were wrong or have to be redefined – it’s ok to fail fast – but be ready to keep moving •  Working in cross- functional groups
  • 49. Image  from:  CapGemini  h-p://www.capgemini.com/sites/default/files/technology-­‐blog/files/2012/09/big-­‐data-­‐vendors.jpg  
  • 50. Cross-Functional Communication Goal   Timing     Point   People   Resources   Needed    
  • 51. Bring it Together Draw connections between the data sets and how could they relate to the eight logics and situational triggers. “While social media data are always interesting in themselves (at least, for an analyst), when business owners are able to combine data and layer them efficiently, the information will become more useful and actionable.” – Marshall Sponder