SlideShare una empresa de Scribd logo
1 de 46
Who are We Studying:
Bots or Humans?
Anatoliy Gruzd
gruzd@ryerson.ca
@gruzd
Canada Research Chair in Social Media Data Stewardship
Associate Professor, Ted Rogers School of Management
Director, Social Media Lab
Ryerson University
Research at the Social Media Lab
• Social Media Analytics
• Social Media Data Stewardship
• Networked Influence
• Online Political Engagement
• Learning Analytics
• Social Media & Health
Outline
• Social Media Analytics
• The Rise of the Bots
• Case Study: Social Media Use during the 2014 EuroMaidan Revolution in Ukraine
• Detecting Bots
• Next steps
1.7B
users
500M
users
300M
users
Growth of Social Media Data
Self-
collected
/reported
Public
APIs
Data
Resellers
+ More Ways to Access Social Media Data
Cloud &
Distributed
Computing
Data &
Information
Organization
Analytics Visualization
+ More Tools for Big Data Analytics
Data -> Visualizations -> Understanding
How to Make Sense of Social Media Data?
Twitter: @gruzdAnatoliy Gruzd 7
How to Make Sense of Social Media Data?
Example: Geo-based Analysis
Twitter: @gruzdAnatoliy Gruzd 8
How to Make Sense of Social Media Data?
Example: Geo-based + Content Analysis
Tracking Hate Speech on Twitter
Twitter: @gruzd Anatoliy Gruzd 9
Source: http://www.fenuxe.com/tag/geo-coded
The Rise of Social Bots
• Who are we studying:
Humans or Bots?
Social Media Data Analytics Challenges
Social Bot – software
designed to act on the
Internet with some level
of autonomy
Different Types of Bots
Free music,
games, books,
downloads
Jewelery,
electronics,
vehicles
Contest,
gambling,
prizes
Finance, loans,
realty
Increase
Twitter
following
DietAdult
(Grier et al, 2010)
Sample Twitter Bots
Social Bot Example:
Microsoft’s AI Twitter
chatbot
Social Bot Example:
Microsoft’s AI Twitter
chatbot
Platform-reported & Estimated % of Bots
Fake
5%
Fake
2%
Fake
8%
Source: http://blogs.wsj.com/digits/2015/06/30/fake-accounts-still-plague-instagram-despite-purge-study-finds/
1.5B
users
300M
users
400M
users
… but is that everything?
(30,000,000)(15,000,000) (32,000,000)
Why does it matter if there are bots,
spammers and fakers in our datasets?
Popular topics mentioned in the
14,500 abstracts of journal &
conference papers on “social media”
or “social networking websites”
published since 1999
(Gruzd, 2015)
Why does it matter if there are bots,
spammers and fakers in our datasets?
How many of these 14,500
papers took into account the
presence and influence of bots,
spammers or fakers ?
(Gruzd, 2015)
Case Study: 2014 EuroMaidan Revolution in Ukraine
"2014-02-21 11-04 Euromaidan in Kiev" by Amakuha. Licensed
under CC BY-SA 3.0 via Wikimedia
November 21, 2013 - Ukraine gov. suspended the
trade & association agreement with EU
Gruzd, A., & Tsyganova, K. (2015). Information Wars and Online Activism During
the 2013/2014 Crisis in Ukraine: Examining the Social Structures of Pro- and Anti-
Maidan Groups. Policy & Internet, 7(2), 121–158. http://doi.org/10.1002/poi3.91
About Vkontakte:
#1 Social Networking Website in Ukraine
26
source: http://en.wikipedia.org
@gruzd
Example:
VK Group User Interface – Posts, Likes, Comments…
…Discussion board, Links & Media Files
Data Collection
PRO1
Pro-Maidan
PRO2
Pro-Maidan
ANTI1
Anti-Maidan
ANTI2
Anti-Maidan
Num. of Nodes 141,542 96,402 60,506 69,029
Num. of Connections 338,344 221,452 280,678 192,273
• Data collection: 2 most popular (public) Pro-Maidan and Anti-Maidan groups
• Period: February 18 – May 25, 2015
• Used VK Public API
• Communities – information about groups and group members
• Wall – posts and comments
• Likes – “likes” that members and visitors leave on posts
• Friends – group members’ friendship relations
What can we learn from structures of friendship
networks?
Anti-EuroMaidan groupPro-EuroMaidan group
Subgroup 3
31@gruzd2014 EuroMaidan Revolution
Example: VK Group – Pro EuroMaidan
Marketing,
Spam
% of spammers among participants with friends is higher
than among all group members
Spam
accounts
5%
Spam
accounts
15%
Group
members
Members
w/friends
Reported & Estimated % of Bot Accounts
Fake
5%
Fake
2%
Fake
8%
Source: http://blogs.wsj.com/digits/2015/06/30/fake-accounts-still-plague-instagram-despite-purge-study-finds/
1.5B
users
300M
users
400M
users
… but is that everything?
Detecting
Bots…
Detecting Bots…
Photo
•Color & Edge
histograms
•Color & Edge
Directivity
Descriptor
(CEDD)
•Image
Similarity
Message
•Sensitive
words
•URL
•Duplicates
•#hashtags
•@replies
Poster
•Username
•Creation date
•Engagement
level
SocialNetwork
•# Friends
•# Following
•In/out degree
centrality
•Clustering
(Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
Example: Fake
Twitter account
@gruzd?
Not really!
Detecting Bots…
Photo
•Color & Edge
histograms
•Color & Edge
Directivity
Descriptor
(CEDD)
•Image
Similarity
Message
•Sensitive
words
•URL
•Duplicates
•#hashtags
•@replies
Poster
•Username
•Creation date
•Engagement
level
SocialNetwork
•# Friends
•# Following
•In/out degree
centrality
•Clustering
(Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
Detecting Bots…
Fakers like to post on Fridays!
Fake accounts Real accounts
Frequency of Twitter posts
(Gurajala et.al, 2015)
Detecting Bots…
Photo
•Color & Edge
histograms
•Color & Edge
Directivity
Descriptor
(CEDD)
•Image
Similarity
Message
•Sensitive
words
•URL
•Duplicates
•#hashtags
•@replies
Poster
•Username
•Creation date
•Engagement
level
SocialNetwork
•# Friends
•# Following
•In/out degree
centrality
•Clustering
(Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
Detecting Bots…
Photo
•Color & Edge
histograms
•Color & Edge
Directivity
Descriptor
(CEDD)
•Image
Similarity
Message
•Sensitive
words
•URL
•Duplicates
•#hashtags
•@replies
Poster
•Username
•Engagement
level
•Creation date
SocialNetwork
•# Friends
•# Following
•In/out degree
centrality
•Clustering
(Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
Detecting Bots…
Fake accounts tend to be created later in the week
Fakers Real accounts
Frequency of creation days for Twitter accounts
(Gurajala et.al, 2015)
Detecting Bots…
Photo
•Color & Edge
histograms
•Color & Edge
Directivity
Descriptor
(CEDD)
•Image
Similarity
Message
•Sensitive
words
•URL
•Duplicates
•#hashtags
•@replies
Poster
•Username
•Engagement
level
•Creation date
SocialNetwork
•# Friends
•# Following
•In/out degree
centrality
•Clustering
(Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
Detecting Bots…
Photo
•Color & Edge
histograms
•Color & Edge
Directivity
Descriptor
(CEDD)
•Image
Similarity
Message
•Sensitive
words
•URL
•Duplicates
•#hashtags
•@replies
Poster
•Username
•Engagement
level
•Creation date
SocialNetwork
•# Friends
•# Following
•In/out degree
centrality
•Clustering
(Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
Detecting Bots…
Using Social
Network
Analysis
Example:
A spam account
attempting to take over
a conference hashtag
Detecting
Bots…
to introduce these emerging
techniques to researchers
who are increasingly relying
on social media data as
their go-to data source!
The challenge is ...
© Chris Allen licensed under Creative Commons
Calling on computational researchers to
• Develop and share principles, protocols, tools and
techniques around handling and cleaning social media
data.
• Develop stronger partnerships with social science
researchers to start discussing how to handle bot-like
accounts properly.
• …because the nature of bots and their influence on users’
online behavior is not just a computational, but also social
science issue.
Questions to consider
Remove or Keep?
Scenario 1: Marketing-related
bots, if they do not interact
with anyone else in the study
group but just there to
increase their follower base
Scenario 2: Automated
Twitter accounts designed
to repost certain news
stories
Questions to consider
Remove or Keep?
Scenario 1: Marketing-related
bots, if they do not interact
with anyone else in the study
group but just there to
increase their follower base
Scenario 2: Automated
Twitter accounts designed
to repost Trump’s tweets
2017 #SMSociety Theme: Social Media for Social Good or Evil
https://socialmediaandsociety.org
Who are We Studying:
Bots or Humans?
Anatoliy Gruzd
gruzd@ryerson.ca
@gruzd
Canada Research Chair in Social Media Data Stewardship
Associate Professor, Ted Rogers School of Management
Director, Social Media Lab
Ryerson University
References
• Agichtein, E., Castillo, C., Donato, D., Gionis, A., & Mishne, G. (2008). Finding High-
quality Content in Social Media. In Proceedings of the 2008 International Conference
on Web Search and Data Mining (pp. 183–194). New York, NY, USA: ACM.
• Gruzd, A., & Roy, J. (2014). Investigating Political Polarization on Twitter: A Canadian
Perspective. Policy & Internet, 6(1), 28–45. http://doi.org/10.1002/1944-2866.POI354
• Gruzd, A., & Tsyganova, K. (2015). Information Wars and Online Activism During the
2013/2014 Crisis in Ukraine: Examining the Social Structures of Pro- and Anti-Maidan
Groups. Policy & Internet, 7(2), 121–158. http://doi.org/10.1002/poi3.91
• Grier, C., Thomas, K., Paxson, V., & Zhang, M. (2010). @spam: the underground on 140
characters or less (p. 27). ACM Press.
• Wang, A. H. (2010). Don’t follow me: Spam detection in Twitter. In Proceedings of the
2010 International Conference on Security and Cryptography (SECRYPT) (pp. 1–10). IEEE.
• Yardi, S., Romero, D., Schoenebeck, G., & Boyd, D. (2009). Detecting spam in a Twitter
network. First Monday, 15(1).

Más contenido relacionado

La actualidad más candente

CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)
Lora Aroyo
 
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time ContentVisualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
Samantha Finn
 
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
Broker Bots: Analyzing automated activity during High Impact Events on TwitterBroker Bots: Analyzing automated activity during High Impact Events on Twitter
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
Cybersecurity Education and Research Centre
 
#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...
#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...
#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...
Derek Weber
 
Classifying Twitter Content
Classifying Twitter ContentClassifying Twitter Content
Classifying Twitter Content
Stephen Dann
 

La actualidad más candente (20)

Research with Social Media Data: Stewardship & Ethical Considerations
Research with Social Media Data: Stewardship & Ethical ConsiderationsResearch with Social Media Data: Stewardship & Ethical Considerations
Research with Social Media Data: Stewardship & Ethical Considerations
 
CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)CrowdTruth @VU Faculty Colloquium (June 2015)
CrowdTruth @VU Faculty Colloquium (June 2015)
 
Social Web 2014: Final Presentations (Part I)
Social Web 2014: Final Presentations (Part I)Social Web 2014: Final Presentations (Part I)
Social Web 2014: Final Presentations (Part I)
 
20151001 charles university prague - marc smith - node xl-picturing political...
20151001 charles university prague - marc smith - node xl-picturing political...20151001 charles university prague - marc smith - node xl-picturing political...
20151001 charles university prague - marc smith - node xl-picturing political...
 
Shareworthiness and Motivated Reasoning in Hyper-Partisan News Sharing Behavi...
Shareworthiness and Motivated Reasoning in Hyper-Partisan News Sharing Behavi...Shareworthiness and Motivated Reasoning in Hyper-Partisan News Sharing Behavi...
Shareworthiness and Motivated Reasoning in Hyper-Partisan News Sharing Behavi...
 
Picturing the Social: Talk for Transforming Digital Methods Winter School
Picturing the Social: Talk for Transforming Digital Methods Winter SchoolPicturing the Social: Talk for Transforming Digital Methods Winter School
Picturing the Social: Talk for Transforming Digital Methods Winter School
 
A cross-national comparison of Twitter user interactions with leading politic...
A cross-national comparison of Twitter user interactions with leading politic...A cross-national comparison of Twitter user interactions with leading politic...
A cross-national comparison of Twitter user interactions with leading politic...
 
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time ContentVisualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
 
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
Broker Bots: Analyzing automated activity during High Impact Events on TwitterBroker Bots: Analyzing automated activity during High Impact Events on Twitter
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
 
Following, Mentioning, Sharing: A Search for Filter Bubbles in the Australian...
Following, Mentioning, Sharing: A Search for Filter Bubbles in the Australian...Following, Mentioning, Sharing: A Search for Filter Bubbles in the Australian...
Following, Mentioning, Sharing: A Search for Filter Bubbles in the Australian...
 
Identifying Influencers on Social Media Using Social Network Analysis
Identifying Influencers on Social Media Using Social Network AnalysisIdentifying Influencers on Social Media Using Social Network Analysis
Identifying Influencers on Social Media Using Social Network Analysis
 
#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...
#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...
#ArsonEmergency and Australia’s “Black Summer”: A study of polarisation and i...
 
"Hashtags as Spectacle: #bostonstrong and The Materiality of Metadata" (EGSA ...
"Hashtags as Spectacle: #bostonstrong and The Materiality of Metadata" (EGSA ..."Hashtags as Spectacle: #bostonstrong and The Materiality of Metadata" (EGSA ...
"Hashtags as Spectacle: #bostonstrong and The Materiality of Metadata" (EGSA ...
 
Presentation on netnography
Presentation on netnographyPresentation on netnography
Presentation on netnography
 
OpenThreads: The Community of OpenStreetMap Mailing List
OpenThreads: The Community of OpenStreetMap Mailing ListOpenThreads: The Community of OpenStreetMap Mailing List
OpenThreads: The Community of OpenStreetMap Mailing List
 
Infotainment and the Impact of Connective Action: The Case of #MilkedDry
Infotainment and the Impact of Connective Action: The Case of #MilkedDryInfotainment and the Impact of Connective Action: The Case of #MilkedDry
Infotainment and the Impact of Connective Action: The Case of #MilkedDry
 
Web Science 2010 slides
Web Science 2010 slidesWeb Science 2010 slides
Web Science 2010 slides
 
Dynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
Dynamics of a Scandal: The Centrelink Robodebt Affair on TwitterDynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
Dynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
 
Classifying Twitter Content
Classifying Twitter ContentClassifying Twitter Content
Classifying Twitter Content
 
Social media engagement
Social media engagementSocial media engagement
Social media engagement
 

Destacado

Application of data mining based malicious code detection techniques for dete...
Application of data mining based malicious code detection techniques for dete...Application of data mining based malicious code detection techniques for dete...
Application of data mining based malicious code detection techniques for dete...
UltraUploader
 
Love At First Sight
Love At First SightLove At First Sight
Love At First Sight
guestad7742
 
Love at first sight
Love at first sightLove at first sight
Love at first sight
supaporn2590
 
Identification and Analysis of Malicious Content on Facebook: A Survey
Identification and Analysis of Malicious Content on Facebook: A SurveyIdentification and Analysis of Malicious Content on Facebook: A Survey
Identification and Analysis of Malicious Content on Facebook: A Survey
Cybersecurity Education and Research Centre
 
TFSC #DISC2014 Special Issue cover content
TFSC #DISC2014 Special Issue cover   contentTFSC #DISC2014 Special Issue cover   content
TFSC #DISC2014 Special Issue cover content
Han Woo PARK
 

Destacado (11)

Some thoughts on slides and presenting
Some thoughts on slides and presentingSome thoughts on slides and presenting
Some thoughts on slides and presenting
 
Examining Polarization in Political Social Media: A Case of Twitter and the ...
Examining Polarization in Political Social Media:  A Case of Twitter and the ...Examining Polarization in Political Social Media:  A Case of Twitter and the ...
Examining Polarization in Political Social Media: A Case of Twitter and the ...
 
Application of data mining based malicious code detection techniques for dete...
Application of data mining based malicious code detection techniques for dete...Application of data mining based malicious code detection techniques for dete...
Application of data mining based malicious code detection techniques for dete...
 
Love At First Sight
Love At First SightLove At First Sight
Love At First Sight
 
Love at first sight
Love at first sightLove at first sight
Love at first sight
 
Identifying influential twitter_users
Identifying influential twitter_usersIdentifying influential twitter_users
Identifying influential twitter_users
 
Identification and Analysis of Malicious Content on Facebook: A Survey
Identification and Analysis of Malicious Content on Facebook: A SurveyIdentification and Analysis of Malicious Content on Facebook: A Survey
Identification and Analysis of Malicious Content on Facebook: A Survey
 
TFSC #DISC2014 Special Issue cover content
TFSC #DISC2014 Special Issue cover   contentTFSC #DISC2014 Special Issue cover   content
TFSC #DISC2014 Special Issue cover content
 
Culture Marketing - Brand Content That Matters
Culture Marketing - Brand Content That MattersCulture Marketing - Brand Content That Matters
Culture Marketing - Brand Content That Matters
 
Scott 김효동 네트워크분석 박한우 영남대 슬라이드
Scott 김효동 네트워크분석 박한우 영남대 슬라이드Scott 김효동 네트워크분석 박한우 영남대 슬라이드
Scott 김효동 네트워크분석 박한우 영남대 슬라이드
 
PID Signposting Pattern
PID Signposting PatternPID Signposting Pattern
PID Signposting Pattern
 

Similar a Who are We Studying: Humans or Bots?

Grounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methodsGrounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methods
Citizens in the Making
 
twitter_visual_chi2016
twitter_visual_chi2016twitter_visual_chi2016
twitter_visual_chi2016
Cat Yao
 

Similar a Who are We Studying: Humans or Bots? (20)

Studying Online & Offline Communities through the Prism of Social Media Data
Studying Online & Offline Communities through the Prism of Social Media DataStudying Online & Offline Communities through the Prism of Social Media Data
Studying Online & Offline Communities through the Prism of Social Media Data
 
Grounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methodsGrounded theory meets big data: One way to marry ethnography and digital methods
Grounded theory meets big data: One way to marry ethnography and digital methods
 
Challenges in-archiving-twitter
Challenges in-archiving-twitterChallenges in-archiving-twitter
Challenges in-archiving-twitter
 
Meyer Big Data SDP13
Meyer Big Data SDP13Meyer Big Data SDP13
Meyer Big Data SDP13
 
The evolution of research on social media
The evolution of research on social mediaThe evolution of research on social media
The evolution of research on social media
 
"Who is this redchanit?" Applying digital methods for issue mapping to one we...
"Who is this redchanit?" Applying digital methods for issue mapping to one we..."Who is this redchanit?" Applying digital methods for issue mapping to one we...
"Who is this redchanit?" Applying digital methods for issue mapping to one we...
 
User Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social NetworkUser Behaviour Pattern Recognition On Twitter Social Network
User Behaviour Pattern Recognition On Twitter Social Network
 
Researching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media AnalysisResearching Social Media – Big Data and Social Media Analysis
Researching Social Media – Big Data and Social Media Analysis
 
Accessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science KnowledgeAccessing and Using Big Data to Advance Social Science Knowledge
Accessing and Using Big Data to Advance Social Science Knowledge
 
Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?Filter Bubbles in the Australian Twittersphere?
Filter Bubbles in the Australian Twittersphere?
 
Digital Civic Engagement: Helping Students Find Their Voice
Digital Civic Engagement: Helping Students Find Their VoiceDigital Civic Engagement: Helping Students Find Their Voice
Digital Civic Engagement: Helping Students Find Their Voice
 
Social media data stewardship: The ethics of social media data use for research
Social media data stewardship: The ethics of social media data use for researchSocial media data stewardship: The ethics of social media data use for research
Social media data stewardship: The ethics of social media data use for research
 
twitter_visual_chi2016
twitter_visual_chi2016twitter_visual_chi2016
twitter_visual_chi2016
 
The Birth of Social Media Methods
The Birth of Social Media MethodsThe Birth of Social Media Methods
The Birth of Social Media Methods
 
Jf2516311637
Jf2516311637Jf2516311637
Jf2516311637
 
Jf2516311637
Jf2516311637Jf2516311637
Jf2516311637
 
Digital Tattoo Workshop for VPL
Digital Tattoo Workshop for VPLDigital Tattoo Workshop for VPL
Digital Tattoo Workshop for VPL
 
Data augmented ethnography: 
using big data and ethnography to explore candi...
Data augmented ethnography: 
using big data and ethnography  to explore candi...Data augmented ethnography: 
using big data and ethnography  to explore candi...
Data augmented ethnography: 
using big data and ethnography to explore candi...
 
Online radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directionsOnline radicalisation: work, challenges and future directions
Online radicalisation: work, challenges and future directions
 
2015 pdf-marc smith-node xl-social media sna
2015 pdf-marc smith-node xl-social media sna2015 pdf-marc smith-node xl-social media sna
2015 pdf-marc smith-node xl-social media sna
 

Más de Toronto Metropolitan University

The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
Toronto Metropolitan University
 
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
Toronto Metropolitan University
 
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
Toronto Metropolitan University
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
Toronto Metropolitan University
 
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job ApplicantsYou're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
Toronto Metropolitan University
 
Sampling and recruiting on Facebook
Sampling and recruiting on FacebookSampling and recruiting on Facebook
Sampling and recruiting on Facebook
Toronto Metropolitan University
 

Más de Toronto Metropolitan University (16)

The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
 
Computational Approaches to Studying Anti-Social Behaviour on Social Media
Computational Approaches to Studying Anti-Social Behaviour on Social MediaComputational Approaches to Studying Anti-Social Behaviour on Social Media
Computational Approaches to Studying Anti-Social Behaviour on Social Media
 
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
The Role of Open Access & Social Media in Knowledge Mobilization and DiscoveryThe Role of Open Access & Social Media in Knowledge Mobilization and Discovery
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
 
Examining toxic interactions and political engagement on Twitter
Examining toxic interactions and political engagement on TwitterExamining toxic interactions and political engagement on Twitter
Examining toxic interactions and political engagement on Twitter
 
Who is Influencing the #GDPR Discussion on Twitter: Implications for Public ...
Who is Influencing the #GDPR Discussion on Twitter:  Implications for Public ...Who is Influencing the #GDPR Discussion on Twitter:  Implications for Public ...
Who is Influencing the #GDPR Discussion on Twitter: Implications for Public ...
 
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
 
Research & Teaching in the Social Media Age
Research & Teaching in the Social Media Age Research & Teaching in the Social Media Age
Research & Teaching in the Social Media Age
 
Social Media for Informal Learning: a Case of #Twitterstorians
Social Media for Informal Learning: a Case of #TwitterstoriansSocial Media for Informal Learning: a Case of #Twitterstorians
Social Media for Informal Learning: a Case of #Twitterstorians
 
The State of Social Media Research After Cambridge Analytica
The State of Social Media Research After Cambridge AnalyticaThe State of Social Media Research After Cambridge Analytica
The State of Social Media Research After Cambridge Analytica
 
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
 
From 13 Reasons Why to Suicide Watch: Reddit Discussions about the Controvers...
From 13 Reasons Why to Suicide Watch: Reddit Discussions about the Controvers...From 13 Reasons Why to Suicide Watch: Reddit Discussions about the Controvers...
From 13 Reasons Why to Suicide Watch: Reddit Discussions about the Controvers...
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Learning Analytics Dashboard for Twitter
Learning Analytics Dashboard for TwitterLearning Analytics Dashboard for Twitter
Learning Analytics Dashboard for Twitter
 
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job ApplicantsYou're Hired: Examining Acceptance of Social Media Screening of Job Applicants
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
 
Examining Sentiments and Popularity of Pro- and Anti-Vaccination Videos on Yo...
Examining Sentiments and Popularity of Pro- and Anti-Vaccination Videos on Yo...Examining Sentiments and Popularity of Pro- and Anti-Vaccination Videos on Yo...
Examining Sentiments and Popularity of Pro- and Anti-Vaccination Videos on Yo...
 
Sampling and recruiting on Facebook
Sampling and recruiting on FacebookSampling and recruiting on Facebook
Sampling and recruiting on Facebook
 

Último

Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure
 
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
anilsa9823
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Masudpur
Delhi  99530 vip 56974  Genuine Escort Service Call Girls in MasudpurDelhi  99530 vip 56974  Genuine Escort Service Call Girls in Masudpur
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Masudpur
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
anilsa9823
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
ednyonat
 

Último (20)

O9654467111 Call Girls In Dwarka Women Seeking Men
O9654467111 Call Girls In Dwarka Women Seeking MenO9654467111 Call Girls In Dwarka Women Seeking Men
O9654467111 Call Girls In Dwarka Women Seeking Men
 
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
Night 7k Call Girls Noida New Ashok Nagar Escorts Call Me: 8448380779
 
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Rohini Sector 35 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
Production diary Film the city powerpoint
Production diary Film the city powerpointProduction diary Film the city powerpoint
Production diary Film the city powerpoint
 
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncrCall Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
Call Girls In Gurgaon Dlf pHACE 2 Women Delhi ncr
 
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
Night 7k Call Girls Noida Sector 121 Call Me: 8448380779
 
Your LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence PackageYour LinkedIn Makeover: Sociocosmos Presence Package
Your LinkedIn Makeover: Sociocosmos Presence Package
 
Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"
Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"
Craft Your Legacy: Invest in YouTube Presence from Sociocosmos"
 
Night 7k Call Girls Atta Market Escorts Call Me: 8448380779
Night 7k Call Girls Atta Market Escorts Call Me: 8448380779Night 7k Call Girls Atta Market Escorts Call Me: 8448380779
Night 7k Call Girls Atta Market Escorts Call Me: 8448380779
 
This is a Powerpoint about research into the codes and conventions of a film ...
This is a Powerpoint about research into the codes and conventions of a film ...This is a Powerpoint about research into the codes and conventions of a film ...
This is a Powerpoint about research into the codes and conventions of a film ...
 
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
Top Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash PaymentTop Call Girls In Telibagh ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment
Top Call Girls In Telibagh ( Lucknow ) 🔝 8923113531 🔝 Cash Payment
 
SELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANYSELECTING A SOCIAL MEDIA MARKETING COMPANY
SELECTING A SOCIAL MEDIA MARKETING COMPANY
 
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCRElite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
Elite Class ➥8448380779▻ Call Girls In Nizammuddin Delhi NCR
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Masudpur
Delhi  99530 vip 56974  Genuine Escort Service Call Girls in MasudpurDelhi  99530 vip 56974  Genuine Escort Service Call Girls in Masudpur
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Masudpur
 
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
Night 7k Call Girls Noida Sector 120 Call Me: 8448380779
 
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
CALL ON ➥8923113531 🔝Call Girls Ashiyana Colony Lucknow best sexual service O...
 
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
Night 7k Call Girls Pari Chowk Escorts Call Me: 8448380779
 
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptxDickinsonSlides teeeeeeeeeeessssssssssst.pptx
DickinsonSlides teeeeeeeeeeessssssssssst.pptx
 
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic HappensIgnite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
Ignite Your Online Influence: Sociocosmos - Where Social Media Magic Happens
 
Film show post-production powerpoint for site
Film show post-production powerpoint for siteFilm show post-production powerpoint for site
Film show post-production powerpoint for site
 

Who are We Studying: Humans or Bots?

  • 1. Who are We Studying: Bots or Humans? Anatoliy Gruzd gruzd@ryerson.ca @gruzd Canada Research Chair in Social Media Data Stewardship Associate Professor, Ted Rogers School of Management Director, Social Media Lab Ryerson University
  • 2. Research at the Social Media Lab • Social Media Analytics • Social Media Data Stewardship • Networked Influence • Online Political Engagement • Learning Analytics • Social Media & Health
  • 3. Outline • Social Media Analytics • The Rise of the Bots • Case Study: Social Media Use during the 2014 EuroMaidan Revolution in Ukraine • Detecting Bots • Next steps
  • 6. Cloud & Distributed Computing Data & Information Organization Analytics Visualization + More Tools for Big Data Analytics
  • 7. Data -> Visualizations -> Understanding How to Make Sense of Social Media Data? Twitter: @gruzdAnatoliy Gruzd 7
  • 8. How to Make Sense of Social Media Data? Example: Geo-based Analysis Twitter: @gruzdAnatoliy Gruzd 8
  • 9. How to Make Sense of Social Media Data? Example: Geo-based + Content Analysis Tracking Hate Speech on Twitter Twitter: @gruzd Anatoliy Gruzd 9 Source: http://www.fenuxe.com/tag/geo-coded
  • 10. The Rise of Social Bots • Who are we studying: Humans or Bots? Social Media Data Analytics Challenges
  • 11. Social Bot – software designed to act on the Internet with some level of autonomy
  • 12. Different Types of Bots Free music, games, books, downloads Jewelery, electronics, vehicles Contest, gambling, prizes Finance, loans, realty Increase Twitter following DietAdult (Grier et al, 2010)
  • 14. Social Bot Example: Microsoft’s AI Twitter chatbot
  • 15. Social Bot Example: Microsoft’s AI Twitter chatbot
  • 16.
  • 17. Platform-reported & Estimated % of Bots Fake 5% Fake 2% Fake 8% Source: http://blogs.wsj.com/digits/2015/06/30/fake-accounts-still-plague-instagram-despite-purge-study-finds/ 1.5B users 300M users 400M users … but is that everything? (30,000,000)(15,000,000) (32,000,000)
  • 18. Why does it matter if there are bots, spammers and fakers in our datasets? Popular topics mentioned in the 14,500 abstracts of journal & conference papers on “social media” or “social networking websites” published since 1999 (Gruzd, 2015)
  • 19. Why does it matter if there are bots, spammers and fakers in our datasets? How many of these 14,500 papers took into account the presence and influence of bots, spammers or fakers ? (Gruzd, 2015)
  • 20. Case Study: 2014 EuroMaidan Revolution in Ukraine "2014-02-21 11-04 Euromaidan in Kiev" by Amakuha. Licensed under CC BY-SA 3.0 via Wikimedia November 21, 2013 - Ukraine gov. suspended the trade & association agreement with EU Gruzd, A., & Tsyganova, K. (2015). Information Wars and Online Activism During the 2013/2014 Crisis in Ukraine: Examining the Social Structures of Pro- and Anti- Maidan Groups. Policy & Internet, 7(2), 121–158. http://doi.org/10.1002/poi3.91
  • 21. About Vkontakte: #1 Social Networking Website in Ukraine 26 source: http://en.wikipedia.org @gruzd
  • 22. Example: VK Group User Interface – Posts, Likes, Comments… …Discussion board, Links & Media Files
  • 23. Data Collection PRO1 Pro-Maidan PRO2 Pro-Maidan ANTI1 Anti-Maidan ANTI2 Anti-Maidan Num. of Nodes 141,542 96,402 60,506 69,029 Num. of Connections 338,344 221,452 280,678 192,273 • Data collection: 2 most popular (public) Pro-Maidan and Anti-Maidan groups • Period: February 18 – May 25, 2015 • Used VK Public API • Communities – information about groups and group members • Wall – posts and comments • Likes – “likes” that members and visitors leave on posts • Friends – group members’ friendship relations
  • 24. What can we learn from structures of friendship networks? Anti-EuroMaidan groupPro-EuroMaidan group
  • 25. Subgroup 3 31@gruzd2014 EuroMaidan Revolution Example: VK Group – Pro EuroMaidan Marketing, Spam % of spammers among participants with friends is higher than among all group members Spam accounts 5% Spam accounts 15% Group members Members w/friends
  • 26. Reported & Estimated % of Bot Accounts Fake 5% Fake 2% Fake 8% Source: http://blogs.wsj.com/digits/2015/06/30/fake-accounts-still-plague-instagram-despite-purge-study-finds/ 1.5B users 300M users 400M users … but is that everything?
  • 28. Detecting Bots… Photo •Color & Edge histograms •Color & Edge Directivity Descriptor (CEDD) •Image Similarity Message •Sensitive words •URL •Duplicates •#hashtags •@replies Poster •Username •Creation date •Engagement level SocialNetwork •# Friends •# Following •In/out degree centrality •Clustering (Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
  • 30. Detecting Bots… Photo •Color & Edge histograms •Color & Edge Directivity Descriptor (CEDD) •Image Similarity Message •Sensitive words •URL •Duplicates •#hashtags •@replies Poster •Username •Creation date •Engagement level SocialNetwork •# Friends •# Following •In/out degree centrality •Clustering (Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
  • 31. Detecting Bots… Fakers like to post on Fridays! Fake accounts Real accounts Frequency of Twitter posts (Gurajala et.al, 2015)
  • 32. Detecting Bots… Photo •Color & Edge histograms •Color & Edge Directivity Descriptor (CEDD) •Image Similarity Message •Sensitive words •URL •Duplicates •#hashtags •@replies Poster •Username •Creation date •Engagement level SocialNetwork •# Friends •# Following •In/out degree centrality •Clustering (Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
  • 33. Detecting Bots… Photo •Color & Edge histograms •Color & Edge Directivity Descriptor (CEDD) •Image Similarity Message •Sensitive words •URL •Duplicates •#hashtags •@replies Poster •Username •Engagement level •Creation date SocialNetwork •# Friends •# Following •In/out degree centrality •Clustering (Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
  • 34. Detecting Bots… Fake accounts tend to be created later in the week Fakers Real accounts Frequency of creation days for Twitter accounts (Gurajala et.al, 2015)
  • 35. Detecting Bots… Photo •Color & Edge histograms •Color & Edge Directivity Descriptor (CEDD) •Image Similarity Message •Sensitive words •URL •Duplicates •#hashtags •@replies Poster •Username •Engagement level •Creation date SocialNetwork •# Friends •# Following •In/out degree centrality •Clustering (Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
  • 36. Detecting Bots… Photo •Color & Edge histograms •Color & Edge Directivity Descriptor (CEDD) •Image Similarity Message •Sensitive words •URL •Duplicates •#hashtags •@replies Poster •Username •Engagement level •Creation date SocialNetwork •# Friends •# Following •In/out degree centrality •Clustering (Yardi et al, ‘09; Grier et al, ‘10; Wang, ‘10; Jin et al, ‘11)
  • 37. Detecting Bots… Using Social Network Analysis Example: A spam account attempting to take over a conference hashtag
  • 38.
  • 40. to introduce these emerging techniques to researchers who are increasingly relying on social media data as their go-to data source! The challenge is ... © Chris Allen licensed under Creative Commons
  • 41. Calling on computational researchers to • Develop and share principles, protocols, tools and techniques around handling and cleaning social media data. • Develop stronger partnerships with social science researchers to start discussing how to handle bot-like accounts properly. • …because the nature of bots and their influence on users’ online behavior is not just a computational, but also social science issue.
  • 42. Questions to consider Remove or Keep? Scenario 1: Marketing-related bots, if they do not interact with anyone else in the study group but just there to increase their follower base Scenario 2: Automated Twitter accounts designed to repost certain news stories
  • 43. Questions to consider Remove or Keep? Scenario 1: Marketing-related bots, if they do not interact with anyone else in the study group but just there to increase their follower base Scenario 2: Automated Twitter accounts designed to repost Trump’s tweets
  • 44. 2017 #SMSociety Theme: Social Media for Social Good or Evil https://socialmediaandsociety.org
  • 45. Who are We Studying: Bots or Humans? Anatoliy Gruzd gruzd@ryerson.ca @gruzd Canada Research Chair in Social Media Data Stewardship Associate Professor, Ted Rogers School of Management Director, Social Media Lab Ryerson University
  • 46. References • Agichtein, E., Castillo, C., Donato, D., Gionis, A., & Mishne, G. (2008). Finding High- quality Content in Social Media. In Proceedings of the 2008 International Conference on Web Search and Data Mining (pp. 183–194). New York, NY, USA: ACM. • Gruzd, A., & Roy, J. (2014). Investigating Political Polarization on Twitter: A Canadian Perspective. Policy & Internet, 6(1), 28–45. http://doi.org/10.1002/1944-2866.POI354 • Gruzd, A., & Tsyganova, K. (2015). Information Wars and Online Activism During the 2013/2014 Crisis in Ukraine: Examining the Social Structures of Pro- and Anti-Maidan Groups. Policy & Internet, 7(2), 121–158. http://doi.org/10.1002/poi3.91 • Grier, C., Thomas, K., Paxson, V., & Zhang, M. (2010). @spam: the underground on 140 characters or less (p. 27). ACM Press. • Wang, A. H. (2010). Don’t follow me: Spam detection in Twitter. In Proceedings of the 2010 International Conference on Security and Cryptography (SECRYPT) (pp. 1–10). IEEE. • Yardi, S., Romero, D., Schoenebeck, G., & Boyd, D. (2009). Detecting spam in a Twitter network. First Monday, 15(1).