Cornelius Puschmann, Humboldt-Universität zu Berlin
Jean Burgess, Queensland University of Technology
Axel Bruns, Queensland University of Technology
Merja Mahrt, Heinrich-Heine-Universität Düsseldorf

Data Access, Ownership and Control in Social Web Services:
Issues for Twitter Research

ICA 2012
Track: Communication and Technology
Session: Researching Social Media: Ethical and Methodological Challenges
26 May 2012, Phoenix

“There are also significant questions of truth, control, and
power in Big Data studies: researchers have the tools and the
access, while social media users as a whole do not. Their data
were created in highly context-sensitive spaces, and it is entirely
possible that some users would not give permission for their
data to be used elsewhere.”
(boyd & Crawford, 2012, p.12)
#1
Access, control, ownership and interpretation of data are interrelated facets
that raise questions of power.

#2
Market, legislation, social norms and code are dynamic regulatory forces in
social web platforms.
Access (technology)                          Control (ability)

TOS (“law”)    defines      Data      enables    API (“code”)

Ownership (law)                   Interpretation (competence)
Twitter
• founded in 2006 by Jack Dorsey
• 140 million active users
• 340 million tweets per day
• source of real-time information on a breadth of issues
  from pop culture to politics
• increasingly used as a data source among researchers
  (e.g. on election prediction via Twitter: Tumasjan et al., 2010;
  Jungherr et al., 2011; Gayo-Avello, 2012)
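
To put the volume figures above in perspective, here is a back-of-the-envelope calculation using only the two numbers given on this slide. The 1% figure comes from the closing slide's mention of a “1% random sample”; reading it as one percent of all daily tweets is an assumption, and all variable names are illustrative.

```python
# Rough arithmetic on the slide's 2012 figures; not an official statistic.
ACTIVE_USERS = 140_000_000      # active users, per the slide
TWEETS_PER_DAY = 340_000_000    # tweets per day, per the slide
SAMPLE_PERCENT = 1              # assumption: the sample is 1% of all tweets

SECONDS_PER_DAY = 24 * 60 * 60

tweets_per_second = TWEETS_PER_DAY / SECONDS_PER_DAY
tweets_per_user_per_day = TWEETS_PER_DAY / ACTIVE_USERS
# Integer arithmetic avoids floating-point rounding in the sample size.
sample_tweets_per_day = TWEETS_PER_DAY * SAMPLE_PERCENT // 100

print(f"average tweets per second: {tweets_per_second:,.0f}")
print(f"average tweets per active user per day: {tweets_per_user_per_day:.2f}")
print(f"tweets per day in a {SAMPLE_PERCENT}% sample: {sample_tweets_per_day:,}")
```

Under these assumptions, even a 1% sample would amount to several million tweets per day, which gives a sense of the computing resources involved.
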
• Twitter’s (future) business model is based on advertising
• ad revenue of $260 million in 2012
• sources of revenue:
  • promoted accounts
  • promoted tweets
  • promoted trends
Twitter Rules: “Don’t do what gets us into trouble”
Terms of Service: “What’s yours is yours (but also ours)”
API Rules: “...but only if you know how to get it”
The TOS
“By submitting, posting or displaying Content on or through
the Services, you grant us a worldwide, non-exclusive,
royalty-free license (with the right to sublicense) to use,
copy, reproduce, process, adapt, modify, publish, transmit,
display and distribute such Content in any and all media or
distribution methods (now known or later developed).”

“You agree that this license includes the right for Twitter to
make such Content available to other companies,
organizations or individuals who partner with Twitter for
the syndication, broadcast, distribution or publication of
such Content on other media and services, subject to our
terms and conditions for such Content use.”

“We encourage and permit broad re-use of
Content. The Twitter API exists to enable this.”
API Rules
“You will not attempt or encourage others to: sell, rent,
lease, sublicense, redistribute, or syndicate access to the
Twitter API or Twitter Content to any third party without
prior written approval from Twitter. If you provide an API
that returns Twitter data, you may only return IDs (including
tweet IDs and user IDs). You may export or extract
non-programmatic, GUI-driven Twitter Content as a PDF or
spreadsheet by using "save as" or similar functionality.
Exporting Twitter Content to a datastore as a service or
other cloud based service, however, is not permitted.”

“Except as permitted through the Services (or these Terms),
you have to use the Twitter API if you want to reproduce,
modify, create derivative works, distribute, sell, transfer,
publicly display, publicly perform, transmit, or otherwise use
the Content or Services.”
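
The “IDs only” rule quoted above has a practical consequence for researchers who want to pass on a collected dataset: tweet text and metadata stay behind, and only identifiers are shared. The following is a minimal sketch of that reduction step, assuming tweets have already been collected as dictionaries. The field names `id` and `user_id` and the function name `to_shareable_ids` are illustrative assumptions, not taken from the quoted rules, and whether a given dataset release complies with the terms is not something this sketch can settle.

```python
def to_shareable_ids(tweets):
    """Reduce collected tweet records to tweet ID / user ID pairs.

    `tweets` is an iterable of dicts; the keys "id" and "user_id" are
    hypothetical field names chosen for this sketch.
    """
    seen = set()
    shareable = []
    for tweet in tweets:
        pair = (tweet["id"], tweet["user_id"])
        if pair not in seen:  # skip records that were collected twice
            seen.add(pair)
            shareable.append({"tweet_id": pair[0], "user_id": pair[1]})
    return shareable


# Made-up example records; the text field is deliberately discarded.
collected = [
    {"id": 101, "user_id": 7, "text": "first example tweet"},
    {"id": 102, "user_id": 8, "text": "second example tweet"},
    {"id": 101, "user_id": 7, "text": "first example tweet"},
]

for row in to_shareable_ids(collected):
    print(row)
```

Anyone receiving such a list would still need their own API access to retrieve the content behind the IDs, which ties back to the question of who can actually get at the data.
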
The APIs

Search API
• similar to site search functionality
• originally a third-party product
• rate-limited
• use of Streaming API for high-velocity queries is recommended

REST API
• allows interaction with Twitter similar to an individual user (“core” data)
• rate-limited
• whitelisting was previously possible, now discontinued

Streaming API
• real-time access to information moving through Twitter
• for developers with “data-intensive needs”
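
Both the Search API and the REST API are described above as rate-limited. One common client-side way to stay under such a limit is a sliding-window counter, sketched below. The limit of 3 requests per 10 seconds is an invented value for illustration (the slide gives no numbers), and the class name `SlidingWindowLimiter` is hypothetical. A clock function is injected so the behaviour can be checked without actually waiting.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_requests` calls within any `window_seconds` span."""

    def __init__(self, max_requests, window_seconds, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock
        self.sent = deque()  # timestamps of recent requests, oldest first

    def wait_time(self):
        """Seconds to wait before the next request is allowed (0.0 if none)."""
        now = self.clock()
        # Forget requests that have already left the window.
        while self.sent and now - self.sent[0] >= self.window_seconds:
            self.sent.popleft()
        if len(self.sent) < self.max_requests:
            return 0.0
        return self.window_seconds - (now - self.sent[0])

    def record(self):
        """Note that a request is being sent now."""
        self.sent.append(self.clock())


# Demonstration with a fake clock and invented limits.
fake_now = [0.0]
limiter = SlidingWindowLimiter(3, 10, clock=lambda: fake_now[0])
for _ in range(3):
    limiter.record()
print(limiter.wait_time())  # window is full, so a wait is required
fake_now[0] = 4.0
print(limiter.wait_time())  # less waiting is needed as time passes
fake_now[0] = 10.0
print(limiter.wait_time())  # the oldest requests have expired
```

In a real collection script the caller would sleep for `wait_time()` seconds before each request and call `record()` when sending it; the actual limits would have to come from the API's own documentation.
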
Intermediaries of Data

• Twitter doesn’t look to analytics as a source of revenue
• providing data is costly in terms of computing resources
• analytics are left to companies like Gnip and Datasift
• these data resellers have little to gain by catering to the
  scientific community or Twitter’s users
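Actors and Options

[Table: which kinds of data each actor can access; the cell contents are not recoverable from the slide text]
Actors (columns): Twitter; data reseller (Gnip, Datasift); large data interpreter (organization); small data interpreter (individual); individual user
Data types (rows): log data; historical data; real-time data (all); real-time data (sample)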
Conclusions
• the exact sample size and quality of any data from
  Twitter are unknown (see e.g. Gnip’s PowerTrack)
• TOS and API regulate access to Twitter data for
  different actors (users, researchers) on different
  levels (access, control, ownership, interpretation)
• for users, the API is the only point of access to
  “their” data apart from the web interface
• the implicit audience for virtually all services built on
  Twitter data consists of companies
• both users and scholars lacking access to
  high-performance computing infrastructure are likely to
  be sidelined by the trend towards Big Twitter Data
Images retrieved from Twitter 1% random sample

Thank you for your attention!

Contact: Cornelius Puschmann
puschmann@ibi.hu-berlin.de / @coffee001
