SlideShare una empresa de Scribd logo
1 de 19
Future Data Collection

CARMA Internet Research Module
         Jeff Stanton
Several Promising Environments and
                Techniques
•   Visual Surveys
•   Audio and video interviewing
•   Virtual Worlds
•   Web scraping
•   Network extraction and mapping
•   Polls Everywhere
•   Mobility
Visual Surveys
Visual DNA: http://www.visualdna.com/
Also try: http://www.youniverse.com/
Visual Surveys
Provides an engaging alternative to text-based
  surveys; more fun for respondents
Requires considerable set-up time; each screen
  is like an item; each picture is like an item
  response; every item and response must be
  keyed against one or more criteria
  Example: Previous page, “How do you approach
    stress,” could be keyed against other subjective or
    objective measures of stress, coping, general
    health, immune response, etc.
Audio and Video Interviewing
Methods: Structured, semi-
  structured, and unstructured
  interviewing; focus groups
Products: Skype, WebEx, Adobe
  Connect, Cisco Telepresence
Advantages: Reduced travel costs,
  speed
Disadvantages: High bandwidth,
  user technology requirements,
  unreliable connections
Virtual Worlds
VastPark: http://www.vastpark.com/
Virtual Worlds
Combine text and audio chat with social networking and
  3D model building
Methods: Structured, semi-structured, and unstructured
  interviewing; focus groups; unobtrusive observation;
  participant observation; possibly some experimental
  perceptual, cooperative, or navigational tasks
Products: VastPark, OpenSim, EduSim, Teleplace
Advantages: Speed, low cost
Disadvantages: Steep learning curve; high bandwidth;
  user technology requirements; unreliable connections
Web Scraping
Web Scraping
Retrieval and processing of text or images, e.g., from
  blogs; processing may include semantic analysis of
  people, events, emotions
Methods: Archival document analysis
Products: 100s of commercial, mainly focused on brand,
  reputation, marketing; open source product:
  WebHarvest
Advantages: Data are plentiful and cover a wide range of
  topics
Disadvantages: Technology hard to master; even after
  considerable automated processing, analysis has an
  intensive, qualitative flavor
Make a Wordcloud with Twitter and R
• Download R, the open source statistical platform;
  for more fun, also download R-Studio; both
  available for Windows, Mac, and Linux
• You will need four packages to make a word
  cloud: twitteR, stringr, tm, and wordcloud
  – Use install.packages() and library() commands to
    prepare packages for use in R
• Code appears on the following page; explanation
  is in my free eBook, Introduction to Data Science
  on the iTunes Bookstore
# TweetFrame() - Return a dataframe based on a search of Twitter
TweetFrame<-function(searchTerm, maxTweets)
{
  tweetList <- searchTwitter(searchTerm, n=maxTweets)
  tweetDF<- do.call("rbind", lapply(tweetList,as.data.frame))

 # This last step sorts the tweets in arrival order
 return(tweetDF[order(as.integer(tweetDF$created)), ])
}
# CleanTweets() - Takes the junk out of a vector of tweet texts
CleanTweets<-function(tweets)
{
  tweets <- str_replace_all(tweets," "," ")
  tweets <- str_replace_all(tweets, + "http://t.co/[a-z,A-Z,0-9]{8}","")
  tweets <- str_replace(tweets,"RT @[a-z,A-Z]*: ","")
  tweets <- str_replace_all(tweets,"#[a-z,A-Z]*","")
  tweets <- str_replace_all(tweets,"@[a-z,A-Z]*","")
  return(tweets)
}

# Command line code
tweetDF <- TweetFrame(”#yourhashtag",100)
cleanText<-CleanTweets(tweetDF$text)
tweetCorpus<-Corpus(VectorSource(cleanText))
tweetTDM<-TermDocumentMatrix(tweetCorpus)
tdMatrix <- as.matrix(tweetTDM)
sortedMatrix<-sort(rowSums(tdMatrix), decreasing=TRUE)
cloudFrame<-data.frame( word=names(sortedMatrix),freq=sortedMatrix)
wordcloud(cloudFrame$word,cloudFrame$freq)
Example Wordcloud: Hashtag “#solar”
Network Mapping
Mapping Social Networks
Nicholas Christakis of the Framingham Heart Study has shown
   the power of social networks to influence a variety of
   health outcomes
Methods: Traditional self-report & objective measures;
   topographical measures such as network centrality;
   “neighbor” measures
Products: Depends on data types; TouchGraph is a network
   web search engine; InFlow; UCInet; See:
   http://en.wikipedia.org/wiki/Social_network_analysis_soft
   ware
Advantages: Meaningful improvement in predictive capability
Disadvantages: Intensive technique requires careful planning
   and setup; data collection difficult and time consuming
Facebook Polls
Embedded Polls
Collection of short-format survey data from social
  networking and membership sites
Methods: Primarily standard, closed-ended self-
  report; single item scales
Products: Example: Vizu provides a “widget” that
  allows embedding of polls on Facebook pages
Advantages: Quick, cheap, possible to get a large
  sample in a short time
Disadvantages: Difficult to control access, short
  format limits use of multi-item scales
Mobility
http://www.surveyonthespot.com/
Data Collection from Mobile Devices
Using smartphones and other mobile devices as a basis
  for interacting with participants
Methods: Primarily self-report but can include location
  and movement data
Products: Example: Survey On The Spot allows location
  aware surveys to be delivered to smart phones;
  TrailGuru collects route data from hikers and joggers
Advantages: Platform is becoming ubiquitous, location
  data provides new options for understanding behavior
Disadvantages: Privacy issues, small screen, complex
  programming interfaces
iPhone Fun
Reaching Mobile Participants
• Micropayment system built
  into the platform
• Feasible for short
  instruments
• Can be tied to particular
  experiences, e.g., museum
  visits
• Responses can be
  geotagged to support
  mapping

Más contenido relacionado

Similar a Carma internet research module: Future data collection

Remote Working in a 2.0 World
Remote Working in a 2.0 WorldRemote Working in a 2.0 World
Remote Working in a 2.0 WorldMarieke Guy
 
Overview Of Network Analysis Platforms
Overview Of Network Analysis PlatformsOverview Of Network Analysis Platforms
Overview Of Network Analysis PlatformsNoah Flower
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringTao Xie
 
Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...Merlien Institute
 
Overview Of Network Mapping And Analysis Platforms
Overview Of Network Mapping And Analysis PlatformsOverview Of Network Mapping And Analysis Platforms
Overview Of Network Mapping And Analysis Platformsdianascearce
 
2008 10 21 Top Ten Tech Tools Agents E Xtension
2008 10 21 Top Ten Tech Tools Agents E Xtension2008 10 21 Top Ten Tech Tools Agents E Xtension
2008 10 21 Top Ten Tech Tools Agents E Xtensiondkp205
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesDaniel S. Katz
 
Cloud computing higdon_scott_conway
Cloud computing higdon_scott_conwayCloud computing higdon_scott_conway
Cloud computing higdon_scott_conwayPrerna Agarwal
 
Digital Practices - introductions
Digital Practices - introductionsDigital Practices - introductions
Digital Practices - introductionsprisca schmarsow
 
E Learning Management System By Tuhin Roy Using PHP
E Learning Management System By Tuhin Roy Using PHPE Learning Management System By Tuhin Roy Using PHP
E Learning Management System By Tuhin Roy Using PHPTuhin Ray
 
Big data analytic platform
Big data analytic platformBig data analytic platform
Big data analytic platformJesse Wang
 
ACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignAmanda Dinscore
 
Remote usability testing and remote user research for usability
Remote usability testing and remote user research for usabilityRemote usability testing and remote user research for usability
Remote usability testing and remote user research for usabilityUser Vision
 
KISD Board Presentation November 18 2008
KISD Board Presentation November 18 2008KISD Board Presentation November 18 2008
KISD Board Presentation November 18 2008Joel Adkins
 

Similar a Carma internet research module: Future data collection (20)

Remote Working in a 2.0 World
Remote Working in a 2.0 WorldRemote Working in a 2.0 World
Remote Working in a 2.0 World
 
Overview Of Network Analysis Platforms
Overview Of Network Analysis PlatformsOverview Of Network Analysis Platforms
Overview Of Network Analysis Platforms
 
nonprof2007.ppt
nonprof2007.pptnonprof2007.ppt
nonprof2007.ppt
 
Chapter 7)
Chapter 7)Chapter 7)
Chapter 7)
 
Software Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software EngineeringSoftware Analytics: Data Analytics for Software Engineering
Software Analytics: Data Analytics for Software Engineering
 
Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...Choosing the right software for your research study : an overview of leading ...
Choosing the right software for your research study : an overview of leading ...
 
Overview Of Network Mapping And Analysis Platforms
Overview Of Network Mapping And Analysis PlatformsOverview Of Network Mapping And Analysis Platforms
Overview Of Network Mapping And Analysis Platforms
 
2008 10 21 Top Ten Tech Tools Agents E Xtension
2008 10 21 Top Ten Tech Tools Agents E Xtension2008 10 21 Top Ten Tech Tools Agents E Xtension
2008 10 21 Top Ten Tech Tools Agents E Xtension
 
Remote Workers
Remote WorkersRemote Workers
Remote Workers
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
Web Information Systems Introduction and Origin of World Wide Web
Web Information Systems Introduction and Origin of World Wide WebWeb Information Systems Introduction and Origin of World Wide Web
Web Information Systems Introduction and Origin of World Wide Web
 
Cloud computing higdon_scott_conway
Cloud computing higdon_scott_conwayCloud computing higdon_scott_conway
Cloud computing higdon_scott_conway
 
Digital Practices - introductions
Digital Practices - introductionsDigital Practices - introductions
Digital Practices - introductions
 
E Learning Management System By Tuhin Roy Using PHP
E Learning Management System By Tuhin Roy Using PHPE Learning Management System By Tuhin Roy Using PHP
E Learning Management System By Tuhin Roy Using PHP
 
29.4 mb
29.4 mb29.4 mb
29.4 mb
 
29.4 Mb
29.4 Mb29.4 Mb
29.4 Mb
 
Big data analytic platform
Big data analytic platformBig data analytic platform
Big data analytic platform
 
ACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web DesignACRL 2011 Data-Driven Library Web Design
ACRL 2011 Data-Driven Library Web Design
 
Remote usability testing and remote user research for usability
Remote usability testing and remote user research for usabilityRemote usability testing and remote user research for usability
Remote usability testing and remote user research for usability
 
KISD Board Presentation November 18 2008
KISD Board Presentation November 18 2008KISD Board Presentation November 18 2008
KISD Board Presentation November 18 2008
 

Más de Syracuse University

Basic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultyBasic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultySyracuse University
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformSyracuse University
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale developmentSyracuse University
 
Carma internet research module getting started with question pro
Carma internet research module   getting started with question proCarma internet research module   getting started with question pro
Carma internet research module getting started with question proSyracuse University
 
Carma internet research module visual design issues
Carma internet research module   visual design issuesCarma internet research module   visual design issues
Carma internet research module visual design issuesSyracuse University
 
Introduction to Advance Analytics Course
Introduction to Advance Analytics CourseIntroduction to Advance Analytics Course
Introduction to Advance Analytics CourseSyracuse University
 
Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Syracuse University
 
Carma internet research module: Sampling for internet
Carma internet research module: Sampling for internetCarma internet research module: Sampling for internet
Carma internet research module: Sampling for internetSyracuse University
 

Más de Syracuse University (20)

Discovery informaticsstanton
Discovery informaticsstantonDiscovery informaticsstanton
Discovery informaticsstanton
 
Basic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultyBasic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University Faculty
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics Platform
 
Chapter9 r studio2
Chapter9 r studio2Chapter9 r studio2
Chapter9 r studio2
 
Basic Overview of Data Mining
Basic Overview of Data MiningBasic Overview of Data Mining
Basic Overview of Data Mining
 
Strategic planning
Strategic planningStrategic planning
Strategic planning
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale development
 
Carma internet research module getting started with question pro
Carma internet research module   getting started with question proCarma internet research module   getting started with question pro
Carma internet research module getting started with question pro
 
Carma internet research module visual design issues
Carma internet research module   visual design issuesCarma internet research module   visual design issues
Carma internet research module visual design issues
 
Siop impact of social media
Siop impact of social mediaSiop impact of social media
Siop impact of social media
 
Basic Graphics with R
Basic Graphics with RBasic Graphics with R
Basic Graphics with R
 
R-Studio Vs. Rcmdr
R-Studio Vs. RcmdrR-Studio Vs. Rcmdr
R-Studio Vs. Rcmdr
 
Getting Started with R
Getting Started with RGetting Started with R
Getting Started with R
 
Introduction to Advance Analytics Course
Introduction to Advance Analytics CourseIntroduction to Advance Analytics Course
Introduction to Advance Analytics Course
 
Installing R and R-Studio
Installing R and R-StudioInstalling R and R-Studio
Installing R and R-Studio
 
Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Reducing Response Burden
Reducing Response BurdenReducing Response Burden
Reducing Response Burden
 
PACIS Survey Workshop
PACIS Survey WorkshopPACIS Survey Workshop
PACIS Survey Workshop
 
Carma internet research module: Sampling for internet
Carma internet research module: Sampling for internetCarma internet research module: Sampling for internet
Carma internet research module: Sampling for internet
 

Último

Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45MysoreMuleSoftMeetup
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...Nguyen Thanh Tu Collection
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxneillewis46
 
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...Nguyen Thanh Tu Collection
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxheathfieldcps1
 
The Liver & Gallbladder (Anatomy & Physiology).pptx
The Liver &  Gallbladder (Anatomy & Physiology).pptxThe Liver &  Gallbladder (Anatomy & Physiology).pptx
The Liver & Gallbladder (Anatomy & Physiology).pptxVishal Singh
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital ManagementMBA Assignment Experts
 
How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17Celine George
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...Gary Wood
 
Championnat de France de Tennis de table/
Championnat de France de Tennis de table/Championnat de France de Tennis de table/
Championnat de France de Tennis de table/siemaillard
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppCeline George
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptxPoojaSen20
 
male presentation...pdf.................
male presentation...pdf.................male presentation...pdf.................
male presentation...pdf.................MirzaAbrarBaig5
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文中 央社
 
An Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge AppAn Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge AppCeline George
 
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhĐề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhleson0603
 

Último (20)

Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
Exploring Gemini AI and Integration with MuleSoft | MuleSoft Mysore Meetup #45
 
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
BỘ LUYỆN NGHE TIẾNG ANH 8 GLOBAL SUCCESS CẢ NĂM (GỒM 12 UNITS, MỖI UNIT GỒM 3...
 
Graduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptxGraduate Outcomes Presentation Slides - English (v3).pptx
Graduate Outcomes Presentation Slides - English (v3).pptx
 
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
24 ĐỀ THAM KHẢO KÌ THI TUYỂN SINH VÀO LỚP 10 MÔN TIẾNG ANH SỞ GIÁO DỤC HẢI DƯ...
 
The basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptxThe basics of sentences session 4pptx.pptx
The basics of sentences session 4pptx.pptx
 
The Liver & Gallbladder (Anatomy & Physiology).pptx
The Liver &  Gallbladder (Anatomy & Physiology).pptxThe Liver &  Gallbladder (Anatomy & Physiology).pptx
The Liver & Gallbladder (Anatomy & Physiology).pptx
 
Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"Mattingly "AI & Prompt Design: Named Entity Recognition"
Mattingly "AI & Prompt Design: Named Entity Recognition"
 
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
Mattingly "AI and Prompt Design: LLMs with Text Classification and Open Source"
 
8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management8 Tips for Effective Working Capital Management
8 Tips for Effective Working Capital Management
 
How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17How to Analyse Profit of a Sales Order in Odoo 17
How to Analyse Profit of a Sales Order in Odoo 17
 
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...When Quality Assurance Meets Innovation in Higher Education - Report launch w...
When Quality Assurance Meets Innovation in Higher Education - Report launch w...
 
Championnat de France de Tennis de table/
Championnat de France de Tennis de table/Championnat de France de Tennis de table/
Championnat de France de Tennis de table/
 
Improved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio AppImproved Approval Flow in Odoo 17 Studio App
Improved Approval Flow in Odoo 17 Studio App
 
ANTI PARKISON DRUGS.pptx
ANTI         PARKISON          DRUGS.pptxANTI         PARKISON          DRUGS.pptx
ANTI PARKISON DRUGS.pptx
 
male presentation...pdf.................
male presentation...pdf.................male presentation...pdf.................
male presentation...pdf.................
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
 
An Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge AppAn Overview of the Odoo 17 Knowledge App
An Overview of the Odoo 17 Knowledge App
 
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinhĐề tieng anh thpt 2024 danh cho cac ban hoc sinh
Đề tieng anh thpt 2024 danh cho cac ban hoc sinh
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
“O BEIJO” EM ARTE .
“O BEIJO” EM ARTE                       .“O BEIJO” EM ARTE                       .
“O BEIJO” EM ARTE .
 

Carma internet research module: Future data collection

  • 1. Future Data Collection CARMA Internet Research Module Jeff Stanton
  • 2. Several Promising Environments and Techniques • Visual Surveys • Audio and video interviewing • Virtual Worlds • Web scraping • Network extraction and mapping • Polls Everywhere • Mobility
  • 3. Visual Surveys Visual DNA: http://www.visualdna.com/ Also try: http://www.youniverse.com/
  • 4. Visual Surveys Provides an engaging alternative to text-based surveys; more fun for respondents Requires considerable set-up time; each screen is like an item; each picture is like an item response; every item and response must be keyed against one or more criteria Example: Previous page, “How do you approach stress,” could be keyed against other subjective or objective measures of stress, coping, general health, immune response, etc.
  • 5. Audio and Video Interviewing Methods: Structured, semi- structured, and unstructured interviewing; focus groups Products: Skype, WebEx, Adobe Connect, Cisco Telepresence Advantages: Reduced travel costs, speed Disadvantages: High bandwidth, user technology requirements, unreliable connections
  • 7. Virtual Worlds Combine text and audio chat with social networking and 3D model building Methods: Structured, semi-structured, and unstructured interviewing; focus groups; unobtrusive observation; participant observation; possibly some experimental perceptual, cooperative, or navigational tasks Products: VastPark, OpenSim, EduSim, Teleplace Advantages: Speed, low cost Disadvantages: Steep learning curve; high bandwidth; user technology requirements; unreliable connections
  • 9. Web Scraping Retrieval and processing of text or images, e.g., from blogs; processing may include semantic analysis of people, events, emotions Methods: Archival document analysis Products: 100s of commercial, mainly focused on brand, reputation, marketing; open source product: WebHarvest Advantages: Data are plentiful and cover a wide range of topics Disadvantages: Technology hard to master; even after considerable automated processing, analysis has an intensive, qualitative flavor
  • 10. Make a Wordcloud with Twitter and R • Download R, the open source statistical platform; for more fun, also download R-Studio; both available for Windows, Mac, and Linux • You will need four packages to make a word cloud: twitteR, stringr, tm, and wordcloud – Use install.packages() and library() commands to prepare packages for use in R • Code appears on the following page; explanation is in my free eBook, Introduction to Data Science on the iTunes Bookstore
  • 11. # TweetFrame() - Return a dataframe based on a search of Twitter TweetFrame<-function(searchTerm, maxTweets) { tweetList <- searchTwitter(searchTerm, n=maxTweets) tweetDF<- do.call("rbind", lapply(tweetList,as.data.frame)) # This last step sorts the tweets in arrival order return(tweetDF[order(as.integer(tweetDF$created)), ]) } # CleanTweets() - Takes the junk out of a vector of tweet texts CleanTweets<-function(tweets) { tweets <- str_replace_all(tweets," "," ") tweets <- str_replace_all(tweets, + "http://t.co/[a-z,A-Z,0-9]{8}","") tweets <- str_replace(tweets,"RT @[a-z,A-Z]*: ","") tweets <- str_replace_all(tweets,"#[a-z,A-Z]*","") tweets <- str_replace_all(tweets,"@[a-z,A-Z]*","") return(tweets) } # Command line code tweetDF <- TweetFrame(”#yourhashtag",100) cleanText<-CleanTweets(tweetDF$text) tweetCorpus<-Corpus(VectorSource(cleanText)) tweetTDM<-TermDocumentMatrix(tweetCorpus) tdMatrix <- as.matrix(tweetTDM) sortedMatrix<-sort(rowSums(tdMatrix), decreasing=TRUE) cloudFrame<-data.frame( word=names(sortedMatrix),freq=sortedMatrix) wordcloud(cloudFrame$word,cloudFrame$freq)
  • 14. Mapping Social Networks Nicholas Christakis of the Framingham Heart Study has shown the power of social networks to influence a variety of health outcomes Methods: Traditional self-report & objective measures; topographical measures such as network centrality; “neighbor” measures Products: Depends on data types; TouchGraph is a network web search engine; InFlow; UCInet; See: http://en.wikipedia.org/wiki/Social_network_analysis_soft ware Advantages: Meaningful improvement in predictive capability Disadvantages: Intensive technique requires careful planning and setup; data collection difficult and time consuming
  • 16. Embedded Polls Collection of short-format survey data from social networking and membership sites Methods: Primarily standard, closed-ended self- report; single item scales Products: Example: Vizu provides a “widget” that allows embedding of polls on Facebook pages Advantages: Quick, cheap, possible to get a large sample in a short time Disadvantages: Difficult to control access, short format limits use of multi-item scales
  • 18. Data Collection from Mobile Devices Using smartphones and other mobile devices as a basis for interacting with participants Methods: Primarily self-report but can include location and movement data Products: Example: Survey On The Spot allows location aware surveys to be delivered to smart phones; TrailGuru collects route data from hikers and joggers Advantages: Platform is becoming ubiquitous, location data provides new options for understanding behavior Disadvantages: Privacy issues, small screen, complex programming interfaces
  • 19. iPhone Fun Reaching Mobile Participants • Micropayment system built into the platform • Feasible for short instruments • Can be tied to particular experiences, e.g., museum visits • Responses can be geotagged to support mapping