Boegershausen et al. (2022).pptx


American Marketing Association | Journals

While marketing researchers increasingly employ web data, the idiosyncratic and sometimes insidious challenges in its collection have received limited attention. How can researchers ensure that the datasets generated via web scraping and APIs are valid? A new article in the Journal of Marketing proposes a methodological framework that highlights how addressing validity concerns requires the joint consideration of idiosyncratic technical and legal/ethical questions. The framework covers the broad spectrum of validity concerns arising from the automatic collection of web data for academic use along the three stages of collecting web data: selecting data sources, designing the data collection, and extracting the data.

Education

Fields of Gold: Scraping Web Data for Marketing Insights
Boegershausen, Datta, Borah, and Stephen (2022)

A Wealth of Data for Marketing Research is Created on the Internet
• ~244m reviews
• >1b reviews & opinions
• 556K projects
• 500m/day
• 7:11 hours: time spent online per day by the average American consumer
• 85%: proportion of US consumers that use the Internet every single day
Based on available company and market research statistics in May 2022.

Web Scraping & APIs Can be Used to Extract Web Data at Scale
Web Scraping
… the process of developing software to automatically collect information displayed in a web browser
Example articles: Chevalier and Mayzlin (2006); Ludwig et al. (2013)
APIs
… allow programmatic access to the internal databases or algorithms of data providers
Example articles: Tellis et al. (2019); Toubia and Stephen (2013)
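The difference between the two access routes can be illustrated with a minimal sketch. It assumes a hypothetical review page and a hypothetical API response (the markup, the class names "review", "rating", and "text", and the JSON layout are all invented for illustration) and uses only the Python standard library. Scraping has to recover the records from the markup a browser would display, while the API hands over the same records already structured.

```python
import json
from html.parser import HTMLParser

# Hypothetical page markup, standing in for what a browser would display.
SAMPLE_HTML = """
<html><body>
  <div class="review"><span class="rating">5</span><p class="text">Great product</p></div>
  <div class="review"><span class="rating">2</span><p class="text">Stopped working</p></div>
</body></html>
"""

# Hypothetical API response exposing the same records as structured data.
SAMPLE_API_RESPONSE = (
    '{"reviews": [{"rating": 5, "text": "Great product"},'
    ' {"rating": 2, "text": "Stopped working"}]}'
)


class ReviewScraper(HTMLParser):
    """Collects rating/text pairs from elements with the assumed class names."""

    def __init__(self):
        super().__init__()
        self.reviews = []
        self._field = None  # name of the field currently being read, if any

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if css_class == "review":
            self.reviews.append({})  # a new review container starts
        elif css_class in ("rating", "text") and self.reviews:
            self._field = css_class

    def handle_data(self, data):
        value = data.strip()
        if self._field and value:
            self.reviews[-1][self._field] = int(value) if self._field == "rating" else value
            self._field = None


def scrape_reviews(html):
    """Web scraping route: parse displayed markup into records."""
    parser = ReviewScraper()
    parser.feed(html)
    return parser.reviews


def reviews_from_api(payload):
    """API route: the provider already returns structured records."""
    return json.loads(payload)["reviews"]


if __name__ == "__main__":
    print(scrape_reviews(SAMPLE_HTML))
    print(reviews_from_api(SAMPLE_API_RESPONSE))
```

In this sketch the scraper depends on the page layout, so a change to the class names would silently break it, whereas the API route depends only on the response format.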

This Data Collection Technique can be Used in a Variety of Settings
Pathway ①: Studying new phenomena, e.g., Zervas et al. (2017); Datta et al. (2018)
Pathway ②: Boosting ecological value, e.g., Du et al. (2015); Ludwig et al. (2013)
Pathway ③: Facilitating methodological advancement, e.g., Netzer et al. (2012); Liu et al. (2020)
Pathway ④: Improving measurement, e.g., Li et al. (2017); Datta et al. (2022)

Collecting Valid Web Data Poses Many Challenges…
Validity concerns may arise from:
• Failing to capture contextual information in a rapidly changing environment (e.g., updates to the website’s data-generating process, such as changes to how and where information is displayed)
• Not sufficiently aligning the psychological processes of interest with the frequency of data extraction on review platforms (e.g., the collected information does not capture the time when the behavior occurred)
• Overlooking the influence of algorithmic interference on e-commerce websites (e.g., the effect of personalization algorithms on information display)
• …and many more.
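One way to guard against some of the concerns listed on this slide is to store extraction context with every record. The following is a minimal sketch, not a method taken from the paper: the function names make_record and differing_fields, the record layout, and the example URL are all assumptions. Each record keeps the extraction time and a hash of the raw page, so later changes to how information is displayed can be detected, and two extractions of the same page (for example, from a fresh session and from a logged-in session) can be compared to flag fields that may be affected by personalization.

```python
import hashlib
from datetime import datetime, timezone


def make_record(url, raw_html, fields, extracted_at=None):
    """Attach extraction context (time, page fingerprint) to scraped fields."""
    extracted_at = extracted_at or datetime.now(timezone.utc)
    return {
        "url": url,
        "extracted_at": extracted_at.isoformat(),
        # Fingerprint of the raw page; a changed hash signals changed content or layout.
        "page_sha256": hashlib.sha256(raw_html.encode("utf-8")).hexdigest(),
        "fields": dict(fields),
    }


def differing_fields(record_a, record_b):
    """Return the names of fields whose values differ between two extractions."""
    a, b = record_a["fields"], record_b["fields"]
    return sorted(key for key in a.keys() | b.keys() if a.get(key) != b.get(key))


if __name__ == "__main__":
    # Hypothetical example: the same product page seen by two different sessions.
    fresh = make_record("https://example.com/p/1", "<html>a</html>", {"price": "9.99", "rank": 1})
    logged_in = make_record("https://example.com/p/1", "<html>b</html>", {"price": "9.99", "rank": 3})
    print(differing_fields(fresh, logged_in))
```

A non-empty result does not prove personalization, since the page may simply have changed between the two requests; it only marks fields worth inspecting.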

How to Extract Valid Web Data?
Considerations: Validity; Technical feasibility; Legal and ethical risks
Stages: 1. Source Selection; 2. Collection Design; 3. Data Extraction
- Jointly consider validity concerns, alongside technical and legal/ethical questions
- Selected examples and solutions
  - Collecting user data from social networks may infringe upon users’ privacy rights → anonymize user IDs
  - Product review data may be biased by personalization algorithms → check whether own browsing behavior affects information display
  - Extraction of all of the information from a website may take too long → consider taking a sample
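Two of the solutions named on this slide, anonymizing user IDs and taking a sample, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' implementation: the function names, the secret key, and the example URLs are hypothetical, and a keyed hash (HMAC-SHA256) is only one possible way to replace user IDs. Strictly speaking it pseudonymizes rather than anonymizes, since anyone holding the key can recompute the mapping.

```python
import hashlib
import hmac
import random


def anonymize_user_id(user_id, secret_key):
    """Replace a user ID with a keyed hash so the raw ID is never stored.

    The same ID always maps to the same token, so records from one user
    can still be linked within the dataset.
    """
    return hmac.new(secret_key, user_id.encode("utf-8"), hashlib.sha256).hexdigest()


def sample_urls(urls, k, seed):
    """Draw a reproducible simple random sample of at most k URLs."""
    urls = list(urls)
    rng = random.Random(seed)  # fixed seed so the sample can be reproduced later
    return rng.sample(urls, min(k, len(urls)))


if __name__ == "__main__":
    key = b"replace-with-a-private-key"  # hypothetical key; keep it out of the dataset
    print(anonymize_user_id("user_12345", key))

    # Hypothetical list of pages that would take too long to extract in full.
    all_pages = [f"https://example.com/product/{i}" for i in range(1000)]
    print(sample_urls(all_pages, k=5, seed=2022))
```

Documenting the seed and the sampling frame alongside the data lets others reconstruct which pages were eligible and which were drawn.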

Want to get started collecting and using web data?
Read the paper, and visit https://web-scraping.org.
o Explore a database with 300+ published marketing articles using web data & get inspired!
o Discover web datasets & APIs for your research projects.
o Find tutorials and example code for collecting web data using web scraping & APIs.

