SlideShare una empresa de Scribd logo
1 de 20
Attribution Modelling 101:
Credit Where Credit is
Due!
Scott Burger
6/25/19
(alumni)
Agenda
• Intro to multi-touch attribution
• End to end modelling approach
• Handling big data in Google Bigquery
• Markov chain-based modelling in R
• Model outputs and insights
About Me
• WWU Physics 2010
• UCL Astrophyics 2012
• Data Scientist at Microsoft, Tableau
• Author of Intro to Machine Learning with R
• Race bikes for LiquidVelo
• Blog at http://svburger.com
• (slides available here)
Motivation for Attribution – the Customer Journey
• Goal: how much of our total budget
per channel?
• Problem: last touch model gives too
much credit to final channel
• Solution: multi-touch attribution and
sharing the credit!
Touch 1 Touch 2 Touch 3 Convert
Channel A Channel A Channel B
Channel A
Channel B
Budget (LT) Budget (MTA)
0%
100%
66%
33%
Modelling Process Overview
• Develop user journeys via
Google Analytics data in
Google BigQuery
• Establish conversion rates
with unique journey paths
• Pass through Markov model
in R
• Push results from R back to
Google Bigquery
• Visualization in Tableau
Markov / other
Statistical models
Dashboarding
Google Bigquery Intro
• Just need a Google
account to get started
• https://console.cloud.
google.com/bigquery
Generating Hit-Level User Data: Table Sharding
Date Sharding
One day: select * from `project.ga_sessions_20190306`
One month: select * from `project.ga_sessions_201903*`
One year: select * from `project.ga_sessions_2019*`
All of it: select * from `project.ga_sessions_*`
Generating Hit-Level User Data: Table Sharding
Querying select days:
select * from `project.ga_sessions_*`
where _TABLE_SUFFIX between ‘20190301’ and ‘20190304’
Future-state bounded:
select * from `project.ga_sessions_*`
where _TABLE_SUFFIX between ‘20190101’ and ‘20191231’
This will query the daily tables when available
Generating Hit-Level User Data: Hit Unnesting
Querying nested table data
when using UNNEST(hits)
use:
hits.type = ‘PAGE’ for page hits
hits.type = ‘EVENT’ for page
interactions
project_name
Touchpoint Journey Path Examples
• String_agg() function in
GBQ to pivot journeys to
unique paths, then group-by
• Focus on 2+ journey
lengths
• If using impression data, be
cautious about it
overwhelming your journeys
• Channel uniqueness per
journey is another
interesting field of
investigation
• Data must be in this shape
(path, conversions, unique
Modelling Process Overview
• Attribution models available
in Google Analytics UI
• Documentation for the more
statistical models is lacking
• Heuristic (simple) models
can be done in SQL on GA
hit data. Complex models in
R.
Modelling in R
• Packages “bigrquery”, “ChannelAttribution”
• Data stored as a table in Bigquery
• Data pulled in to R instance (bigrquery)
• Markov chain calculated (ChannelAttribution)
• Markov_model() in R
• Data pushed back to Google Bigquery as a table
VM Layer
Attribution Output
• R: channel credit based on
various models
• GBQ: applied fractional
credit to hit-level data
• Tableau: live visualizations
of actual attributed credit
Findings and Summary
• Multi-touch Markov model showed good results for 2+ touch journeys
• Credit shifted away from “Direct” channels to more paid channels (paid search, paid social)
• Cautionary tale of impressions: adding more types of impression channels will
bias results significantly in that direction and shift credit
• Big difference between viewable and measurable impression types
• Markov chain order: slight improvements with Markov chain length 2, diminishing
returns after length 3
More to Explore
• Google Analytics UI demo account:
https://analytics.google.com/analytics/web/?utm_source
=demoaccount&utm_medium=demoaccount&utm_cam
paign=demoaccount#/report-
home/a54516992w87479473p92320289
• Google Bigquery:
https://console.cloud.google.com/bigquery
• More on the R package ChannelAttribution:
https://towardsdatascience.com/multi-channel-
attribution-model-with-r-part-1-markov-chains-concept-
fdd964017626
• Slides and more at: https://svburger.com
Appendix
Markov Chains – How do they work?
First Order Markov Chains :
• P_{n,n-1}=P(w_n|w_{n-1})
• The next touchpoint depends on the
previous touchpoint
x1 x2 x3
x1 x2 x3 x4
Second Order Markov Chains :
• P_{n,n-1}=P(w_n|w_{n-1}, w_{n-2})
• The next touchpoint depends on the
previous TWO touchpoints
Markov Chains – Channel Paths
Single Lead touchpoint Examples via Google
Analytics
• In this example, a user was exposed to numerous impressions and then finally
clicked to come to the main site
• Users will typically have a lot of impression touches in their journey

Más contenido relacionado

Similar a Attribution Modelling 101: Credit Where Credit is Due!: Dynamic talks Seattle/Redmond June 25th 2019

Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...Jin Young Kim
 
web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...nazen2
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentationNeerajNishad4
 
BigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLBigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLMárton Kodok
 
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...Octopus Events
 
Big Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
Big Data LDN 2017: Advanced Analytics Applied to Marketing AttributionBig Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
Big Data LDN 2017: Advanced Analytics Applied to Marketing AttributionMatt Stubbs
 
Understanding Web Analytics and Google Analytics
Understanding Web Analytics and Google AnalyticsUnderstanding Web Analytics and Google Analytics
Understanding Web Analytics and Google AnalyticsPrathamesh Kulkarni
 
Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...José Pablo Gómez Barrón S.
 
BMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist DeckBMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist DeckSasha Lazarevic
 
Transport Modelling for managers 2014 willumsen
Transport Modelling for managers 2014 willumsenTransport Modelling for managers 2014 willumsen
Transport Modelling for managers 2014 willumsenLuis Willumsen
 
Seminar report on google Advertising
Seminar report on  google AdvertisingSeminar report on  google Advertising
Seminar report on google Advertisingnarayan dudhe
 
Understanding Google Analytics
Understanding Google AnalyticsUnderstanding Google Analytics
Understanding Google AnalyticsShubha Brota Raha
 
Top tips for driving leads and optimising conversions in 2020
Top tips for driving leads and optimising conversions in 2020Top tips for driving leads and optimising conversions in 2020
Top tips for driving leads and optimising conversions in 2020Smart Insights
 
Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014darafitzgerald
 
Value Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesValue Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesDevOps.com
 
Mining competitors from large unstructured data
Mining competitors from large unstructured dataMining competitors from large unstructured data
Mining competitors from large unstructured dataTejasSalunkhe11
 

Similar a Attribution Modelling 101: Credit Where Credit is Due!: Dynamic talks Seattle/Redmond June 25th 2019 (20)

Advanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITIAdvanced Analytics in Banking, CITI
Advanced Analytics in Banking, CITI
 
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
Data Science for Online Services: Problems & Frontiers (Changbal Conference 2...
 
web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...
 
laptop price prediction presentation
laptop price prediction presentationlaptop price prediction presentation
laptop price prediction presentation
 
BigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQLBigQuery ML - Machine learning at scale using SQL
BigQuery ML - Machine learning at scale using SQL
 
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
Andrey Sukhovoy. Marketing Mix Modeling: Define the Optimal Budget Allocation...
 
Big Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
Big Data LDN 2017: Advanced Analytics Applied to Marketing AttributionBig Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
Big Data LDN 2017: Advanced Analytics Applied to Marketing Attribution
 
Understanding Web Analytics and Google Analytics
Understanding Web Analytics and Google AnalyticsUnderstanding Web Analytics and Google Analytics
Understanding Web Analytics and Google Analytics
 
Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...Volunteering assistance to online geocoding services through a distributed kn...
Volunteering assistance to online geocoding services through a distributed kn...
 
BMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist DeckBMDSE v1 - Data Scientist Deck
BMDSE v1 - Data Scientist Deck
 
Cam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-mediaCam specialist-unit-web-analytics-and-social-media
Cam specialist-unit-web-analytics-and-social-media
 
Transport Modelling for managers 2014 willumsen
Transport Modelling for managers 2014 willumsenTransport Modelling for managers 2014 willumsen
Transport Modelling for managers 2014 willumsen
 
Seminar report on google Advertising
Seminar report on  google AdvertisingSeminar report on  google Advertising
Seminar report on google Advertising
 
CitySituationAppSeminar
CitySituationAppSeminarCitySituationAppSeminar
CitySituationAppSeminar
 
Internship Presentation.pdf
Internship Presentation.pdfInternship Presentation.pdf
Internship Presentation.pdf
 
Understanding Google Analytics
Understanding Google AnalyticsUnderstanding Google Analytics
Understanding Google Analytics
 
Top tips for driving leads and optimising conversions in 2020
Top tips for driving leads and optimising conversions in 2020Top tips for driving leads and optimising conversions in 2020
Top tips for driving leads and optimising conversions in 2020
 
Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014
 
Value Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the TrenchesValue Stream Mapping – Stories From the Trenches
Value Stream Mapping – Stories From the Trenches
 
Mining competitors from large unstructured data
Mining competitors from large unstructured dataMining competitors from large unstructured data
Mining competitors from large unstructured data
 

Más de Grid Dynamics

Are you keeping up with your customer
Are you keeping up with your customer Are you keeping up with your customer
Are you keeping up with your customer Grid Dynamics
 
"Implementing data quality automation with open source stack" - Max Martynov,...
"Implementing data quality automation with open source stack" - Max Martynov,..."Implementing data quality automation with open source stack" - Max Martynov,...
"Implementing data quality automation with open source stack" - Max Martynov,...Grid Dynamics
 
"How to build cool & useful voice commerce applications (such as devices like...
"How to build cool & useful voice commerce applications (such as devices like..."How to build cool & useful voice commerce applications (such as devices like...
"How to build cool & useful voice commerce applications (such as devices like...Grid Dynamics
 
"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.DGrid Dynamics
 
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...Grid Dynamics
 
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...Grid Dynamics
 
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Grid Dynamics
 
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...Grid Dynamics
 
"Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul...
"Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul..."Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul...
"Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul...Grid Dynamics
 
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019Grid Dynamics
 
Dynamic Talks: "Implementing data quality automation with open source stack" ...
Dynamic Talks: "Implementing data quality automation with open source stack" ...Dynamic Talks: "Implementing data quality automation with open source stack" ...
Dynamic Talks: "Implementing data quality automation with open source stack" ...Grid Dynamics
 
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav..."Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...Grid Dynamics
 
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...Grid Dynamics
 
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...Grid Dynamics
 
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud...
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud..."ML Services - How do you begin and when do you start scaling?" - Madhura Dud...
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud...Grid Dynamics
 
Realtime Contextual Product Recommendations…that scale and generate revenue -...
Realtime Contextual Product Recommendations…that scale and generate revenue -...Realtime Contextual Product Recommendations…that scale and generate revenue -...
Realtime Contextual Product Recommendations…that scale and generate revenue -...Grid Dynamics
 
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...Grid Dynamics
 
Best practices for enterprise-grade microservices implementations with Google...
Best practices for enterprise-grade microservices implementations with Google...Best practices for enterprise-grade microservices implementations with Google...
Best practices for enterprise-grade microservices implementations with Google...Grid Dynamics
 
Building an algorithmic price management system using ML: Dynamic talks Seatt...
Building an algorithmic price management system using ML: Dynamic talks Seatt...Building an algorithmic price management system using ML: Dynamic talks Seatt...
Building an algorithmic price management system using ML: Dynamic talks Seatt...Grid Dynamics
 
Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2
Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2 Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2
Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2 Grid Dynamics
 

Más de Grid Dynamics (20)

Are you keeping up with your customer
Are you keeping up with your customer Are you keeping up with your customer
Are you keeping up with your customer
 
"Implementing data quality automation with open source stack" - Max Martynov,...
"Implementing data quality automation with open source stack" - Max Martynov,..."Implementing data quality automation with open source stack" - Max Martynov,...
"Implementing data quality automation with open source stack" - Max Martynov,...
 
"How to build cool & useful voice commerce applications (such as devices like...
"How to build cool & useful voice commerce applications (such as devices like..."How to build cool & useful voice commerce applications (such as devices like...
"How to build cool & useful voice commerce applications (such as devices like...
 
"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D"Challenges for AI in Healthcare" - Peter Graven Ph.D
"Challenges for AI in Healthcare" - Peter Graven Ph.D
 
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...
Dynamic Talks: "Applications of Big Data, Machine Learning and Artificial Int...
 
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...
Dynamic Talks: "Digital Transformation in Banking & Financial Services… a per...
 
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
Dynamic Talks: "Data Strategy as a Conduit for Data Maturity and Monetization...
 
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...
Dynamics Talks: "Writing Spark Pipelines with Less Boilerplate Code" - Egor P...
 
"Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul...
"Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul..."Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul...
"Trends in Building Advanced Analytics Platform for Large Enterprises" - Atul...
 
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019
The New Era of Public Safety Records Management: Dynamic talks Chicago 9/24/2019
 
Dynamic Talks: "Implementing data quality automation with open source stack" ...
Dynamic Talks: "Implementing data quality automation with open source stack" ...Dynamic Talks: "Implementing data quality automation with open source stack" ...
Dynamic Talks: "Implementing data quality automation with open source stack" ...
 
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav..."Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...
"Implementing AI for New Business Models and Efficiencies" - Parag Shrivastav...
 
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...
Reducing No-shows and Late Cancelations in Healthcare Enterprise" - Shervin M...
 
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...
Customer intelligence: a Machine Learning Approach: Dynamic talks Atlanta 8/2...
 
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud...
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud..."ML Services - How do you begin and when do you start scaling?" - Madhura Dud...
"ML Services - How do you begin and when do you start scaling?" - Madhura Dud...
 
Realtime Contextual Product Recommendations…that scale and generate revenue -...
Realtime Contextual Product Recommendations…that scale and generate revenue -...Realtime Contextual Product Recommendations…that scale and generate revenue -...
Realtime Contextual Product Recommendations…that scale and generate revenue -...
 
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...
Decision Automation in Marketing Systems using Reinforcement Learning: Dynami...
 
Best practices for enterprise-grade microservices implementations with Google...
Best practices for enterprise-grade microservices implementations with Google...Best practices for enterprise-grade microservices implementations with Google...
Best practices for enterprise-grade microservices implementations with Google...
 
Building an algorithmic price management system using ML: Dynamic talks Seatt...
Building an algorithmic price management system using ML: Dynamic talks Seatt...Building an algorithmic price management system using ML: Dynamic talks Seatt...
Building an algorithmic price management system using ML: Dynamic talks Seatt...
 
Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2
Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2 Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2
Customer intelligence: a machine learning approach- Dynamic talks Dallas Q2
 

Último

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Attribution Modelling 101: Credit Where Credit is Due!: Dynamic talks Seattle/Redmond June 25th 2019

  • 1. Attribution Modelling 101: Credit Where Credit is Due! Scott Burger 6/25/19 (alumni)
  • 2. Agenda • Intro to multi-touch attribution • End to end modelling approach • Handling big data in Google Bigquery • Markov chain-based modelling in R • Model outputs and insights
  • 3. About Me • WWU Physics 2010 • UCL Astrophyics 2012 • Data Scientist at Microsoft, Tableau • Author of Intro to Machine Learning with R • Race bikes for LiquidVelo • Blog at http://svburger.com • (slides available here)
  • 4. Motivation for Attribution – the Customer Journey • Goal: how much of our total budget per channel? • Problem: last touch model gives too much credit to final channel • Solution: multi-touch attribution and sharing the credit! Touch 1 Touch 2 Touch 3 Convert Channel A Channel A Channel B Channel A Channel B Budget (LT) Budget (MTA) 0% 100% 66% 33%
  • 5. Modelling Process Overview • Develop user journeys via Google Analytics data in Google BigQuery • Establish conversion rates with unique journey paths • Pass through Markov model in R • Push results from R back to Google Bigquery • Visualization in Tableau Markov / other Statistical models Dashboarding
  • 6. Google Bigquery Intro • Just need a Google account to get started • https://console.cloud. google.com/bigquery
  • 7. Generating Hit-Level User Data: Table Sharding Date Sharding One day: select * from `project.ga_sessions_20190306` One month: select * from `project.ga_sessions_201903*` One year: select * from `project.ga_sessions_2019*` All of it: select * from `project.ga_sessions_*`
  • 8. Generating Hit-Level User Data: Table Sharding Querying select days: select * from `project.ga_sessions_*` where _TABLE_SUFFIX between ‘20190301’ and ‘20190304’ Future-state bounded: select * from `project.ga_sessions_*` where _TABLE_SUFFIX between ‘20190101’ and ‘20191231’ This will query the daily tables when available
  • 9. Generating Hit-Level User Data: Hit Unnesting Querying nested table data when using UNNEST(hits) use: hits.type = ‘PAGE’ for page hits hits.type = ‘EVENT’ for page interactions project_name
  • 10. Touchpoint Journey Path Examples • String_agg() function in GBQ to pivot journeys to unique paths, then group-by • Focus on 2+ journey lengths • If using impression data, be cautious about it overwhelming your journeys • Channel uniqueness per journey is another interesting field of investigation • Data must be in this shape (path, conversions, unique
  • 11. Modelling Process Overview • Attribution models available in Google Analytics UI • Documentation for the more statistical models is lacking • Heuristic (simple) models can be done in SQL on GA hit data. Complex models in R.
  • 12.
  • 13. Modelling in R • Packages “bigrquery”, “ChannelAttribution” • Data stored as a table in Bigquery • Data pulled in to R instance (bigrquery) • Markov chain calculated (ChannelAttribution) • Markov_model() in R • Data pushed back to Google Bigquery as a table VM Layer
  • 14. Attribution Output • R: channel credit based on various models • GBQ: applied fractional credit to hit-level data • Tableau: live visualizations of actual attributed credit
  • 15. Findings and Summary • Multi-touch Markov model showed good results for 2+ touch journeys • Credit shifted away from “Direct” channels to more paid channels (paid search, paid social) • Cautionary tale of impressions: adding more types of impression channels will bias results significantly in that direction and shift credit • Big difference between viewable and measurable impression types • Markov chain order: slight improvements with Markov chain length 2, diminishing returns after length 3
  • 16. More to Explore • Google Analytics UI demo account: https://analytics.google.com/analytics/web/?utm_source =demoaccount&utm_medium=demoaccount&utm_cam paign=demoaccount#/report- home/a54516992w87479473p92320289 • Google Bigquery: https://console.cloud.google.com/bigquery • More on the R package ChannelAttribution: https://towardsdatascience.com/multi-channel- attribution-model-with-r-part-1-markov-chains-concept- fdd964017626 • Slides and more at: https://svburger.com
  • 18. Markov Chains – How do they work? First Order Markov Chains : • P_{n,n-1}=P(w_n|w_{n-1}) • The next touchpoint depends on the previous touchpoint x1 x2 x3 x1 x2 x3 x4 Second Order Markov Chains : • P_{n,n-1}=P(w_n|w_{n-1}, w_{n-2}) • The next touchpoint depends on the previous TWO touchpoints
  • 19. Markov Chains – Channel Paths
  • 20. Single Lead touchpoint Examples via Google Analytics • In this example, a user was exposed to numerous impressions and then finally clicked to come to the main site • Users will typically have a lot of impression touches in their journey

Notas del editor

  1. Title Slide
  2. https://support.google.com/analytics/answer/6367342?hl=en