SlideShare a Scribd company logo
1 of 22
Download to read offline
Google Analytics sampling
limitations
and how to overcome them
George	
  Papadongonas
Web	
  Analyst,	
  Amazee	
  Metrics
16/7/2013
How Google Analytics stores data
• All unfiltered data of a web property (up to 10 million hits per
month) are stored in the Google Analytics database
• Each standard report has an associated data table in the Google
Analytics database with unsampled data
• Reports for accounts with more than 200,000 visits per day are
processed daily
• Reports for accounts with less than 200,000 visits per day are
processed more often
2
Google Analytics
Report Sampling
• Sampling starts when the requested date range has more than
500,000 visits
• The sample size can be arranged by using the Google Analytics
sampling slider. The default setting is for 250,000 visits, maximum
setting is 500,000
• Visits are counted for the specific date range on a Web Property level,
not on a Profile level
• Standard reports without filters, advanced segments or secondary
dimensions use always unsampled data
• Sampling applies to custom reports
3
Organic Search traffic report
is unsampled
4
By adding a second dimension,
sampling is applied
5
Adjust the sampling size
6
Prefer the
higher precision setting
7
Google Analytics
Report Sampling
8
How sampling is calculated
• Web property has 24,580,303 visits
• Profile has 492,786 visits
• Default sampling is: 250,000x492,786/24,580,303 = 5,012 visits (1%)
• Maximum sampling is: 500,000x492,786/24,580,303 = 10,024 visits (2%)
9
Avoid the
faster processing setting
10
Prefer the
higher precision setting
11
Pageviews or events reports
can be unreliable
12
Google Analytics
Report Sampling
• Visits and Visitors reports are usually reliable, even with a small
sample
• E-commerce transactions, individual pageviews, adwords data,
revenue and goal conversions are less reliable
13
Solutions
14
1. Buy Google Analytics Premium
• “Only”$150.000 / year
• 1 Billion hits processed per month
• Unsampled reports
• Data processing every 4 hours
15
2. Create custom profiles
• Instead of creating reports with specific advanced segments,
create custom profiles using filters
• The default reports of all profiles are always unsampled, even if the
visitors are more than 500,000
16
3. Enable Data Sampling
• Sample your data , by adding a line in the Google Analytics
tracking code
code_gaq.push(['_setSampleRate', '80']); Sets
sampling rate at 80%
• Not a perfect solution, as the data are still sampled, but you have
control and can avoid tracking interruption (for more than 10
million hits per month)
17
4. Use smaller date range
• Break you report in smaller data ranges, each one having less than
500.000 visits
• This ensures that the data are unsampled
• Export the reports using the Google Analytics API
• Aggregate the data in Excel and create the master report
18
5. Use analyticscanvas.com
• Analytics Canvas offers query partitioning, using the Google
Analytics API.
• Reports are exported in smaller date ranges, so that they are
unsampled and they are then merged automatically with Analytics
Canvas.
19
6. Download Google Analytics
data locally
• It is possible to keep a local copy of Google Analytics data
• Add a line in the Google Analytics tracking code
_gaq.push(['_setLocalRemoteServerMode']);
• Add _utm.gif to your web server root
20
6. Download Google Analytics
data locally
86.138.209.96 www.mysite.com - [01/Oct/2007:03:34:02 +0100] "GET /__utm.gif?utmwv=1&utmt=var&utmn=
2108116629 HTTP/1.1" 200 35 "http://www.mysite.com/pageX.htm" "Mozilla/4.0 (compatible; MSIE 6.0;
Windows NT 5.1; SV1; .NET CLR 1.1.4322)" "__utma=1.117971038.1175394730.1175394730.1175394730.1;
__utmb=1; __utmc=1; __utmz=1.1175394730.1.1.utmcid=23|utmgclid=CP-Bssq-oIsCFQMrlAodeUThgA|
utmccn=(not+set)|utmcmd=(not+set)|utmctr=looking+for+site; __utmv=1.Section One"
21
• Data are recorded in the server log files
• Use http://analytics.angelfishstats.com/ to analyze them, as Urchin is
discontiniued
Thanks!
22

More Related Content

What's hot

MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?Michaela Linhart
 
Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2yalisassoon
 
Data Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsData Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsChristopher Gutknecht
 
How to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google AnalyticsHow to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google AnalyticsYevheniia Chernenko
 
One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4One Further
 
Data driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & SnowplowData driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & SnowplowGiuseppe Gaviani
 
Simply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution AnalysisSimply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution AnalysisStewart Duncan
 
What's new in Google Analytics 4
What's new in Google Analytics 4What's new in Google Analytics 4
What's new in Google Analytics 4Stephen Ellis
 
Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017Shuki Mann
 
BCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead GenerationBCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead GenerationPhilippe Taza
 
Beyond the clouds with digital analytics
Beyond the clouds with digital analyticsBeyond the clouds with digital analytics
Beyond the clouds with digital analyticsMark Edmondson
 
The culture trip snowplow implementation
The culture trip snowplow implementationThe culture trip snowplow implementation
The culture trip snowplow implementationidan_by
 
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupRich Plakas
 
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use CasesTatvic Analytics
 
Google analytics version 4 in details
Google analytics version 4 in detailsGoogle analytics version 4 in details
Google analytics version 4 in detailsOmkar Nandi
 
Snowplow at Sigfig
Snowplow at SigfigSnowplow at Sigfig
Snowplow at Sigfigyalisassoon
 
How Incuda builds user journey models with Snowplow
How Incuda builds user journey models with SnowplowHow Incuda builds user journey models with Snowplow
How Incuda builds user journey models with SnowplowGiuseppe Gaviani
 
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016yalisassoon
 
Google Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in OrderGoogle Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in OrderRod Jacka
 

What's hot (20)

MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
MeasureCamp #10 - WTF are Related Products in Google Analytics Ecommerce?
 
Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2
 
Data Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov ChainsData Driven Attribution in BigQuery with Shapley Values and Markov Chains
Data Driven Attribution in BigQuery with Shapley Values and Markov Chains
 
How to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google AnalyticsHow to Avoid Sampling in Google Analytics
How to Avoid Sampling in Google Analytics
 
One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4One Further - Spektrix and Google Analytics 4
One Further - Spektrix and Google Analytics 4
 
Data driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & SnowplowData driven video advertising campaigns - JustWatch & Snowplow
Data driven video advertising campaigns - JustWatch & Snowplow
 
Simply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution AnalysisSimply Business and Snowplow - Multichannel Attribution Analysis
Simply Business and Snowplow - Multichannel Attribution Analysis
 
What's new in Google Analytics 4
What's new in Google Analytics 4What's new in Google Analytics 4
What's new in Google Analytics 4
 
Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017Jonathan Weber - All Things DATA 2017
Jonathan Weber - All Things DATA 2017
 
BCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead GenerationBCCCA Google Analytics for Improved Lead Generation
BCCCA Google Analytics for Improved Lead Generation
 
Beyond the clouds with digital analytics
Beyond the clouds with digital analyticsBeyond the clouds with digital analytics
Beyond the clouds with digital analytics
 
The culture trip snowplow implementation
The culture trip snowplow implementationThe culture trip snowplow implementation
The culture trip snowplow implementation
 
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress MeetupGoogle Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
Google Analytics with an Intro to Google Tag Manager for Austin WordPress Meetup
 
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
[Webinar] Getting Started with BigQuery: Basics, Its Appilcations & Use Cases
 
Google analytics version 4 in details
Google analytics version 4 in detailsGoogle analytics version 4 in details
Google analytics version 4 in details
 
Snowplow at Sigfig
Snowplow at SigfigSnowplow at Sigfig
Snowplow at Sigfig
 
How Incuda builds user journey models with Snowplow
How Incuda builds user journey models with SnowplowHow Incuda builds user journey models with Snowplow
How Incuda builds user journey models with Snowplow
 
Adobe part 1
Adobe part 1Adobe part 1
Adobe part 1
 
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
 
Google Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in OrderGoogle Analytics For Retailers - Getting Your House in Order
Google Analytics For Retailers - Getting Your House in Order
 

Similar to Google Analytics sampling limitations and how to overcome them

Usage Trend Reporting
Usage Trend Reporting Usage Trend Reporting
Usage Trend Reporting Gainsight
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017Nate Plaunt
 
10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to Experts10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to ExpertsTatvic Analytics
 
Geek Sync | Intro to Query Store
Geek Sync | Intro to Query StoreGeek Sync | Intro to Query Store
Geek Sync | Intro to Query StoreIDERA Software
 
Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014darafitzgerald
 
Delivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsDelivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsMariaDB plc
 
Google Analytics for SEO Beginners
Google Analytics for SEO BeginnersGoogle Analytics for SEO Beginners
Google Analytics for SEO BeginnersAditya Todawal
 
Implementing google big query automation using google analytics data
Implementing google big query automation using google analytics dataImplementing google big query automation using google analytics data
Implementing google big query automation using google analytics dataCountants
 
Google Analytics - Webmaster Tools Pro Setup & Tips
Google Analytics -  Webmaster Tools Pro Setup & TipsGoogle Analytics -  Webmaster Tools Pro Setup & Tips
Google Analytics - Webmaster Tools Pro Setup & TipsRank Fuse Digital Marketing
 
Business analytics and data visualisation
Business analytics and data visualisationBusiness analytics and data visualisation
Business analytics and data visualisationShwetabh Jaiswal
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Rittman Analytics
 
Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1Sarah Blake
 
Comparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibmComparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibmCountants
 
Google Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductGoogle Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductSergey Smetanin
 
Google_Analytics_PPT.pptx
Google_Analytics_PPT.pptxGoogle_Analytics_PPT.pptx
Google_Analytics_PPT.pptxitsmahek2001
 
Bdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenBdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenChristopher Bergh
 
web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...nazen2
 
GAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en ReinierGAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en ReinierOnline Dialogue
 

Similar to Google Analytics sampling limitations and how to overcome them (20)

Usage Trend Reporting
Usage Trend Reporting Usage Trend Reporting
Usage Trend Reporting
 
Google Analytics Training - full 2017
Google Analytics Training - full 2017Google Analytics Training - full 2017
Google Analytics Training - full 2017
 
10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to Experts10 Most Underused Features of Google Analytics 360 According to Experts
10 Most Underused Features of Google Analytics 360 According to Experts
 
Geek Sync | Intro to Query Store
Geek Sync | Intro to Query StoreGeek Sync | Intro to Query Store
Geek Sync | Intro to Query Store
 
Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014Universal Analytics Out of Beta - BrightonSEO 2014
Universal Analytics Out of Beta - BrightonSEO 2014
 
Delivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analyticsDelivering fast, powerful and scalable analytics
Delivering fast, powerful and scalable analytics
 
Google Analytics for SEO Beginners
Google Analytics for SEO BeginnersGoogle Analytics for SEO Beginners
Google Analytics for SEO Beginners
 
Implementing google big query automation using google analytics data
Implementing google big query automation using google analytics dataImplementing google big query automation using google analytics data
Implementing google big query automation using google analytics data
 
Google Analytics - Webmaster Tools Pro Setup & Tips
Google Analytics -  Webmaster Tools Pro Setup & TipsGoogle Analytics -  Webmaster Tools Pro Setup & Tips
Google Analytics - Webmaster Tools Pro Setup & Tips
 
Business analytics and data visualisation
Business analytics and data visualisationBusiness analytics and data visualisation
Business analytics and data visualisation
 
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
Budapest Data Forum 2017 - BigQuery, Looker And Big Data Analytics At Petabyt...
 
Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1Tech4Africa Google Workshop 1
Tech4Africa Google Workshop 1
 
Comparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibmComparing google analytics vs adobe analytics vs ibm
Comparing google analytics vs adobe analytics vs ibm
 
Google Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your ProductGoogle Cloud Platform as a Backend Solution for your Product
Google Cloud Platform as a Backend Solution for your Product
 
WGP Universal Analytics Breakfast Seminar
WGP Universal Analytics Breakfast SeminarWGP Universal Analytics Breakfast Seminar
WGP Universal Analytics Breakfast Seminar
 
Google_Analytics_PPT.pptx
Google_Analytics_PPT.pptxGoogle_Analytics_PPT.pptx
Google_Analytics_PPT.pptx
 
Bdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchenBdf16 big-data-warehouse-case-study-data kitchen
Bdf16 big-data-warehouse-case-study-data kitchen
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...web analysis document | web analysis document | web analysis document | web a...
web analysis document | web analysis document | web analysis document | web a...
 
GAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en ReinierGAUC 2020 - presentatie Hans en Reinier
GAUC 2020 - presentatie Hans en Reinier
 

Recently uploaded

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 

Recently uploaded (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 

Google Analytics sampling limitations and how to overcome them

  • 1. Google Analytics sampling limitations and how to overcome them George  Papadongonas Web  Analyst,  Amazee  Metrics 16/7/2013
  • 2. How Google Analytics stores data • All unfiltered data of a web property (up to 10 million hits per month) are stored in the Google Analytics database • Each standard report has an associated data table in the Google Analytics database with unsampled data • Reports for accounts with more than 200,000 visits per day are processed daily • Reports for accounts with less than 200,000 visits per day are processed more often 2
  • 3. Google Analytics Report Sampling • Sampling starts when the requested date range has more than 500,000 visits • The sample size can be arranged by using the Google Analytics sampling slider. The default setting is for 250,000 visits, maximum setting is 500,000 • Visits are counted for the specific date range on a Web Property level, not on a Profile level • Standard reports without filters, advanced segments or secondary dimensions use always unsampled data • Sampling applies to custom reports 3
  • 4. Organic Search traffic report is unsampled 4
  • 5. By adding a second dimension, sampling is applied 5
  • 9. How sampling is calculated • Web property has 24,580,303 visits • Profile has 492,786 visits • Default sampling is: 250,000x492,786/24,580,303 = 5,012 visits (1%) • Maximum sampling is: 500,000x492,786/24,580,303 = 10,024 visits (2%) 9
  • 12. Pageviews or events reports can be unreliable 12
  • 13. Google Analytics Report Sampling • Visits and Visitors reports are usually reliable, even with a small sample • E-commerce transactions, individual pageviews, adwords data, revenue and goal conversions are less reliable 13
  • 15. 1. Buy Google Analytics Premium • “Only”$150.000 / year • 1 Billion hits processed per month • Unsampled reports • Data processing every 4 hours 15
  • 16. 2. Create custom profiles • Instead of creating reports with specific advanced segments, create custom profiles using filters • The default reports of all profiles are always unsampled, even if the visitors are more than 500,000 16
  • 17. 3. Enable Data Sampling • Sample your data , by adding a line in the Google Analytics tracking code code_gaq.push(['_setSampleRate', '80']); Sets sampling rate at 80% • Not a perfect solution, as the data are still sampled, but you have control and can avoid tracking interruption (for more than 10 million hits per month) 17
  • 18. 4. Use smaller date range • Break you report in smaller data ranges, each one having less than 500.000 visits • This ensures that the data are unsampled • Export the reports using the Google Analytics API • Aggregate the data in Excel and create the master report 18
  • 19. 5. Use analyticscanvas.com • Analytics Canvas offers query partitioning, using the Google Analytics API. • Reports are exported in smaller date ranges, so that they are unsampled and they are then merged automatically with Analytics Canvas. 19
  • 20. 6. Download Google Analytics data locally • It is possible to keep a local copy of Google Analytics data • Add a line in the Google Analytics tracking code _gaq.push(['_setLocalRemoteServerMode']); • Add _utm.gif to your web server root 20
  • 21. 6. Download Google Analytics data locally 86.138.209.96 www.mysite.com - [01/Oct/2007:03:34:02 +0100] "GET /__utm.gif?utmwv=1&utmt=var&utmn= 2108116629 HTTP/1.1" 200 35 "http://www.mysite.com/pageX.htm" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1; .NET CLR 1.1.4322)" "__utma=1.117971038.1175394730.1175394730.1175394730.1; __utmb=1; __utmc=1; __utmz=1.1175394730.1.1.utmcid=23|utmgclid=CP-Bssq-oIsCFQMrlAodeUThgA| utmccn=(not+set)|utmcmd=(not+set)|utmctr=looking+for+site; __utmv=1.Section One" 21 • Data are recorded in the server log files • Use http://analytics.angelfishstats.com/ to analyze them, as Urchin is discontiniued