The math-behind-ab-testing

The math behind A/B testing
How to perform a non-biased test

A/B testing
Not a replacement for common sense

It only gives you a level of confidence

Helps you achieve only local maxima

A/B experiment: Toss a coin
Heads = successful conversion. Tails = no conversion.

Hypothesis: Wearing a red t-shirt will increase conversion

46 heads out of 100

54 heads out of 100

Conversion increased by 17%

Changing your t-shirt to red increases conversion
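A quick check of where the 17% comes from, as a minimal sketch. It assumes the 46-heads run is the baseline and the 54-heads run is the red t-shirt run; the variable names are illustrative only.

```python
# Counts taken from the coin-toss slides above.
baseline_heads = 46   # assumed to be the run without the red t-shirt
red_heads = 54        # assumed to be the run with the red t-shirt

# Relative lift: the change measured against the baseline, not the
# 8-point absolute difference between the two rates.
relative_lift = (red_heads - baseline_heads) / baseline_heads

print(f"Relative lift: {relative_lift:.1%}")
```

The figure is a relative lift between two single runs; it says nothing yet about how noisy each run is.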

What's wrong
Conversion is never a single number. It's a range.

[Figure: probability distribution of the conversion rate, centered on the mean µ, with its spread labeled Variance/Noise]

What's wrong
Sample = 100 tosses.

µ = 0.5 (∞ coin tosses)
µ = 0.46 (100 coin tosses)

Sample mean ≠ population mean
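The gap between sample mean and population mean can be seen by repeating the 100-toss experiment many times. This is a minimal simulation sketch: the function name, the seed, and the choice of 1000 repetitions are assumptions made for illustration.

```python
import random

def sample_conversion(n_tosses, p_true, rng):
    """Fraction of heads in n_tosses of a coin whose true heads probability is p_true."""
    heads = sum(1 for _ in range(n_tosses) if rng.random() < p_true)
    return heads / n_tosses

rng = random.Random(42)  # fixed seed (arbitrary) so the run is repeatable

# Repeat the same 100-toss experiment 1000 times with a fair coin (mu = 0.5).
rates = [sample_conversion(100, 0.5, rng) for _ in range(1000)]

print("lowest sample mean: ", min(rates))
print("highest sample mean:", max(rates))
print("average of sample means:", sum(rates) / len(rates))
```

Individual runs land on both sides of 0.5, so a single result such as 0.46 or 0.54 is unremarkable for a fair coin.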

The role of chance

[Figure: Red and Blue samples]

Comparison between two noisy samples

Statistical significance

[Figure: Red and Blue samples]

Standard Error (SE) = Square root of (p * (1-p) / n)
p = conversion rate, n = sample size

SE tells you how much deviation from the average conversion rate (p) can be expected if this experiment is repeated multiple times.
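The SE formula applied to the two coin-toss runs, as a small sketch. The function name and the labels "control" and "red" are assumptions; the formula is the one stated above.

```python
import math

def standard_error(p, n):
    """Standard error of a conversion rate p measured on n samples."""
    return math.sqrt(p * (1 - p) / n)

se_control = standard_error(0.46, 100)  # 46 heads out of 100
se_red = standard_error(0.54, 100)      # 54 heads out of 100

print(round(se_control, 4), round(se_red, 4))
```

Both runs have an SE of about 0.05, which is more than half the 0.08 gap between them.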

95% confidence:
True conversion rate lies within this range: p ± 2 * SE
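A sketch of the p ± 2 * SE range for both coin-toss runs. The helper name is hypothetical, and the multiplier 2 is the rounded value used on the slide (1.96 is the more precise figure for 95%).

```python
import math

def confidence_interval_95(p, n):
    """Approximate 95% range for the true conversion rate: p +/- 2 * SE."""
    se = math.sqrt(p * (1 - p) / n)
    return p - 2 * se, p + 2 * se

low_control, high_control = confidence_interval_95(0.46, 100)
low_red, high_red = confidence_interval_95(0.54, 100)

# The ranges overlap if the lower bound of the higher rate falls
# below the upper bound of the lower rate.
ranges_overlap = low_red <= high_control

print(f"control: {low_control:.3f} to {high_control:.3f}")
print(f"red:     {low_red:.3f} to {high_red:.3f}")
print("overlap:", ranges_overlap)
```

The two ranges run from roughly 0.36 to 0.56 and roughly 0.44 to 0.64, so they overlap heavily and the 100-toss runs cannot separate the two shirts.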

http://visualwebsiteoptimizer.com/ab-split-significance-calculator/
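A sketch of one common way to test significance between two variations: a pooled two-proportion z-test. This is an assumption about the general technique, not a description of how the calculator linked above works internally; the function name is hypothetical.

```python
import math

def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Pooled two-proportion z-test. Returns (z, two-sided p-value)."""
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled rate under the null hypothesis that both variations convert equally.
    p_pool = (conv_a + conv_b) / (n_a + n_b)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

z, p_value = two_proportion_z_test(46, 100, 54, 100)
print(f"z = {z:.2f}, p-value = {p_value:.2f}")
```

For the coin-toss numbers the p-value comes out well above 0.05, so the red t-shirt "win" is not significant at 95% confidence.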

Sample size

Min. sample size needed to calculate statistical significance. Inputs:

Statistical confidence
Existing conversion rate of website
Difference in conversion rate you want to detect
Number of variations you want to test
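A sketch of how these inputs can be turned into a minimum sample size, using the standard normal-approximation formula for comparing two proportions. Assumptions not on the slide: a statistical power of 80%, an absolute (not relative) difference, no correction for multiple comparisons, and all function and parameter names.

```python
import math
from statistics import NormalDist

def min_sample_size_per_variation(baseline, min_diff, confidence=0.95, power=0.80):
    """Samples needed in EACH variation to detect an absolute difference min_diff."""
    p1 = baseline
    p2 = baseline + min_diff
    p_bar = (p1 + p2) / 2
    z_alpha = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # two-sided test
    z_beta = NormalDist().inv_cdf(power)                      # assumed 80% power
    numerator = (
        z_alpha * math.sqrt(2 * p_bar * (1 - p_bar))
        + z_beta * math.sqrt(p1 * (1 - p1) + p2 * (1 - p2))
    ) ** 2
    return math.ceil(numerator / (p2 - p1) ** 2)

num_variations = 2  # e.g. the two t-shirts
n_each = min_sample_size_per_variation(baseline=0.46, min_diff=0.08)
n_total = n_each * num_variations

print(n_each, n_total)
```

Under these assumptions, telling 0.46 apart from 0.54 takes on the order of 600 tosses per variation, far more than the 100 used in the coin experiment.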

http://www.testsignificance.com/

Ideal test
Determine the sample size

Check the results only once you have reached the sample size

Determine the statistical significance

If there is no clear winner, pick based on your long-term plan
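Why the results should be checked only once the sample size is reached: a simulation sketch of an A/A test (two identical fair coins) that compares checking at every 50 tosses with checking once at the end. The sample size of 500, the peek interval, the seed, and all names are assumptions chosen for illustration.

```python
import math
import random

def is_significant(conv_a, conv_b, n, z_crit=1.96):
    """Pooled two-proportion z-test at about 95% confidence, n samples per arm."""
    p_pool = (conv_a + conv_b) / (2 * n)
    if p_pool in (0.0, 1.0):
        return False
    se = math.sqrt(p_pool * (1 - p_pool) * 2 / n)
    return abs(conv_a - conv_b) / n / se > z_crit

def run_experiment(rng, n_total=500, peek_every=50):
    """Returns (significant at ANY peek, significant at the final check)."""
    conv_a = conv_b = 0
    peeked_win = False
    for i in range(1, n_total + 1):
        conv_a += int(rng.random() < 0.5)  # both arms are the same fair coin
        conv_b += int(rng.random() < 0.5)
        if i % peek_every == 0 and is_significant(conv_a, conv_b, i):
            peeked_win = True
    return peeked_win, is_significant(conv_a, conv_b, n_total)

rng = random.Random(7)  # fixed seed (arbitrary) for repeatability
results = [run_experiment(rng) for _ in range(500)]
peek_rate = sum(peeked for peeked, _ in results) / len(results)
final_rate = sum(final for _, final in results) / len(results)

print(f"false winners when peeking:     {peek_rate:.1%}")
print(f"false winners at the end only:  {final_rate:.1%}")
```

There is no real difference between the two arms, so every "winner" is a false positive. Stopping at the first significant peek can only raise that count compared with a single check at the planned sample size.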
