SlideShare una empresa de Scribd logo
1 de 57
Descargar para leer sin conexión
1 Het begint met een idee
Statistical tests and effect size
Ivano Malavolta
Vrije Universiteit Amsterdam
2 Ivano Malavolta / S2 group / Statistical tests and effect size
Roadmap
● Warm up
● Check for normality
● Main statistical tests
● p-value corrections
● Effect size
Vrije Universiteit Amsterdam
3 Ivano Malavolta / S2 group / Statistical tests and effect size
Context and assumptions
● We focus on quantitative variables only
○ nominal
○ ordinal
○ interval
○ ratio
● Factors are nominal or ordinal
○ dependent variables can be any
● Our statistical tests detect differences between the means
of the dependent variable
● Factor alternatives are fixed a priori
Vrije Universiteit Amsterdam
4 Ivano Malavolta / S2 group / Statistical tests and effect size
Tasks for data analysis
1. Descriptive statistics
○ for understanding the “shape” of collected data
2. Select statistical test
○ according to collected metrics and data distribution
3. Hypothesis testing
○ for providing evidence about your findings
i. statistical significance
4. Effect size calculation
○ for understanding if your (statistically significant) results are actually
relevant in practice
5. Power analysis
○ for knowing if your results are emerging only by luck [1]
Vrije Universiteit Amsterdam
5 Ivano Malavolta / S2 group / Statistical tests and effect size
What is a statistical test?
● Calculation of a sample statistic assuming that the null
hypothesis is true
● The calculated value of the statistic has a certain probability
given that the null hypothesis is true (p-value)
Vrije Universiteit Amsterdam
6 Ivano Malavolta / S2 group / Statistical tests and effect size
First choice: parametric VS non-parametric tests
● Parametric tests assume a specific distribution of the data
○ typically, normal distribution
○ more powerful
→ smaller sample sizes are needed
● Non-parametric tests do not make any assumption about
data distribution
○ more general
○ less powerful
→ larger samples are needed
Vrije Universiteit Amsterdam
7 Ivano Malavolta / S2 group / Statistical tests and effect size
How to choose?
● Applicability
○ type of variable
■ nominal, ordinal scale → non-parametric
○ distribution of the values
■ normality
○ other assumptions
● Power
If you can apply it, always prefer a parametric test
In your report explain in details why you choose a specific test
Vrije Universiteit Amsterdam
8 Ivano Malavolta / S2 group / Statistical tests and effect size
How to choose?
Non-parametric methods Parametric methods
Model
restrictions
(eg, normality)
satisfied?
Nominal or
ordinal scale?
YES
NO
YES
NO
Vrije Universiteit Amsterdam
9
Check for normality
Ivano Malavolta / S2 group / Statistical tests and effect size
Vrije Universiteit Amsterdam
10 Ivano Malavolta / S2 group / Statistical tests and effect size
Graphical check (Q-Q plot)
Vrije Universiteit Amsterdam
11 Ivano Malavolta / S2 group / Statistical tests and effect size
Normality tests
● Normality tests
○ H0
: sample is drawn from a normal distribution
● Shapiro-Wilk test (AKA Shapiro-Wilk’s W)
● If p-value <α for a given sample, we can conclude data is
NOT normally distributed
Vrije Universiteit Amsterdam
12 Ivano Malavolta / S2 group / Statistical tests and effect size
Shapiro-Wilk test
Vrije Universiteit Amsterdam
13 Ivano Malavolta / S2 group / Statistical tests and effect size
Shapiro-Wilk test
Vrije Universiteit Amsterdam
14
Shapiro-Wilk test
● Warning: Shapiro-Wilk is not robust for small samples!
○ Additional verification (e.g. via Q-Q plot) is always needed
Vrije Universiteit Amsterdam
15
Main statistical tests
Ivano Malavolta / S2 group / Statistical tests and effect size
Vrije Universiteit Amsterdam
16 Ivano Malavolta / S2 group / Statistical tests and effect size
Statistical tests VS experiment design
Vrije Universiteit Amsterdam
17 Ivano Malavolta / S2 group / Statistical tests and effect size
One factor - 2 treatments - random design
Vrije Universiteit Amsterdam
18 Ivano Malavolta / S2 group / Statistical tests and effect size
t-Test
Goal: compare independent samples
○ Values of the dependent variable obtained with different treatments
○ For each treatment you are measuring different subjects
Parametric
Hypotheses:
● Two-tailed
○ H0
: μ2
= μ1
Ha
: μ2
≠ μ1
● One-tailed (alternative: greater)
○ H0
: μ2
= μ1
Ha
: μ2
> μ1
● One-tailed (alternative: less)
○ H0
: μ2
= μ1
Ha
: μ2
< μ1
● More powerful
● Cannot say anything in the
opposite direction
Vrije Universiteit Amsterdam
19 Ivano Malavolta / S2 group / Statistical tests and effect size
t-Test in R
Vrije Universiteit Amsterdam
20 Ivano Malavolta / S2 group / Statistical tests and effect size
t-Test: example
Vrije Universiteit Amsterdam
21 Ivano Malavolta / S2 group / Statistical tests and effect size
t-Test: example 2
Vrije Universiteit Amsterdam
22 Ivano Malavolta / S2 group / Statistical tests and effect size
Mann-Whitney test
Goal: compare independent samples
● It can be used instead of the t-test when data is not normal
Non-parametric
Hypotheses:
● Two-tailed
○ H0
: μ2
= μ1
Ha
: μ2
≠ μ1
● One-tailed (alternative: greater)
○ H0
: μ2
= μ1
Ha
: μ2
> μ1
● One-tailed (alternative: less)
○ H0
: μ2
= μ1
Ha
: μ2
< μ1
● Same hypotheses as the t-test
Vrije Universiteit Amsterdam
23
Mann-Whitney test in R
Vrije Universiteit Amsterdam
24 Ivano Malavolta / S2 group / Statistical tests and effect size
Mann-Whitney test: example
Vrije Universiteit Amsterdam
25 Ivano Malavolta / S2 group / Statistical tests and effect size
One factor - 2 treatments - paired design
Vrije Universiteit Amsterdam
26 Ivano Malavolta / S2 group / Statistical tests and effect size
Paired t-Test
Goal: compare independent samples from repeated measures
○ Each subject receives different treatments
○ We focus on the differences exhibited by subjects with different treatments
○ Samples must be equals in size
Parametric
Hypotheses:
● Two-tailed
○ H0
: μd
= 0 Ha
: μd
≠ 0
● One-tailed (alternative: greater)
○ H0
: μd
= 0 Ha
: μd
> 0
● One-tailed (alternative: less)
○ H0
: μd
= 0 Ha
: μd
< 0
Vrije Universiteit Amsterdam
27 Ivano Malavolta / S2 group / Statistical tests and effect size
Paired t-Test: example
Vrije Universiteit Amsterdam
28 Ivano Malavolta / S2 group / Statistical tests and effect size
Wilcoxon signed-rank test
Goal: compare independent samples from repeated measures
● It can be used instead of the paired t-test in case of not normal data
Non-parametric
● Same hypotheses as
the paired t-test
Hypotheses:
● Two-tailed
○ H0
: μd
= 0 Ha
: μd
≠ 0
● One-tailed (alternative: greater)
○ H0
: μd
= 0 Ha
: μd
> 0
● One-tailed (alternative: less)
○ H0
: μd
= 0 Ha
: μd
< 0
Vrije Universiteit Amsterdam
29 Ivano Malavolta / S2 group / Statistical tests and effect size
Wilcoxon signed-rank test: example
Vrije Universiteit Amsterdam
30 Ivano Malavolta / S2 group / Statistical tests and effect size
>=1 factors - >2 treatments
Vrije Universiteit Amsterdam
31 Ivano Malavolta / S2 group / Statistical tests and effect size
ANOVA (ANalysis Of VAriance)
Goal: understand how much of the total variance is due to differences
within factors, and how much is due to differences across factors
○ Many types of ANOVA tests
○ Works for many experiment designs
Parametric
Hypotheses:
H0
: μ1
= μ2
= μ3
Ha
: μ1
≠ μ2
V μ1
≠ μ3
V μ2
≠ μ3
Vrije Universiteit Amsterdam
32
F-statistic
F = Variation among sample means / variation within the samples
when H0
→ F follows a known F-distribution
● the mean of the F-distribution tends to be 1
https://goo.gl/jHWo8A
Vrije Universiteit Amsterdam
33
Significance
F tends to be larger if H0
is false
→ the more F deviates from 1, the
stronger the evidence for unequal
population variances
● Methods to determine significance level:
○ textbook: compare F against a table of critical values
(according to DF and α). If F > Fcritical
, reject H0
○ computer-based: compute the p-value of finding F greater
than the observed value. If p < α, reject H0
Ivano Malavolta / S2 group / Statistical tests and effect size
Vrije Universiteit Amsterdam
34
Types of ANOVA
Ivano Malavolta / S2 group / Statistical tests and effect size
● One-way ANOVA
○ one factor, >2 treatments
○ if 2 treatments: equivalent to t-test (almost never used)
Vrije Universiteit Amsterdam
35
Types of ANOVA
Ivano Malavolta / S2 group / Statistical tests and effect size
● Factorial ANOVA
○ 2 (two-way) or more factors
○ any number of treatments
○ also computes interactions
Vrije Universiteit Amsterdam
36
How to know which treatments really differ?
Tukey’s test
Vrije Universiteit Amsterdam
37
Types of ANOVA
● Repeated Measures ANOVA (rANOVA)
○ computes changes of a treatment over time
○ used when applying each treatment to the same subjects
(within-subject comparison)
Vrije Universiteit Amsterdam
38
Types of ANOVA
● Multivariate ANOVA (mANOVA)
○ more than one outcome (dependent variable)
○ simultaneous analysis
Individual variables Combined
Vrije Universiteit Amsterdam
39
ANOVA assumptions
Ivano Malavolta / S2 group / Statistical tests and effect size
● The dependent variable should be continuous
● Samples must be independent
● Normal distribution of the dependent variable between
the groups (approximately)
● Errors (aka residuals) should be normally distributed
○ qqPlot(residuals(myData.aov))
● Homoscedasticity
○ variance between groups should be the same
■ bartlett.test(x ~ y, data=myData)
■ leveneTest(x ~ y, data=myData)
Assumptions violated → non-parametric alternative
Vrije Universiteit Amsterdam
40
ANOVA: non-parametric alternative
Ivano Malavolta / S2 group / Statistical tests and effect size
● Kruskal-Wallis: one-way non-parametric ANOVA
○ one factor, multiple treatments
○ no estimate of the treatment effect (due to ranking)
Non-parametric
Vrije Universiteit Amsterdam
41 Ivano Malavolta / S2 group / Statistical tests and effect size
Main statistical tests
You are measuring different subjects
You are measuring the same subject
against different treatments
Use this in case the values
of your dep. var are not
normally distributed
Vrije Universiteit Amsterdam
42
p-value corrections
Ivano Malavolta / S2 group / Statistical tests and effect size
Vrije Universiteit Amsterdam
43
Example
Ivano Malavolta / S2 group / Statistical tests and effect size
Dependent variable = energy consumption of the app
Independent variables =
● A: Image encoding algorithm: {png, jpeg}
● B: Mobile device type: {high-end, low-end}
● C: Network conditions: {wifi, 3G}
You perform 3 tests:
● t.test(A, B)
● t.test(A, C)
● t.test(B, C)
P(at least one significant result) = 1 − P(no significant results)
= 1 − (1 − 0.05)3
≈ 0.15
→ 15% chance of seeing relevant results, when there may be none
Vrije Universiteit Amsterdam
44
The problem
Ivano Malavolta / S2 group / Statistical tests and effect size
Multiple tests → higher probability of getting (statistically
significant ) results by chance
→ you have to adjust your α (it was 0.05)
Two main correction techniques:
● Bonferroni
● Holm
Vrije Universiteit Amsterdam
45
Bonferroni correction
Supposing we are doing N tests,
we can reject H0
if the p-values of those tests are below α/N
We can reject the H0
if all tests provide a p-value < 0.05/3=0.016
→ 0.016 is our new significance threshold!
Vrije Universiteit Amsterdam
Less stringent that Bonferroni’s
Procedure:
● rank your p-values from the smallest to the largest
● multiply the first by N, the second by N-1, etc.
● a p-value is significant if after multiplied is <0.05
P-values of the tests: {0.01, 0.02, 0.03}
Bonferroni:
● 0.01 * 3 = 0.03
● 0.02 * 3 = 0.06
● 0.03 * 3 = 0.09
46
Holm’s correction
Ivano Malavolta / S2 group / Statistical tests and effect size
Holm:
● 0.01 * 3 = 0.03
● 0.02 * 2 = 0.04
● 0.03 * 1 = 0.03
Vrije Universiteit Amsterdam
47
Effect size
Ivano Malavolta / S2 group / Statistical tests and effect size
Vrije Universiteit Amsterdam
48
Recall
Vrije Universiteit Amsterdam
49
Effect size measures
Ivano Malavolta / S2 group / Statistical tests and effect size
● Cohen’s d
○ parametric statistics
● Cliff’s delta
○ non-parametric statistics
Vrije Universiteit Amsterdam
50
Cohen’s d
Ivano Malavolta / S2 group / Statistical tests and effect size
The magnitude of a main factor treatment effect on the dependent
variable
Where:
● x1
, x2
= the means of the two groups
● s = standard deviation
○ Pooled standard deviation for independent samples
Values:
0 = full overlap
1 = 1-sigma distance
between the means
…
3 = 3-sigma distance →
~no overlap
Vrije Universiteit Amsterdam
51
Cohen’s d in R
Independent samples Dependent samples
Vrije Universiteit Amsterdam
52
Cliff’s delta
Ivano Malavolta / S2 group / Statistical tests and effect size
Represents the degree of overlap between the two distributions of
scores
Where:
● xi
… xj
= the specific values of each sample
● m, n = the cardinalities of the two groups
Values:
0 = full overlap
+1 = all the values of one
group > all the values of
the other one
...
-1 = the inverse
Vrije Universiteit Amsterdam
53
Intuition
Vrije Universiteit Amsterdam
54
Cliff’s delta in R
Vrije Universiteit Amsterdam
55
What this lecture means to you?
Vrije Universiteit Amsterdam
56 Ivano Malavolta / S2 group / Empirical software engineering
Readings
[1] Dybå, Tore, Vigdis By Kampenes, and Dag IK Sjøberg. "A systematic review of statistical
power in software engineering experiments." Information and Software Technology 48.8
(2006): 745-755.
Chapter 10 Part 3 Chapter 6
Vrije Universiteit Amsterdam
57 Ivano Malavolta / S2 group / Empirical software engineering
Some contents of lecture extracted from:
● Giuseppe Procaccianti’s lectures at VU
Acknowledgements

Más contenido relacionado

La actualidad más candente

[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect size[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect sizeIvano Malavolta
 
[02-A] The experimental process
[02-A] The experimental process[02-A] The experimental process
[02-A] The experimental processIvano Malavolta
 
The Green Lab - [03 A] Experiment planning
The Green Lab - [03 A] Experiment planningThe Green Lab - [03 A] Experiment planning
The Green Lab - [03 A] Experiment planningIvano Malavolta
 
The Green Lab - [09 B] Experiment validity
The Green Lab - [09  B] Experiment validityThe Green Lab - [09  B] Experiment validity
The Green Lab - [09 B] Experiment validityIvano Malavolta
 
The Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data VisualizationThe Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data VisualizationGiuseppe Procaccianti
 
[03-A] Experiment planning
[03-A] Experiment planning[03-A] Experiment planning
[03-A] Experiment planningIvano Malavolta
 
[02-B] Experiment scoping
[02-B] Experiment scoping[02-B] Experiment scoping
[02-B] Experiment scopingIvano Malavolta
 
The Green Lab - [02 A] The experimental process
The Green Lab - [02 A] The experimental processThe Green Lab - [02 A] The experimental process
The Green Lab - [02 A] The experimental processIvano Malavolta
 
[13 - A] Experiment validity
[13 - A] Experiment validity[13 - A] Experiment validity
[13 - A] Experiment validityIvano Malavolta
 
[03-B] Measurement theory basics
[03-B] Measurement theory basics[03-B] Measurement theory basics
[03-B] Measurement theory basicsIvano Malavolta
 
The Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scopingThe Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scopingIvano Malavolta
 
[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testing[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testingIvano Malavolta
 
The Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in RThe Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in RGiuseppe Procaccianti
 
[01-B] Empirical software engineering
[01-B] Empirical software engineering[01-B] Empirical software engineering
[01-B] Empirical software engineeringIvano Malavolta
 
[07-A] Descriptive Statistics and data exploration
[07-A] Descriptive Statistics and data exploration[07-A] Descriptive Statistics and data exploration
[07-A] Descriptive Statistics and data explorationIvano Malavolta
 
Analysis results-of-multiple-choice-tests
Analysis results-of-multiple-choice-testsAnalysis results-of-multiple-choice-tests
Analysis results-of-multiple-choice-testsyousra_zarli
 
Predire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataPredire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataData Driven Innovation
 
Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...
Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...
Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...Matthias Braunhofer
 

La actualidad más candente (20)

[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect size[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect size
 
[02-A] The experimental process
[02-A] The experimental process[02-A] The experimental process
[02-A] The experimental process
 
The Green Lab - [03 A] Experiment planning
The Green Lab - [03 A] Experiment planningThe Green Lab - [03 A] Experiment planning
The Green Lab - [03 A] Experiment planning
 
The Green Lab - [09 B] Experiment validity
The Green Lab - [09  B] Experiment validityThe Green Lab - [09  B] Experiment validity
The Green Lab - [09 B] Experiment validity
 
The Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data VisualizationThe Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data Visualization
 
[03-A] Experiment planning
[03-A] Experiment planning[03-A] Experiment planning
[03-A] Experiment planning
 
[02-B] Experiment scoping
[02-B] Experiment scoping[02-B] Experiment scoping
[02-B] Experiment scoping
 
The Green Lab - [02 A] The experimental process
The Green Lab - [02 A] The experimental processThe Green Lab - [02 A] The experimental process
The Green Lab - [02 A] The experimental process
 
[13 - A] Experiment validity
[13 - A] Experiment validity[13 - A] Experiment validity
[13 - A] Experiment validity
 
[03-B] Measurement theory basics
[03-B] Measurement theory basics[03-B] Measurement theory basics
[03-B] Measurement theory basics
 
The Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scopingThe Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scoping
 
[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testing[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testing
 
The Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in RThe Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in R
 
[01-B] Empirical software engineering
[01-B] Empirical software engineering[01-B] Empirical software engineering
[01-B] Empirical software engineering
 
[07-A] Descriptive Statistics and data exploration
[07-A] Descriptive Statistics and data exploration[07-A] Descriptive Statistics and data exploration
[07-A] Descriptive Statistics and data exploration
 
Analysis results-of-multiple-choice-tests
Analysis results-of-multiple-choice-testsAnalysis results-of-multiple-choice-tests
Analysis results-of-multiple-choice-tests
 
Predire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big DataPredire il futuro con Machine Learning & Big Data
Predire il futuro con Machine Learning & Big Data
 
Resume_xuezhi
Resume_xuezhiResume_xuezhi
Resume_xuezhi
 
Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...
Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...
Parsimonious and Adaptive Contextual Information Acquisition in Recommender S...
 
Uses of accelerometer sensor and its application in m-Learning environments: ...
Uses of accelerometer sensor and its application in m-Learning environments: ...Uses of accelerometer sensor and its application in m-Learning environments: ...
Uses of accelerometer sensor and its application in m-Learning environments: ...
 

Destacado

The Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis TestingThe Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis TestingGiuseppe Procaccianti
 
The Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challengesThe Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challengesIvano Malavolta
 
The Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setupThe Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setupIvano Malavolta
 
The Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and toolsThe Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and toolsGiuseppe Procaccianti
 
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...Ivano Malavolta
 
The Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web appsThe Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web appsIvano Malavolta
 
The Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentationThe Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentationGiuseppe Procaccianti
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...Lora Aroyo
 
WebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in CrowdsourcingWebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in CrowdsourcingLora Aroyo
 
PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors Victor de Boer
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Lora Aroyo
 
Talk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament ProceedingsTalk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament ProceedingsAstrid van Aggelen
 
Dive exploring history presentation
Dive exploring history presentationDive exploring history presentation
Dive exploring history presentationVictor de Boer
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator DemosLora Aroyo
 
[2016/2017] Modern development paradigms
[2016/2017] Modern development paradigms [2016/2017] Modern development paradigms
[2016/2017] Modern development paradigms Ivano Malavolta
 
Future TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialFuture TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialLora Aroyo
 

Destacado (16)

The Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis TestingThe Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis Testing
 
The Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challengesThe Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challenges
 
The Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setupThe Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setup
 
The Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and toolsThe Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and tools
 
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
 
The Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web appsThe Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web apps
 
The Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentationThe Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentation
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
 
WebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in CrowdsourcingWebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in Crowdsourcing
 
PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
 
Talk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament ProceedingsTalk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament Proceedings
 
Dive exploring history presentation
Dive exploring history presentationDive exploring history presentation
Dive exploring history presentation
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator Demos
 
[2016/2017] Modern development paradigms
[2016/2017] Modern development paradigms [2016/2017] Modern development paradigms
[2016/2017] Modern development paradigms
 
Future TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialFuture TV is Now: Personalized & Social
Future TV is Now: Personalized & Social
 

Similar a The Green Lab - [09 A] Statistical tests and effect size

Statistical analysis.pptx
Statistical analysis.pptxStatistical analysis.pptx
Statistical analysis.pptxChinna Chadayan
 
Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...
Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...
Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...The Stockker
 
Introduction to Analysis of Variance
Introduction to Analysis of VarianceIntroduction to Analysis of Variance
Introduction to Analysis of Variancejasondroesch
 
Day 12 t test for dependent samples and single samples pdf
Day 12 t test for dependent samples and single samples pdfDay 12 t test for dependent samples and single samples pdf
Day 12 t test for dependent samples and single samples pdfElih Sutisna Yanto
 
non parametric statistics
non parametric statisticsnon parametric statistics
non parametric statisticsAnchal Garg
 
ANOVA_PDF.pdf biostatistics course material
ANOVA_PDF.pdf biostatistics course materialANOVA_PDF.pdf biostatistics course material
ANOVA_PDF.pdf biostatistics course materialAmanuelIbrahim
 
MPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptxMPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptxrodrickrajamanickam
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research dataAtula Ahuja
 
Testing of hypothesis.pptx
Testing of hypothesis.pptxTesting of hypothesis.pptx
Testing of hypothesis.pptxSyedaKumail
 
Marketing Research Hypothesis Testing.pptx
Marketing Research Hypothesis Testing.pptxMarketing Research Hypothesis Testing.pptx
Marketing Research Hypothesis Testing.pptxxababid981
 
Non-parametric Statistical tests for Hypotheses testing
Non-parametric Statistical tests for Hypotheses testingNon-parametric Statistical tests for Hypotheses testing
Non-parametric Statistical tests for Hypotheses testingSundar B N
 
Mann Whitney U Test
Mann Whitney U TestMann Whitney U Test
Mann Whitney U TestJohn Barlow
 
Repeated-Measures and Two-Factor Analysis of Variance
Repeated-Measures and Two-Factor Analysis of VarianceRepeated-Measures and Two-Factor Analysis of Variance
Repeated-Measures and Two-Factor Analysis of Variancejasondroesch
 
Tutorial mean difference
Tutorial mean differenceTutorial mean difference
Tutorial mean differencenumberanalytics
 
3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdfMitikuTeka1
 

Similar a The Green Lab - [09 A] Statistical tests and effect size (20)

Statistical analysis.pptx
Statistical analysis.pptxStatistical analysis.pptx
Statistical analysis.pptx
 
Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...
Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...
Research methodology - Estimation Theory & Hypothesis Testing, Techniques of ...
 
Analysis of Variance
Analysis of VarianceAnalysis of Variance
Analysis of Variance
 
biostat__final_ppt_unit_3.pptx
biostat__final_ppt_unit_3.pptxbiostat__final_ppt_unit_3.pptx
biostat__final_ppt_unit_3.pptx
 
Introduction to Analysis of Variance
Introduction to Analysis of VarianceIntroduction to Analysis of Variance
Introduction to Analysis of Variance
 
Non parametric-tests
Non parametric-testsNon parametric-tests
Non parametric-tests
 
Day 12 t test for dependent samples and single samples pdf
Day 12 t test for dependent samples and single samples pdfDay 12 t test for dependent samples and single samples pdf
Day 12 t test for dependent samples and single samples pdf
 
non parametric statistics
non parametric statisticsnon parametric statistics
non parametric statistics
 
ANOVA_PDF.pdf biostatistics course material
ANOVA_PDF.pdf biostatistics course materialANOVA_PDF.pdf biostatistics course material
ANOVA_PDF.pdf biostatistics course material
 
MPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptxMPhil clinical psy Non-parametric statistics.pptx
MPhil clinical psy Non-parametric statistics.pptx
 
classmar16.ppt
classmar16.pptclassmar16.ppt
classmar16.ppt
 
classmar16.ppt
classmar16.pptclassmar16.ppt
classmar16.ppt
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research data
 
Testing of hypothesis.pptx
Testing of hypothesis.pptxTesting of hypothesis.pptx
Testing of hypothesis.pptx
 
Marketing Research Hypothesis Testing.pptx
Marketing Research Hypothesis Testing.pptxMarketing Research Hypothesis Testing.pptx
Marketing Research Hypothesis Testing.pptx
 
Non-parametric Statistical tests for Hypotheses testing
Non-parametric Statistical tests for Hypotheses testingNon-parametric Statistical tests for Hypotheses testing
Non-parametric Statistical tests for Hypotheses testing
 
Mann Whitney U Test
Mann Whitney U TestMann Whitney U Test
Mann Whitney U Test
 
Repeated-Measures and Two-Factor Analysis of Variance
Repeated-Measures and Two-Factor Analysis of VarianceRepeated-Measures and Two-Factor Analysis of Variance
Repeated-Measures and Two-Factor Analysis of Variance
 
Tutorial mean difference
Tutorial mean differenceTutorial mean difference
Tutorial mean difference
 
3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf
 

Más de Ivano Malavolta

Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...Ivano Malavolta
 
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)Ivano Malavolta
 
Software sustainability and Green IT
Software sustainability and Green ITSoftware sustainability and Green IT
Software sustainability and Green ITIvano Malavolta
 
Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...Ivano Malavolta
 
How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]Ivano Malavolta
 
Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...Ivano Malavolta
 
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...Ivano Malavolta
 
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...Ivano Malavolta
 
Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...Ivano Malavolta
 
Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...Ivano Malavolta
 
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...Ivano Malavolta
 
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...Ivano Malavolta
 
Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...Ivano Malavolta
 
[2017/2018] Agile development
[2017/2018] Agile development[2017/2018] Agile development
[2017/2018] Agile developmentIvano Malavolta
 
Reconstructing microservice-based architectures
Reconstructing microservice-based architecturesReconstructing microservice-based architectures
Reconstructing microservice-based architecturesIvano Malavolta
 
[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design Language[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design LanguageIvano Malavolta
 
[2017/2018] Architectural languages
[2017/2018] Architectural languages[2017/2018] Architectural languages
[2017/2018] Architectural languagesIvano Malavolta
 
[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software Architecture[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software ArchitectureIvano Malavolta
 
[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineering[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineeringIvano Malavolta
 

Más de Ivano Malavolta (20)

Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
 
The H2020 experience
The H2020 experienceThe H2020 experience
The H2020 experience
 
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)
 
Software sustainability and Green IT
Software sustainability and Green ITSoftware sustainability and Green IT
Software sustainability and Green IT
 
Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...
 
How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]
 
Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...
 
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
 
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
 
Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...
 
Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...
 
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
 
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
 
Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...
 
[2017/2018] Agile development
[2017/2018] Agile development[2017/2018] Agile development
[2017/2018] Agile development
 
Reconstructing microservice-based architectures
Reconstructing microservice-based architecturesReconstructing microservice-based architectures
Reconstructing microservice-based architectures
 
[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design Language[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design Language
 
[2017/2018] Architectural languages
[2017/2018] Architectural languages[2017/2018] Architectural languages
[2017/2018] Architectural languages
 
[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software Architecture[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software Architecture
 
[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineering[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineering
 

Último

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 

Último (20)

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 

The Green Lab - [09 A] Statistical tests and effect size

  • 1. 1 Het begint met een idee Statistical tests and effect size Ivano Malavolta
  • 2. Vrije Universiteit Amsterdam 2 Ivano Malavolta / S2 group / Statistical tests and effect size Roadmap ● Warm up ● Check for normality ● Main statistical tests ● p-value corrections ● Effect size
  • 3. Vrije Universiteit Amsterdam 3 Ivano Malavolta / S2 group / Statistical tests and effect size Context and assumptions ● We focus on quantitative variables only ○ nominal ○ ordinal ○ interval ○ ratio ● Factors are nominal or ordinal ○ dependent variables can be any ● Our statistical tests detect differences between the means of the dependent variable ● Factor alternatives are fixed a priori
  • 4. Vrije Universiteit Amsterdam 4 Ivano Malavolta / S2 group / Statistical tests and effect size Tasks for data analysis 1. Descriptive statistics ○ for understanding the “shape” of collected data 2. Select statistical test ○ according to collected metrics and data distribution 3. Hypothesis testing ○ for providing evidence about your findings i. statistical significance 4. Effect size calculation ○ for understanding if your (statistically significant) results are actually relevant in practice 5. Power analysis ○ for knowing if your results are emerging only by luck [1]
  • 5. Vrije Universiteit Amsterdam 5 Ivano Malavolta / S2 group / Statistical tests and effect size What is a statistical test? ● Calculation of a sample statistic assuming that the null hypothesis is true ● The calculated value of the statistic has a certain probability given that the null hypothesis is true (p-value)
  • 6. Vrije Universiteit Amsterdam 6 Ivano Malavolta / S2 group / Statistical tests and effect size First choice: parametric VS non-parametric tests ● Parametric tests assume a specific distribution of the data ○ typically, normal distribution ○ more powerful → smaller sample sizes are needed ● Non-parametric tests do not make any assumption about data distribution ○ more general ○ less powerful → larger samples are needed
  • 7. Vrije Universiteit Amsterdam 7 Ivano Malavolta / S2 group / Statistical tests and effect size How to choose? ● Applicability ○ type of variable ■ nominal, ordinal scale → non-parametric ○ distribution of the values ■ normality ○ other assumptions ● Power If you can apply it, always prefer a parametric test In your report explain in details why you choose a specific test
  • 8. Vrije Universiteit Amsterdam 8 Ivano Malavolta / S2 group / Statistical tests and effect size How to choose? Non-parametric methods Parametric methods Model restrictions (eg, normality) satisfied? Nominal or ordinal scale? YES NO YES NO
  • 9. Vrije Universiteit Amsterdam 9 Check for normality Ivano Malavolta / S2 group / Statistical tests and effect size
  • 10. Vrije Universiteit Amsterdam 10 Ivano Malavolta / S2 group / Statistical tests and effect size Graphical check (Q-Q plot)
  • 11. Vrije Universiteit Amsterdam 11 Ivano Malavolta / S2 group / Statistical tests and effect size Normality tests ● Normality tests ○ H0 : sample is drawn from a normal distribution ● Shapiro-Wilk test (AKA Shapiro-Wilk’s W) ● If p-value <α for a given sample, we can conclude data is NOT normally distributed
  • 12. Vrije Universiteit Amsterdam 12 Ivano Malavolta / S2 group / Statistical tests and effect size Shapiro-Wilk test
  • 13. Vrije Universiteit Amsterdam 13 Ivano Malavolta / S2 group / Statistical tests and effect size Shapiro-Wilk test
  • 14. Vrije Universiteit Amsterdam 14 Shapiro-Wilk test ● Warning: Shapiro-Wilk is not robust for small samples! ○ Additional verification (e.g. via Q-Q plot) is always needed
  • 15. Vrije Universiteit Amsterdam 15 Main statistical tests Ivano Malavolta / S2 group / Statistical tests and effect size
  • 16. Vrije Universiteit Amsterdam 16 Ivano Malavolta / S2 group / Statistical tests and effect size Statistical tests VS experiment design
  • 17. Vrije Universiteit Amsterdam 17 Ivano Malavolta / S2 group / Statistical tests and effect size One factor - 2 treatments - random design
  • 18. Vrije Universiteit Amsterdam 18 Ivano Malavolta / S2 group / Statistical tests and effect size t-Test Goal: compare independent samples ○ Values of the dependent variable obtained with different treatments ○ For each treatment you are measuring different subjects Parametric Hypotheses: ● Two-tailed ○ H0 : μ2 = μ1 Ha : μ2 ≠ μ1 ● One-tailed (alternative: greater) ○ H0 : μ2 = μ1 Ha : μ2 > μ1 ● One-tailed (alternative: less) ○ H0 : μ2 = μ1 Ha : μ2 < μ1 ● More powerful ● Cannot say anything in the opposite direction
  • 19. Vrije Universiteit Amsterdam 19 Ivano Malavolta / S2 group / Statistical tests and effect size t-Test in R
  • 20. Vrije Universiteit Amsterdam 20 Ivano Malavolta / S2 group / Statistical tests and effect size t-Test: example
  • 21. Vrije Universiteit Amsterdam 21 Ivano Malavolta / S2 group / Statistical tests and effect size t-Test: example 2
  • 22. Vrije Universiteit Amsterdam 22 Ivano Malavolta / S2 group / Statistical tests and effect size Mann-Whitney test Goal: compare independent samples ● It can be used instead of the t-test when data is not normal Non-parametric Hypotheses: ● Two-tailed ○ H0 : μ2 = μ1 Ha : μ2 ≠ μ1 ● One-tailed (alternative: greater) ○ H0 : μ2 = μ1 Ha : μ2 > μ1 ● One-tailed (alternative: less) ○ H0 : μ2 = μ1 Ha : μ2 < μ1 ● Same hypotheses as the t-test
  • 24. Vrije Universiteit Amsterdam 24 Ivano Malavolta / S2 group / Statistical tests and effect size Mann-Whitney test: example
  • 25. Vrije Universiteit Amsterdam 25 Ivano Malavolta / S2 group / Statistical tests and effect size One factor - 2 treatments - paired design
  • 26. Vrije Universiteit Amsterdam 26 Ivano Malavolta / S2 group / Statistical tests and effect size Paired t-Test Goal: compare independent samples from repeated measures ○ Each subject receives different treatments ○ We focus on the differences exhibited by subjects with different treatments ○ Samples must be equals in size Parametric Hypotheses: ● Two-tailed ○ H0 : μd = 0 Ha : μd ≠ 0 ● One-tailed (alternative: greater) ○ H0 : μd = 0 Ha : μd > 0 ● One-tailed (alternative: less) ○ H0 : μd = 0 Ha : μd < 0
  • 27. Vrije Universiteit Amsterdam 27 Ivano Malavolta / S2 group / Statistical tests and effect size Paired t-Test: example
  • 28. Vrije Universiteit Amsterdam 28 Ivano Malavolta / S2 group / Statistical tests and effect size Wilcoxon signed-rank test Goal: compare independent samples from repeated measures ● It can be used instead of the paired t-test in case of not normal data Non-parametric ● Same hypotheses as the paired t-test Hypotheses: ● Two-tailed ○ H0 : μd = 0 Ha : μd ≠ 0 ● One-tailed (alternative: greater) ○ H0 : μd = 0 Ha : μd > 0 ● One-tailed (alternative: less) ○ H0 : μd = 0 Ha : μd < 0
  • 29. Vrije Universiteit Amsterdam 29 Ivano Malavolta / S2 group / Statistical tests and effect size Wilcoxon signed-rank test: example
  • 30. Vrije Universiteit Amsterdam 30 Ivano Malavolta / S2 group / Statistical tests and effect size >=1 factors - >2 treatments
  • 31. Vrije Universiteit Amsterdam 31 Ivano Malavolta / S2 group / Statistical tests and effect size ANOVA (ANalysis Of VAriance) Goal: understand how much of the total variance is due to differences within factors, and how much is due to differences across factors ○ Many types of ANOVA tests ○ Works for many experiment designs Parametric Hypotheses: H0 : μ1 = μ2 = μ3 Ha : μ1 ≠ μ2 V μ1 ≠ μ3 V μ2 ≠ μ3
  • 32. Vrije Universiteit Amsterdam 32 F-statistic F = Variation among sample means / variation within the samples when H0 → F follows a known F-distribution ● the mean of the F-distribution tends to be 1 https://goo.gl/jHWo8A
  • 33. Vrije Universiteit Amsterdam 33 Significance F tends to be larger if H0 is false → the more F deviates from 1, the stronger the evidence for unequal population variances ● Methods to determine significance level: ○ textbook: compare F against a table of critical values (according to DF and α). If F > Fcritical , reject H0 ○ computer-based: compute the p-value of finding F greater than the observed value. If p < α, reject H0 Ivano Malavolta / S2 group / Statistical tests and effect size
  • 34. Vrije Universiteit Amsterdam 34 Types of ANOVA Ivano Malavolta / S2 group / Statistical tests and effect size ● One-way ANOVA ○ one factor, >2 treatments ○ if 2 treatments: equivalent to t-test (almost never used)
  • 35. Vrije Universiteit Amsterdam 35 Types of ANOVA Ivano Malavolta / S2 group / Statistical tests and effect size ● Factorial ANOVA ○ 2 (two-way) or more factors ○ any number of treatments ○ also computes interactions
  • 36. Vrije Universiteit Amsterdam 36 How to know which treatments really differ? Tukey’s test
  • 37. Vrije Universiteit Amsterdam 37 Types of ANOVA ● Repeated Measures ANOVA (rANOVA) ○ computes changes of a treatment over time ○ used when applying each treatment to the same subjects (within-subject comparison)
  • 38. Vrije Universiteit Amsterdam 38 Types of ANOVA ● Multivariate ANOVA (mANOVA) ○ more than one outcome (dependent variable) ○ simultaneous analysis Individual variables Combined
  • 39. Vrije Universiteit Amsterdam 39 ANOVA assumptions Ivano Malavolta / S2 group / Statistical tests and effect size ● The dependent variable should be continuous ● Samples must be independent ● Normal distribution of the dependent variable between the groups (approximately) ● Errors (aka residuals) should be normally distributed ○ qqPlot(residuals(myData.aov)) ● Homoscedasticity ○ variance between groups should be the same ■ bartlett.test(x ~ y, data=myData) ■ leveneTest(x ~ y, data=myData) Assumptions violated → non-parametric alternative
  • 40. Vrije Universiteit Amsterdam 40 ANOVA: non-parametric alternative Ivano Malavolta / S2 group / Statistical tests and effect size ● Kruskal-Wallis: one-way non-parametric ANOVA ○ one factor, multiple treatments ○ no estimate of the treatment effect (due to ranking) Non-parametric
  • 41. Vrije Universiteit Amsterdam 41 Ivano Malavolta / S2 group / Statistical tests and effect size Main statistical tests You are measuring different subjects You are measuring the same subject against different treatments Use this in case the values of your dep. var are not normally distributed
  • 42. Vrije Universiteit Amsterdam 42 p-value corrections Ivano Malavolta / S2 group / Statistical tests and effect size
  • 43. Vrije Universiteit Amsterdam 43 Example Ivano Malavolta / S2 group / Statistical tests and effect size Dependent variable = energy consumption of the app Independent variables = ● A: Image encoding algorithm: {png, jpeg} ● B: Mobile device type: {high-end, low-end} ● C: Network conditions: {wifi, 3G} You perform 3 tests: ● t.test(A, B) ● t.test(A, C) ● t.test(B, C) P(at least one significant result) = 1 − P(no significant results) = 1 − (1 − 0.05)3 ≈ 0.15 → 15% chance of seeing relevant results, when there may be none
  • 44. Vrije Universiteit Amsterdam 44 The problem Ivano Malavolta / S2 group / Statistical tests and effect size Multiple tests → higher probability of getting (statistically significant ) results by chance → you have to adjust your α (it was 0.05) Two main correction techniques: ● Bonferroni ● Holm
  • 45. Vrije Universiteit Amsterdam 45 Bonferroni correction Supposing we are doing N tests, we can reject H0 if the p-values of those tests are below α/N We can reject the H0 if all tests provide a p-value < 0.05/3=0.016 → 0.016 is our new significance threshold!
  • 46. Vrije Universiteit Amsterdam Less stringent that Bonferroni’s Procedure: ● rank your p-values from the smallest to the largest ● multiply the first by N, the second by N-1, etc. ● a p-value is significant if after multiplied is <0.05 P-values of the tests: {0.01, 0.02, 0.03} Bonferroni: ● 0.01 * 3 = 0.03 ● 0.02 * 3 = 0.06 ● 0.03 * 3 = 0.09 46 Holm’s correction Ivano Malavolta / S2 group / Statistical tests and effect size Holm: ● 0.01 * 3 = 0.03 ● 0.02 * 2 = 0.04 ● 0.03 * 1 = 0.03
  • 47. Vrije Universiteit Amsterdam 47 Effect size Ivano Malavolta / S2 group / Statistical tests and effect size
  • 49. Vrije Universiteit Amsterdam 49 Effect size measures Ivano Malavolta / S2 group / Statistical tests and effect size ● Cohen’s d ○ parametric statistics ● Cliff’s delta ○ non-parametric statistics
  • 50. Vrije Universiteit Amsterdam 50 Cohen’s d Ivano Malavolta / S2 group / Statistical tests and effect size The magnitude of a main factor treatment effect on the dependent variable Where: ● x1 , x2 = the means of the two groups ● s = standard deviation ○ Pooled standard deviation for independent samples Values: 0 = full overlap 1 = 1-sigma distance between the means … 3 = 3-sigma distance → ~no overlap
  • 51. Vrije Universiteit Amsterdam 51 Cohen’s d in R Independent samples Dependent samples
  • 52. Vrije Universiteit Amsterdam 52 Cliff’s delta Ivano Malavolta / S2 group / Statistical tests and effect size Represents the degree of overlap between the two distributions of scores Where: ● xi … xj = the specific values of each sample ● m, n = the cardinalities of the two groups Values: 0 = full overlap +1 = all the values of one group > all the values of the other one ... -1 = the inverse
  • 55. Vrije Universiteit Amsterdam 55 What this lecture means to you?
  • 56. Vrije Universiteit Amsterdam 56 Ivano Malavolta / S2 group / Empirical software engineering Readings [1] Dybå, Tore, Vigdis By Kampenes, and Dag IK Sjøberg. "A systematic review of statistical power in software engineering experiments." Information and Software Technology 48.8 (2006): 745-755. Chapter 10 Part 3 Chapter 6
  • 57. Vrije Universiteit Amsterdam 57 Ivano Malavolta / S2 group / Empirical software engineering Some contents of lecture extracted from: ● Giuseppe Procaccianti’s lectures at VU Acknowledgements