SlideShare una empresa de Scribd logo
1 de 20
Will G Hopkins
Auckland University of Technology
Auckland NZ
Quantitative Data Analysis
Summarizing Data: variables; simple statistics; effect statistics
and statistical models; complex models.
Generalizing from Sample to Population: precision of estimate,
confidence limits, statistical significance, p value, errors.
Reference: Hopkins WG (2002). Quantitative data analysis (Slideshow).
Sportscience 6, sportsci.org/jour/0201/Quantitative_analysis.ppt (2046 words)
Summarizing Data
• Data are a bunch of values of one or more variables.
• A variable is something that has different values.
• Values can be numbers or names, depending on the variable:
• Numeric, e.g. weight
• Counting, e.g. number of injuries
• Ordinal, e.g. competitive level (values are numbers/names)
• Nominal, e.g. sex (values are names
• When values are numbers, visualize the distribution of all
values in stem and leaf plots or in a frequency histogram.
• Can also use normal probability plots to visualize how well
the values fit a normal distribution.
• When values are names, visualize the frequency of each value
with a pie chart or a just a list of values and frequencies.
• A statistic is a number summarizing a bunch of values.
• Simple or univariate statistics summarize values of one variable.
• Effect or outcome statistics summarize the relationship between
values of two or more variables.
• Simple statistics for numeric variables…
• Mean: the average
• Standard deviation: the typical variation
• Standard error of the mean: the typical variation in the mean with
repeated sampling
• Multiply by √(sample size) to convert to standard deviation.
• Use these also for counting and ordinal variables.
• Use median (middle value or 50th percentile) and quartiles (25th
and 75th percentiles) for grossly non-normally distributed data.
• Summarize these and other simple statistics visually with box
and whisker plots.
• Simple statistics for nominal variables
• Frequencies, proportions, or odds.
• Can also use these for ordinal variables.
• Effect statistics…
• Derived from statistical model (equation) of the form
Y (dependent) vs X (predictor or independent).
• Depend on type of Y and X . Main ones:
Y X Effect statisticsModel/Test
numeric numeric slope, intercept, correlationregression
numeric nominal
nominal nominal
nominal numeric
mean difference
frequency difference or ratio
frequency ratio per…
t test, ANOVA
chi-square
categorical
• Model: numeric vs numeric
e.g. body fat vs sum of skinfolds
• Model or test:
linear regression
• Effect statistics:
• slope and intercept
= parameters
• correlation coefficient or variance explained (= 100·correlation2
)
= measures of goodness of fit
• Other statistics:
• typical or standard error of the estimate
= residual error
= best measure of validity (with criterion variable on the Y axis)
sum skinfolds (mm)sum skinfolds (mm)
body fatbody fat
(%BM)(%BM)
• Model: numeric vs nominal
e.g. strength vs sex
• Model or test:
• t test (2 groups)
• 1-way ANOVA (>2 groups)
• Effect statistics:
• difference between means
expressed as raw difference, percent difference, or fraction of
the root mean square error (Cohen's effect-size statistic)
• variance explained or better √(variance explained/100)
= measures of goodness of fit
• Other statistics:
• root mean square error
= average standard deviation of the two groups
femalefemale malemale
strengthstrength
sexsex
• More on expressing the magnitude of the effect
• What often matters is the difference between means relative to
the standard deviation:
strength
females
males
Trivial effect:
strength
females
males
Very large effect:
• Fraction or multiple of a standard deviation is known as the
effect-size statistic (or Cohen's "d").
• Cohen suggested thresholds for correlations and effect sizes.
• Hopkins agrees with the thresholds for correlations but
suggests others for the effect size:
trivialtrivial smallsmall moderatemoderate largelarge very largevery large !!!!!!
0.10.1 0.30.3 0.50.5 0.70.700 0.90.9 11Hopkins:Hopkins:
Correlations
Cohen:Cohen: 0.10.1 0.30.3 0.50.500
0.20.2Hopkins:Hopkins: 0.60.6 1.21.2 2.02.000 4.04.0 ∞∞
Effect Sizes
0.20.2Cohen:Cohen: 0.50.5 0.80.800
• For studies of athletic performance, percent differences or
changes in the mean are better than Cohen effect sizes.
• Model: numeric vs nominal
(repeated measures)
e.g. strength vs trial
• Model or test:
• paired t test (2 trials)
• repeated-measures ANOVA with
one within-subject factor (>2 trials)
• Effect statistics:
• change in mean expressed as raw change, percent change, or
fraction of the pre standard deviation
• Other statistics:
• within-subject standard deviation (not visible on above plot)
= typical error: conveys error of measurement
– useful to gauge reliability, individual responses, and
magnitude of effects (for measures of athletic performance).
pre post
strengthstrength
trial
• Model: nominal vs nominal
e.g. sport vs sex
• Model or test:
• chi-squared test or
contingency table
• Effect statistics:
• Relative frequencies, expressed
as a difference in frequencies,
ratio of frequencies (relative risk),
or ratio of odds (odds ratio)
• Relative risk is appropriate for cross-sectional or prospective
designs.
– risk of having rugby disease for males relative to females is
(75/100)/(30/100) = 2.5
• Odds ratio is appropriate for case-control designs.
– calculated as (75/25)/(30/70) = 7.0
femalesfemales malesmales
30%
75%
rugby yesrugby yes
rugby norugby no
• Model: nominal vs numeric
e.g. heart disease vs age
• Model or test:
• categorical modeling
• Effect statistics:
• relative risk or odds ratio
per unit of the numeric variable
(e.g., 2.3 per decade)
• Model: ordinal or counts vs whatever
• Can sometimes be analyzed as numeric variables using
regression or t tests
• Otherwise logistic regression or generalized linear modeling
• Complex models
• Most reducible to t tests, regression, or relative frequencies.
• Example…
age (y)age (y)
heartheart
diseasedisease
(%)(%)
00
100100
3030 5050 7070
• Model: controlled trial
(numeric vs 2 nominals)
e.g. strength vs trial vs group
• Model or test:
• unpaired t test of
change scores (2 trials, 2 groups)
• repeated-measures ANOVA with
within- and between-subject factors
(>2 trials or groups)
• Note: use line diagram, not bar graph, for repeated measures.
• Effect statistics:
• difference in change in mean expressed as raw difference,
percent difference, or fraction of the pre standard deviation
• Other statistics:
• standard deviation representing individual responses (derived
from within-subject standard deviations in the two groups)
pre post
strengthstrength
trial
drug
placebo
• Model: extra predictor variable to "control for something"
e.g. heart disease vs physical activity vs age
• Can't reduce to anything simpler.
• Model or test:
• multiple linear regression or analysis of covariance (ANCOVA)
• Equivalent to the effect of physical activity with everyone at the
same age.
• Reduction in the effect of physical activity on disease when
age is included implies age is at least partly the reason or
mechanism for the effect.
• Same analysis gives the effect of age with everyone at same
level of physical activity.
• Can use special analysis (mixed modeling) to include a
mechanism variable in a repeated-measures model. See
separate presentation at newstats.org.
• Problem: some models don't fit uniformly for different subjects
• That is, between- or within-subject standard deviations differ
between some subjects.
• Equivalently, the residuals are non-uniform (have different
standard deviations for different subjects).
• Determine by examining standard deviations or plots of
residuals vs predicteds.
• Non-uniformity makes p values and confidence limits wrong.
• How to fix…
• Use unpaired t test for groups with unequal variances, or…
• Try taking log of dependent variable before analyzing, or…
• Find some other transformation. As a last resort…
• Use rank transformation: convert dependent variable to
ranks before analyzing (= non-parametric analysis–same as
Wilcoxon, Kruskal-Wallis and other tests).
Generalizing from a Sample to a Population
• You study a sample to find out about the population.
• The value of a statistic for a sample is only an estimate of the
true (population) value.
• Express precision or uncertainty in true value using 95%
confidence limits.
• Confidence limits represent likely range of the true value.
• They do NOT represent a range of values in different subjects.
• There's a 5% chance the true value is outside the 95%
confidence interval: the Type 0 error rate.
• Interpret the observed value and the confidence limits as
clinically or practically beneficial, trivial, or harmful.
• Even better, work out the probability that the effect is clinically or
practically beneficial/trivial/harmful. See sportsci.org.
• Statistical significance is an old-fashioned way of
generalizing, based on testing whether the true value could be
zero or null.
• Assume the null hypothesis: that the true value is zero (null).
• If your observed value falls in a region of extreme values that
would occur only 5% of the time, you reject the null hypothesis.
• That is, you decide that the true value is unlikely to be zero;
you can state that the result is statistically significant at the 5%
level.
• If the observed value does not fall in the 5% unlikely region,
most people mistakenly accept the null hypothesis: they
conclude that the true value is zero or null!
• The p value helps you decide whether your result falls in the
unlikely region.
• If p<0.05, your result is in the unlikely region.
• One meaning of the p value: the probability of a more extreme
observed value (positive or negative) when true value is zero.
• Better meaning of the p value: if you observe a positive effect,
1 - p/2 is the chance the true value is positive, and p/2 is the
chance the true value is negative. Ditto for a negative effect.
• Example: you observe a 1.5% enhancement of performance
(p=0.08). Therefore there is a 96% chance that the true effect
is any "enhancement" and a 4% chance that the true effect is
any "impairment".
• This interpretation does not take into account trivial
enhancements and impairments.
• Therefore, if you must use p values, show exact values, not
p<0.05 or p>0.05.
• Meta-analysts also need the exact p value (or confidence
limits).
• If the true value is zero, there's a 5% chance of getting
statistical significance: the Type I error rate, or rate of false
positives or false alarms.
• There's also a chance that the smallest worthwhile true value
will produce an observed value that is not statistically
significant: the Type II error rate, or rate of false negatives or
failed alarms.
• In the old-fashioned approach to research design, you are
supposed to have enough subjects to make a Type II error
rate of 20%: that is, your study is supposed to have a power
of 80% to detect the smallest worthwhile effect.
• If you look at lots of effects in a study, there's an increased
chance being wrong about at least one of them.
• Old-fashioned statisticians like to control this inflation of
the Type I error rate within an ANOVA to make sure the
increased chance is kept to 5%. This approach is misguided.
• The standard error of the mean (typical variation in the mean
from sample to sample) can convey statistical significance.
• Non-overlap of the error bars of two groups implies a
statistically significant difference, but only for groups of equal
size (e.g. males vs females).
• In particular, non-overlap does NOT convey statistical
significance in experiments:
what-
ever
postpre
High reliability
p = 0.003
postpre
Mean ± SEM
in both cases
postpre
Low reliability
p = 0.2
• In summary
• If you must use statistical significance, show exact p values.
• Better still, show confidence limits instead.
• NEVER show the standard error of the mean!
• Show the usual between-subject standard deviation to convey
the spread between subjects.
• In population studies, this standard deviation helps convey
magnitude of differences or changes in the mean.
• In interventions, show also the within-subject standard deviation
(the typical error) to convey precision of measurement.
• In athlete studies, this standard deviation helps convey
magnitude of differences or changes in mean performance.

Más contenido relacionado

La actualidad más candente

Commonly used statistical tests in research
Commonly used statistical tests in researchCommonly used statistical tests in research
Commonly used statistical tests in researchNaqeeb Ullah Khan
 
Introduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsIntroduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsJezhabeth Villegas
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsSarfraz Ahmad
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distributionswarna dey
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statisticsewhite00
 
Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data AnalysisAsma Muhamad
 
Non parametric test
Non parametric testNon parametric test
Non parametric testNeetathakur3
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and StatisticsT.S. Lim
 
Parametric vs Nonparametric Tests: When to use which
Parametric vs Nonparametric Tests: When to use whichParametric vs Nonparametric Tests: When to use which
Parametric vs Nonparametric Tests: When to use whichGönenç Dalgıç
 
Quantitative data analysis
Quantitative data analysisQuantitative data analysis
Quantitative data analysisRonaldLucasia1
 
Types of error in hypothesis
Types of error in hypothesisTypes of error in hypothesis
Types of error in hypothesisShaifali Pandey
 
Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypothesesRajThakuri
 

La actualidad más candente (20)

Chapter 10-DATA ANALYSIS & PRESENTATION
Chapter 10-DATA ANALYSIS & PRESENTATIONChapter 10-DATA ANALYSIS & PRESENTATION
Chapter 10-DATA ANALYSIS & PRESENTATION
 
Commonly used statistical tests in research
Commonly used statistical tests in researchCommonly used statistical tests in research
Commonly used statistical tests in research
 
Introduction to Probability and Probability Distributions
Introduction to Probability and Probability DistributionsIntroduction to Probability and Probability Distributions
Introduction to Probability and Probability Distributions
 
Data presentation 2
Data presentation 2Data presentation 2
Data presentation 2
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Quantitative Data Analysis
Quantitative Data AnalysisQuantitative Data Analysis
Quantitative Data Analysis
 
Data analysis
Data analysisData analysis
Data analysis
 
Hypothesis PPT
Hypothesis PPTHypothesis PPT
Hypothesis PPT
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
Data Analysis and Statistics
Data Analysis and StatisticsData Analysis and Statistics
Data Analysis and Statistics
 
Parametric vs Nonparametric Tests: When to use which
Parametric vs Nonparametric Tests: When to use whichParametric vs Nonparametric Tests: When to use which
Parametric vs Nonparametric Tests: When to use which
 
DATA Types
DATA TypesDATA Types
DATA Types
 
Quantitative data analysis
Quantitative data analysisQuantitative data analysis
Quantitative data analysis
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Types of error in hypothesis
Types of error in hypothesisTypes of error in hypothesis
Types of error in hypothesis
 
Data and its Types
Data and its TypesData and its Types
Data and its Types
 
Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypotheses
 
statistic
statisticstatistic
statistic
 

Similar a Quantitative analysis

Quantitative_analysis.ppt
Quantitative_analysis.pptQuantitative_analysis.ppt
Quantitative_analysis.pptmousaderhem1
 
1. complete stats notes
1. complete stats notes1. complete stats notes
1. complete stats notesBob Smullen
 
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdfBASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdfAdamu Mohammad
 
Data Ananlysis lecture 7 Simon Fraser University
Data Ananlysis lecture 7 Simon Fraser UniversityData Ananlysis lecture 7 Simon Fraser University
Data Ananlysis lecture 7 Simon Fraser Universitysoniyamarghani
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in StatisticsVikash Keshri
 
Statistics basics for oncologist kiran
Statistics basics for oncologist kiranStatistics basics for oncologist kiran
Statistics basics for oncologist kiranKiran Ramakrishna
 
COM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptxCOM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptxAkinsolaAyomidotun
 
Statistical-Tests-and-Hypothesis-Testing.pptx
Statistical-Tests-and-Hypothesis-Testing.pptxStatistical-Tests-and-Hypothesis-Testing.pptx
Statistical-Tests-and-Hypothesis-Testing.pptxCHRISTINE MAY CERDA
 
Summary of statistical tools used in spss
Summary of statistical tools used in spssSummary of statistical tools used in spss
Summary of statistical tools used in spssSubodh Khanal
 
Malimu statistical significance testing.
Malimu statistical significance testing.Malimu statistical significance testing.
Malimu statistical significance testing.Miharbi Ignasm
 
5. testing differences
5. testing differences5. testing differences
5. testing differencesSteve Saffhill
 
Research method ch07 statistical methods 1
Research method ch07 statistical methods 1Research method ch07 statistical methods 1
Research method ch07 statistical methods 1naranbatn
 
ststs nw.pptx
ststs nw.pptxststs nw.pptx
ststs nw.pptxMrymNb
 
Parametric tests seminar
Parametric tests seminarParametric tests seminar
Parametric tests seminardrdeepika87
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethicsAbhishek Thakur
 

Similar a Quantitative analysis (20)

Quantitative_analysis.ppt
Quantitative_analysis.pptQuantitative_analysis.ppt
Quantitative_analysis.ppt
 
1. complete stats notes
1. complete stats notes1. complete stats notes
1. complete stats notes
 
Validity andreliability
Validity andreliabilityValidity andreliability
Validity andreliability
 
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdfBASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
BASIC STATISTICS AND THEIR INTERPRETATION AND USE IN EPIDEMIOLOGY 050822.pdf
 
Data Ananlysis lecture 7 Simon Fraser University
Data Ananlysis lecture 7 Simon Fraser UniversityData Ananlysis lecture 7 Simon Fraser University
Data Ananlysis lecture 7 Simon Fraser University
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in Statistics
 
Statistics basics for oncologist kiran
Statistics basics for oncologist kiranStatistics basics for oncologist kiran
Statistics basics for oncologist kiran
 
COM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptxCOM 201_Inferential Statistics_18032022.pptx
COM 201_Inferential Statistics_18032022.pptx
 
Statistical-Tests-and-Hypothesis-Testing.pptx
Statistical-Tests-and-Hypothesis-Testing.pptxStatistical-Tests-and-Hypothesis-Testing.pptx
Statistical-Tests-and-Hypothesis-Testing.pptx
 
Summary of statistical tools used in spss
Summary of statistical tools used in spssSummary of statistical tools used in spss
Summary of statistical tools used in spss
 
Chapter34
Chapter34Chapter34
Chapter34
 
Malimu statistical significance testing.
Malimu statistical significance testing.Malimu statistical significance testing.
Malimu statistical significance testing.
 
5. testing differences
5. testing differences5. testing differences
5. testing differences
 
Research method ch07 statistical methods 1
Research method ch07 statistical methods 1Research method ch07 statistical methods 1
Research method ch07 statistical methods 1
 
LR 9 Estimation.pdf
LR 9 Estimation.pdfLR 9 Estimation.pdf
LR 9 Estimation.pdf
 
ststs nw.pptx
ststs nw.pptxststs nw.pptx
ststs nw.pptx
 
Stat topics
Stat topicsStat topics
Stat topics
 
Parametric tests seminar
Parametric tests seminarParametric tests seminar
Parametric tests seminar
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethics
 
Data science
Data scienceData science
Data science
 

Último

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...gajnagarg
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...gajnagarg
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...amitlee9823
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...gajnagarg
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...amitlee9823
 

Último (20)

Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Nandini Layout ☎ 7737669865 🥵 Book Your One night Stand
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
Just Call Vip call girls kakinada Escorts ☎️9352988975 Two shot with one girl...
 
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
Just Call Vip call girls Erode Escorts ☎️9352988975 Two shot with one girl (E...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
Just Call Vip call girls Mysore Escorts ☎️9352988975 Two shot with one girl (...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 

Quantitative analysis

  • 1. Will G Hopkins Auckland University of Technology Auckland NZ Quantitative Data Analysis Summarizing Data: variables; simple statistics; effect statistics and statistical models; complex models. Generalizing from Sample to Population: precision of estimate, confidence limits, statistical significance, p value, errors. Reference: Hopkins WG (2002). Quantitative data analysis (Slideshow). Sportscience 6, sportsci.org/jour/0201/Quantitative_analysis.ppt (2046 words)
  • 2. Summarizing Data • Data are a bunch of values of one or more variables. • A variable is something that has different values. • Values can be numbers or names, depending on the variable: • Numeric, e.g. weight • Counting, e.g. number of injuries • Ordinal, e.g. competitive level (values are numbers/names) • Nominal, e.g. sex (values are names • When values are numbers, visualize the distribution of all values in stem and leaf plots or in a frequency histogram. • Can also use normal probability plots to visualize how well the values fit a normal distribution. • When values are names, visualize the frequency of each value with a pie chart or a just a list of values and frequencies.
  • 3. • A statistic is a number summarizing a bunch of values. • Simple or univariate statistics summarize values of one variable. • Effect or outcome statistics summarize the relationship between values of two or more variables. • Simple statistics for numeric variables… • Mean: the average • Standard deviation: the typical variation • Standard error of the mean: the typical variation in the mean with repeated sampling • Multiply by √(sample size) to convert to standard deviation. • Use these also for counting and ordinal variables. • Use median (middle value or 50th percentile) and quartiles (25th and 75th percentiles) for grossly non-normally distributed data. • Summarize these and other simple statistics visually with box and whisker plots.
  • 4. • Simple statistics for nominal variables • Frequencies, proportions, or odds. • Can also use these for ordinal variables. • Effect statistics… • Derived from statistical model (equation) of the form Y (dependent) vs X (predictor or independent). • Depend on type of Y and X . Main ones: Y X Effect statisticsModel/Test numeric numeric slope, intercept, correlationregression numeric nominal nominal nominal nominal numeric mean difference frequency difference or ratio frequency ratio per… t test, ANOVA chi-square categorical
  • 5. • Model: numeric vs numeric e.g. body fat vs sum of skinfolds • Model or test: linear regression • Effect statistics: • slope and intercept = parameters • correlation coefficient or variance explained (= 100·correlation2 ) = measures of goodness of fit • Other statistics: • typical or standard error of the estimate = residual error = best measure of validity (with criterion variable on the Y axis) sum skinfolds (mm)sum skinfolds (mm) body fatbody fat (%BM)(%BM)
  • 6. • Model: numeric vs nominal e.g. strength vs sex • Model or test: • t test (2 groups) • 1-way ANOVA (>2 groups) • Effect statistics: • difference between means expressed as raw difference, percent difference, or fraction of the root mean square error (Cohen's effect-size statistic) • variance explained or better √(variance explained/100) = measures of goodness of fit • Other statistics: • root mean square error = average standard deviation of the two groups femalefemale malemale strengthstrength sexsex
  • 7. • More on expressing the magnitude of the effect • What often matters is the difference between means relative to the standard deviation: strength females males Trivial effect: strength females males Very large effect:
  • 8. • Fraction or multiple of a standard deviation is known as the effect-size statistic (or Cohen's "d"). • Cohen suggested thresholds for correlations and effect sizes. • Hopkins agrees with the thresholds for correlations but suggests others for the effect size: trivialtrivial smallsmall moderatemoderate largelarge very largevery large !!!!!! 0.10.1 0.30.3 0.50.5 0.70.700 0.90.9 11Hopkins:Hopkins: Correlations Cohen:Cohen: 0.10.1 0.30.3 0.50.500 0.20.2Hopkins:Hopkins: 0.60.6 1.21.2 2.02.000 4.04.0 ∞∞ Effect Sizes 0.20.2Cohen:Cohen: 0.50.5 0.80.800 • For studies of athletic performance, percent differences or changes in the mean are better than Cohen effect sizes.
  • 9. • Model: numeric vs nominal (repeated measures) e.g. strength vs trial • Model or test: • paired t test (2 trials) • repeated-measures ANOVA with one within-subject factor (>2 trials) • Effect statistics: • change in mean expressed as raw change, percent change, or fraction of the pre standard deviation • Other statistics: • within-subject standard deviation (not visible on above plot) = typical error: conveys error of measurement – useful to gauge reliability, individual responses, and magnitude of effects (for measures of athletic performance). pre post strengthstrength trial
  • 10. • Model: nominal vs nominal e.g. sport vs sex • Model or test: • chi-squared test or contingency table • Effect statistics: • Relative frequencies, expressed as a difference in frequencies, ratio of frequencies (relative risk), or ratio of odds (odds ratio) • Relative risk is appropriate for cross-sectional or prospective designs. – risk of having rugby disease for males relative to females is (75/100)/(30/100) = 2.5 • Odds ratio is appropriate for case-control designs. – calculated as (75/25)/(30/70) = 7.0 femalesfemales malesmales 30% 75% rugby yesrugby yes rugby norugby no
  • 11. • Model: nominal vs numeric e.g. heart disease vs age • Model or test: • categorical modeling • Effect statistics: • relative risk or odds ratio per unit of the numeric variable (e.g., 2.3 per decade) • Model: ordinal or counts vs whatever • Can sometimes be analyzed as numeric variables using regression or t tests • Otherwise logistic regression or generalized linear modeling • Complex models • Most reducible to t tests, regression, or relative frequencies. • Example… age (y)age (y) heartheart diseasedisease (%)(%) 00 100100 3030 5050 7070
  • 12. • Model: controlled trial (numeric vs 2 nominals) e.g. strength vs trial vs group • Model or test: • unpaired t test of change scores (2 trials, 2 groups) • repeated-measures ANOVA with within- and between-subject factors (>2 trials or groups) • Note: use line diagram, not bar graph, for repeated measures. • Effect statistics: • difference in change in mean expressed as raw difference, percent difference, or fraction of the pre standard deviation • Other statistics: • standard deviation representing individual responses (derived from within-subject standard deviations in the two groups) pre post strengthstrength trial drug placebo
  • 13. • Model: extra predictor variable to "control for something" e.g. heart disease vs physical activity vs age • Can't reduce to anything simpler. • Model or test: • multiple linear regression or analysis of covariance (ANCOVA) • Equivalent to the effect of physical activity with everyone at the same age. • Reduction in the effect of physical activity on disease when age is included implies age is at least partly the reason or mechanism for the effect. • Same analysis gives the effect of age with everyone at same level of physical activity. • Can use special analysis (mixed modeling) to include a mechanism variable in a repeated-measures model. See separate presentation at newstats.org.
  • 14. • Problem: some models don't fit uniformly for different subjects • That is, between- or within-subject standard deviations differ between some subjects. • Equivalently, the residuals are non-uniform (have different standard deviations for different subjects). • Determine by examining standard deviations or plots of residuals vs predicteds. • Non-uniformity makes p values and confidence limits wrong. • How to fix… • Use unpaired t test for groups with unequal variances, or… • Try taking log of dependent variable before analyzing, or… • Find some other transformation. As a last resort… • Use rank transformation: convert dependent variable to ranks before analyzing (= non-parametric analysis–same as Wilcoxon, Kruskal-Wallis and other tests).
  • 15. Generalizing from a Sample to a Population • You study a sample to find out about the population. • The value of a statistic for a sample is only an estimate of the true (population) value. • Express precision or uncertainty in true value using 95% confidence limits. • Confidence limits represent likely range of the true value. • They do NOT represent a range of values in different subjects. • There's a 5% chance the true value is outside the 95% confidence interval: the Type 0 error rate. • Interpret the observed value and the confidence limits as clinically or practically beneficial, trivial, or harmful. • Even better, work out the probability that the effect is clinically or practically beneficial/trivial/harmful. See sportsci.org.
  • 16. • Statistical significance is an old-fashioned way of generalizing, based on testing whether the true value could be zero or null. • Assume the null hypothesis: that the true value is zero (null). • If your observed value falls in a region of extreme values that would occur only 5% of the time, you reject the null hypothesis. • That is, you decide that the true value is unlikely to be zero; you can state that the result is statistically significant at the 5% level. • If the observed value does not fall in the 5% unlikely region, most people mistakenly accept the null hypothesis: they conclude that the true value is zero or null! • The p value helps you decide whether your result falls in the unlikely region. • If p<0.05, your result is in the unlikely region.
  • 17. • One meaning of the p value: the probability of a more extreme observed value (positive or negative) when true value is zero. • Better meaning of the p value: if you observe a positive effect, 1 - p/2 is the chance the true value is positive, and p/2 is the chance the true value is negative. Ditto for a negative effect. • Example: you observe a 1.5% enhancement of performance (p=0.08). Therefore there is a 96% chance that the true effect is any "enhancement" and a 4% chance that the true effect is any "impairment". • This interpretation does not take into account trivial enhancements and impairments. • Therefore, if you must use p values, show exact values, not p<0.05 or p>0.05. • Meta-analysts also need the exact p value (or confidence limits).
  • 18. • If the true value is zero, there's a 5% chance of getting statistical significance: the Type I error rate, or rate of false positives or false alarms. • There's also a chance that the smallest worthwhile true value will produce an observed value that is not statistically significant: the Type II error rate, or rate of false negatives or failed alarms. • In the old-fashioned approach to research design, you are supposed to have enough subjects to make a Type II error rate of 20%: that is, your study is supposed to have a power of 80% to detect the smallest worthwhile effect. • If you look at lots of effects in a study, there's an increased chance being wrong about at least one of them. • Old-fashioned statisticians like to control this inflation of the Type I error rate within an ANOVA to make sure the increased chance is kept to 5%. This approach is misguided.
  • 19. • The standard error of the mean (typical variation in the mean from sample to sample) can convey statistical significance. • Non-overlap of the error bars of two groups implies a statistically significant difference, but only for groups of equal size (e.g. males vs females). • In particular, non-overlap does NOT convey statistical significance in experiments: what- ever postpre High reliability p = 0.003 postpre Mean ± SEM in both cases postpre Low reliability p = 0.2
  • 20. • In summary • If you must use statistical significance, show exact p values. • Better still, show confidence limits instead. • NEVER show the standard error of the mean! • Show the usual between-subject standard deviation to convey the spread between subjects. • In population studies, this standard deviation helps convey magnitude of differences or changes in the mean. • In interventions, show also the within-subject standard deviation (the typical error) to convey precision of measurement. • In athlete studies, this standard deviation helps convey magnitude of differences or changes in mean performance.