SlideShare a Scribd company logo
1 of 43
Non-parametric statistics
Anchal, BalRam, Kush
Environment Management 2016
USEM
Learning objectives
Compare and contrast parametric and nonparametric tests
Perform and interpret the Mann Whitney U Test
Perform and interpret the Sign test and Wilcoxon Signed Rank
Test
Compare and contrast the Sign test and Wilcoxon Signed Rank
Test
Perform and interpret the Kruskal Wallis test
INFERENTIAL STATISTICS
PARAMETRIC
STATISTICS
NON-
PARAMETRIC
STATISTICS
Parametric Non-parametric
Assumed distribution normal any
Typical data Ratio or interval Nominal or ordinal
Usual central measures mean Median
Benefits Can draw many
conclusions
Simplicity less affected by
outliers
Tests
Independent measures, 2
groups
Independent measure t
test
Mann- whitney test
Independent measures, >2
groups
One way independent
measures ANOVA
Kruskal wallis test
Repeated measures, 2
conditions
Matched pair t-test Wilcoxon test
Table: Distinguish between parametric and non parametric statistics
Nonparametric tests
• Also known as distribution-free tests because they are based on
fewer assumptions (e.g., they do not assume that the outcome is
approximately normally distributed).
• Parametric tests involve specific probability distributions (e.g., the
normal distribution) and the tests involve estimation of the key
parameters of that distribution (e.g., the mean or difference in
means) from the sample data.
• There are some situations when it is clear that the outcome does
not follow a normal distribution. These include:
when the outcome is an ordinal variable or a rank,
when there are definite outliers or
when the outcome has clear limits of detection.
Non-parametric Methods
• Sign Test
• Wilcoxon Signed-Rank Test
• Mann-Whitney-Wilcoxon Test
• Kruskal-Wallis Test
How to assign the ranks
Ordered observed data 0 2 3 5 7 9
ranks 1 2 3 4 5 6
Ordered observed data 0 2 3 7 7 9
ranks 1 2 3 4.5 4.5 6
Ordered observed data 0 2 3 7 7 7
ranks 1 2 3 5 5 5
1. Sign Test
• A common application of the sign test involves using a sample
of n potential customers to identify a preference for one of
two brands of a product.
• The objective is to determine whether there is a difference in
preference between the two items being compared.
• To record the preference data, we use a plus sign if the
individual prefers one brand and a minus sign if the individual
prefers the other brand.
• Because the data are recorded as plus and minus signs, this
test is called the sign test.
Example: Butter Taste Test
• Sign Test: Large-Sample Case
As part of a market research study, a sample of 36
consumers were asked to taste two brands of butter and
indicate a preference. Do the data shown below indicate a
significant difference in the consumer preferences for the
two brands?
18 preferred Amul Butter (+ sign recorded)
12 preferred Mother Dairy Butter (_ sign recorded)
6 had no preference
The analysis is based on a sample size of 18 + 12 = 30.
Example: Butter Taste Test
H0: No preference for one brand over the other exists
Ha: A preference for one brand over the other exists
• Sampling Distribution
2.74
Sampling distribution
of the number of “+”
values if there is no
brand preference
 =30/2 =15
Example: Butter Taste Test
• Rejection Rule
Using 0.05 level of significance,
Reject H0 if z < -1.96 or z > 1.96
• Test Statistic
z = (18 - 15)/2.74 = 3/2.74 = 1.095
• Conclusion
Do not reject H0. There is insufficient evidence in the
sample to conclude that a difference in preference exists for
the two brands of butter.
2. Wilcoxon Signed-Rank Test
• The Wilcoxon test is used when we are unwilling to make assumptions
about the form of the underlying population probability distributions,
• but we want compare paired samples.
• Analogous to the dependent t-test we are interested in the difference
• in two measurements taken from each person.
• The rank sum of the positive (T+) and negative (T−) differences are
calculated, the smallest of these is used as the test statistic to test the
hypothesis.
• Two assumptions underlie the use of this technique.
1. The paired data are selected randomly.
2. The underlying distributions are symmetrical.
Small-Sample Case (n ≤15)
• When sample size is small, a critical value against which to
compare T can be found by table, to determine whether the null
hypothesis should be rejected. The critical value is located by using
n and α.
• If the observed value of T is less than or equal to the critical value
of T, the decision is to reject the null hypothesis.
Problem: The survey by American Demographics estimated the
average annual household spending on healthcare. The U.S.
metropolitan average was $1,800. Suppose six families in
Pittsburgh, Pennsylvania, are matched demographically with six
families in Oakland, California, and their amounts of household
spending on healthcare for last year are obtained. The data follow
on the next page.
A healthcare analyst uses α = 0.05 to test to determine whether
there is a significant difference in annual household healthcare
spending between these two cities.
STEP 1. The following hypotheses are being tested.
H0: Md= 0
Ha: Md≠ 0
STEP 2. Because the sample size of pairs is six, the small-
sample Wilcoxon matched pairs signed ranks test is
appropriate if the underlying distributions are assumed to
be symmetrical.
Step 3. calculated T = 3 is greater than critical T = 1 (at α=0.05,
the decision is to accept the null hypothesis.
Conclusion: there is no significant difference in annual
household healthcare spending between Pittsburgh and
Oakland.
Large-Sample Case (n >15)
For large samples, the T statistic is approximately normally
distributed and a z score can be used as the test statistic.
Problem: Suppose a company implemented a quality-
control program and has been operating under it for 2
years. The company’s president wants to determine
whether worker productivity significantly increased since
installation of the program. Use a non parametric statistics
for the following data at α=0.01.
STEP 1. The following hypotheses are being tested.
H0: Md= 0
Ha: Md≠ 0
STEP 2. Wilcoxon matched-pairs signed rank test to be
applied on the data to test the difference in productivity from
before to after.
STEP 3. Computes the difference values and because zero
differences are to be eliminated, deletes worker 3 from the
study. This reduces n from 20 to 19, then ranks the
differences regardless of sign.
STEP 4. This test is one tailed. The critical value is z =-2.33. Here calculate
z is less than critical z, hence we reject the null hypothesis.
Conclusion: The productivity is significantly greater after the
implementation of quality control at this company.
3. Mann-Whitney-Wilcoxon Test
(U test)
• Also known as Wilcoxon rank sum test.
• It is a nonparametric counterpart of the t test used to compare the
means of two independent populations.
• The following assumptions underlie the use of the Mann-Whitney
U test.
1. The samples are independent.
2. The level of data is at least ordinal.
• The two-tailed hypotheses being tested with the Mann-Whitney U
test are as follows.
H0: The two populations are identical.
Ha: The two populations are not identical.
Mann-Whitney-Wilcoxon Test
(U test)
• Computation of the U test begins by arbitrarily designating two
samples as group 1 and group 2. The data from the two groups are
combined into one group, with each data value retaining a group
identifier of its original group. The pooled values are then ranked
from 1 to n.
• W 1 and W2 are the sum of the ranks of values from group 1 and
group 2 respectively.
• Small sample case: when both n1 and n2 ≤10
• Large sample case: when both n1 and n2 >10
Small sample case
• The test statistic is the smallest of these two U values.
• Determine the p-value for a U from the table.
• Note: For a two-tailed test, double the p-value shown in the table.
• Because U2 is the smaller value of U, we use U = 3
as the test statistic. Because it is the smallest size,
let n1= 7; n2= 8.
• STEP 7. from table find a p-value of 0.0011. Because
this test is two tailed, we double the table p-value,
producing a final p-value of .0022. Because the p-
value is less than α = 0.05, the null hypothesis is
rejected.
• The statistical conclusion is that the populations are
not identical.
Large-Sample Case
• For large sample sizes, the value of U is approximately normally
distributed and hence compute a z score for the U value. A
decision is then made whether to reject the null hypothesis. A z
score can be calculated from U by the following formulas.
4. Kruskal Wallis Test
• Like the one-way analysis of variance, the Kruskal-Wallis test is
used to determine whether c ≥3 samples come from the same or
different populations.
• The Kruskal-Wallis test is based on the assumption that the c
groups are independent and that individual items are selected
randomly. The hypotheses tested by the Kruskal-Wallis test follow.
H0 :The c populations are identical.
Ha: At least one of the c populations is different.
Kruskal Wallis Test
o This test determines whether all of the groups come from the
same or equal populations or whether at least one group comes
from a different population.
o The process of computing a Kruskal-Wallis K statistic begins with
ranking the data in all the groups together, as though they were
from one group.
Start
Are the samples
independent? Use Mann
whitney U test
Are the data
atleast interval ?
Use sign test
Use Wilcoxon
signed rank test
yes
yes
no
no
Deciding which test to use
Advantages of
Nonparametric Tests
• Used with all scales
• Easier to compute
— Developed originally before wide
computer use
• Make fewer assumptions
• Need not involve population
parameters
• Results may be as exact as
parametric procedures
.
Disadvantages of
Nonparametric Tests
• May waste information
— If data permit using parametric
procedures
— Example: converting data from
ratio to ordinal scale
• Difficult to compute by hand for
large samples
• Tables not widely available
.
• Jarkko Isotalo, Basics of Statistics (Available online at:
http://www.mv.helsinki.fi/home/jmisotal/BoS.pdf)
• Ken Black, 6th edition, Business Statistics For Contemporary Decision
Making
• Lisa Sullivan, Non parametric statistics, Boston University School of Public
Health (available online at:
http://sphweb.bumc.bu.edu/otlt/MPHModules/BS/BS704_Nonparametri
c/BS704_Nonparametric_print.html)
• Arora, P.N and Malhan P.K; Biostatistics, 2009 Edition
• http://blog.minitab.com/blog/adventures-in-statistics/choosing-
between-a-nonparametric-test-and-a-parametric-test
non parametric statistics

More Related Content

What's hot

What's hot (20)

The mann whitney u test
The mann whitney u testThe mann whitney u test
The mann whitney u test
 
wilcoxon signed rank test
wilcoxon signed rank testwilcoxon signed rank test
wilcoxon signed rank test
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
Parametric and non parametric test
Parametric and non parametric testParametric and non parametric test
Parametric and non parametric test
 
Null hypothesis
Null hypothesisNull hypothesis
Null hypothesis
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
 
Type I and Type II Errors in Research Methodology
Type I and Type II Errors in Research MethodologyType I and Type II Errors in Research Methodology
Type I and Type II Errors in Research Methodology
 
The Kruskal-Wallis H Test
The Kruskal-Wallis H TestThe Kruskal-Wallis H Test
The Kruskal-Wallis H Test
 
Statistical tests of significance and Student`s T-Test
Statistical tests of significance and Student`s T-TestStatistical tests of significance and Student`s T-Test
Statistical tests of significance and Student`s T-Test
 
NULL AND ALTERNATIVE HYPOTHESIS.pptx
NULL AND ALTERNATIVE HYPOTHESIS.pptxNULL AND ALTERNATIVE HYPOTHESIS.pptx
NULL AND ALTERNATIVE HYPOTHESIS.pptx
 
Mann Whitney U test
Mann Whitney U testMann Whitney U test
Mann Whitney U test
 
Parametric Statistical tests
Parametric Statistical testsParametric Statistical tests
Parametric Statistical tests
 
Non parametric tests
Non parametric testsNon parametric tests
Non parametric tests
 
3.1 non parametric test
3.1 non parametric test3.1 non parametric test
3.1 non parametric test
 
Sample Size Determination
Sample Size DeterminationSample Size Determination
Sample Size Determination
 
Sample and sample size
Sample and sample sizeSample and sample size
Sample and sample size
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
T test statistics
T test statisticsT test statistics
T test statistics
 
Tests of significance
Tests of significanceTests of significance
Tests of significance
 
Parametric and non parametric test in biostatistics
Parametric and non parametric test in biostatistics Parametric and non parametric test in biostatistics
Parametric and non parametric test in biostatistics
 

Viewers also liked (11)

Nonparametric tests
Nonparametric testsNonparametric tests
Nonparametric tests
 
Nonparametric statistics
Nonparametric statisticsNonparametric statistics
Nonparametric statistics
 
Intelligent agent
Intelligent agentIntelligent agent
Intelligent agent
 
AI: AI & Problem Solving
AI: AI & Problem SolvingAI: AI & Problem Solving
AI: AI & Problem Solving
 
AI: Planning and AI
AI: Planning and AIAI: Planning and AI
AI: Planning and AI
 
2-Agents- Artificial Intelligence
2-Agents- Artificial Intelligence2-Agents- Artificial Intelligence
2-Agents- Artificial Intelligence
 
AI: Learning in AI
AI: Learning in AI AI: Learning in AI
AI: Learning in AI
 
Planning
PlanningPlanning
Planning
 
Ai Slides
Ai SlidesAi Slides
Ai Slides
 
Artificial Intelligence
Artificial IntelligenceArtificial Intelligence
Artificial Intelligence
 
Artificial Intelligence Presentation
Artificial Intelligence PresentationArtificial Intelligence Presentation
Artificial Intelligence Presentation
 

Similar to non parametric statistics

Parametric tests seminar
Parametric tests seminarParametric tests seminar
Parametric tests seminar
drdeepika87
 
3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf
MitikuTeka1
 

Similar to non parametric statistics (20)

non parametric test.pptx
non parametric test.pptxnon parametric test.pptx
non parametric test.pptx
 
Non parametric-tests
Non parametric-testsNon parametric-tests
Non parametric-tests
 
Hypothesis Testing.pptx
Hypothesis Testing.pptxHypothesis Testing.pptx
Hypothesis Testing.pptx
 
UNIT 5.pptx
UNIT 5.pptxUNIT 5.pptx
UNIT 5.pptx
 
Parametric tests seminar
Parametric tests seminarParametric tests seminar
Parametric tests seminar
 
tests of significance
tests of significancetests of significance
tests of significance
 
Non parametric study; Statistical approach for med student
Non parametric study; Statistical approach for med student Non parametric study; Statistical approach for med student
Non parametric study; Statistical approach for med student
 
T test^jsample size^j ethics
T test^jsample size^j ethicsT test^jsample size^j ethics
T test^jsample size^j ethics
 
3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf3Nonparametric Tests power point presentationpdf
3Nonparametric Tests power point presentationpdf
 
biostat__final_ppt_unit_3.pptx
biostat__final_ppt_unit_3.pptxbiostat__final_ppt_unit_3.pptx
biostat__final_ppt_unit_3.pptx
 
Marketing Research Hypothesis Testing.pptx
Marketing Research Hypothesis Testing.pptxMarketing Research Hypothesis Testing.pptx
Marketing Research Hypothesis Testing.pptx
 
Friedman Test- A Presentation
Friedman Test- A PresentationFriedman Test- A Presentation
Friedman Test- A Presentation
 
T test
T test T test
T test
 
Non parametric
Non parametricNon parametric
Non parametric
 
Non parametric test
Non parametric testNon parametric test
Non parametric test
 
Parametric test
Parametric testParametric test
Parametric test
 
Amrita kumari
Amrita kumariAmrita kumari
Amrita kumari
 
Statr session 21 and 22
Statr session 21 and 22Statr session 21 and 22
Statr session 21 and 22
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in Statistics
 
Biostatistics ii
Biostatistics iiBiostatistics ii
Biostatistics ii
 

More from Anchal Garg (8)

Episodic Levels of PM10 , PM2.5 and PM1 during Diwali : A study in urban area...
Episodic Levels of PM10 , PM2.5 and PM1 during Diwali : A study in urban area...Episodic Levels of PM10 , PM2.5 and PM1 during Diwali : A study in urban area...
Episodic Levels of PM10 , PM2.5 and PM1 during Diwali : A study in urban area...
 
Water quality deterioration factors
Water quality deterioration factorsWater quality deterioration factors
Water quality deterioration factors
 
SCOPE OF HIGHER EDUCATION AND RESEARCH IN URBAN FORESTRY, LANDSCAPE& URBAN B...
SCOPE	OF	HIGHER	EDUCATION	AND	 RESEARCH	IN	URBAN	FORESTRY, LANDSCAPE&	URBAN	B...SCOPE	OF	HIGHER	EDUCATION	AND	 RESEARCH	IN	URBAN	FORESTRY, LANDSCAPE&	URBAN	B...
SCOPE OF HIGHER EDUCATION AND RESEARCH IN URBAN FORESTRY, LANDSCAPE& URBAN B...
 
E waste and how to manage it
E waste and how to manage itE waste and how to manage it
E waste and how to manage it
 
BIODEGRADATION OF ORGANIC POLLUTANTS
BIODEGRADATION OF ORGANIC POLLUTANTSBIODEGRADATION OF ORGANIC POLLUTANTS
BIODEGRADATION OF ORGANIC POLLUTANTS
 
Energy flow by using energy models in ecosystem
Energy flow by using energy models  in ecosystemEnergy flow by using energy models  in ecosystem
Energy flow by using energy models in ecosystem
 
EIA for development projects
EIA for development projectsEIA for development projects
EIA for development projects
 
bioremediation of oil spills
bioremediation of oil spillsbioremediation of oil spills
bioremediation of oil spills
 

Recently uploaded

case-study-marcopper-disaster in the philippines.pdf
case-study-marcopper-disaster in the philippines.pdfcase-study-marcopper-disaster in the philippines.pdf
case-study-marcopper-disaster in the philippines.pdf
garthraymundo123
 

Recently uploaded (20)

Book Call Girls in Kathua { 9332606886 } VVIP NISHA Call Girls Near 5 Star Hotel
Book Call Girls in Kathua { 9332606886 } VVIP NISHA Call Girls Near 5 Star HotelBook Call Girls in Kathua { 9332606886 } VVIP NISHA Call Girls Near 5 Star Hotel
Book Call Girls in Kathua { 9332606886 } VVIP NISHA Call Girls Near 5 Star Hotel
 
Water Pollution
Water Pollution Water Pollution
Water Pollution
 
Mira Road Reasonable Call Girls ,09167354423,Kashimira Call Girls Service
Mira Road Reasonable Call Girls ,09167354423,Kashimira Call Girls ServiceMira Road Reasonable Call Girls ,09167354423,Kashimira Call Girls Service
Mira Road Reasonable Call Girls ,09167354423,Kashimira Call Girls Service
 
Russian Call girl Dubai 0503464457 Dubai Call girls
Russian Call girl Dubai 0503464457 Dubai Call girlsRussian Call girl Dubai 0503464457 Dubai Call girls
Russian Call girl Dubai 0503464457 Dubai Call girls
 
Climate Change
Climate ChangeClimate Change
Climate Change
 
High Profile Call Girls Service in Udhampur 9332606886 High Profile Call G...
High Profile Call Girls Service in Udhampur   9332606886  High Profile Call G...High Profile Call Girls Service in Udhampur   9332606886  High Profile Call G...
High Profile Call Girls Service in Udhampur 9332606886 High Profile Call G...
 
Low Rate Call Girls Boudh 9332606886 HOT & SEXY Models beautiful and charmin...
Low Rate Call Girls Boudh  9332606886 HOT & SEXY Models beautiful and charmin...Low Rate Call Girls Boudh  9332606886 HOT & SEXY Models beautiful and charmin...
Low Rate Call Girls Boudh 9332606886 HOT & SEXY Models beautiful and charmin...
 
Call Girl in Faridabad ₹7.5k Pick Up & Drop With Cash Payment #8168257667
Call Girl in Faridabad ₹7.5k Pick Up & Drop With Cash Payment #8168257667Call Girl in Faridabad ₹7.5k Pick Up & Drop With Cash Payment #8168257667
Call Girl in Faridabad ₹7.5k Pick Up & Drop With Cash Payment #8168257667
 
Test bank for beckmann and ling s obstetrics and gynecology 8th edition by ro...
Test bank for beckmann and ling s obstetrics and gynecology 8th edition by ro...Test bank for beckmann and ling s obstetrics and gynecology 8th edition by ro...
Test bank for beckmann and ling s obstetrics and gynecology 8th edition by ro...
 
RATING SYSTEMS- IGBC, GRIHA, LEED--.pptx
RATING  SYSTEMS- IGBC, GRIHA, LEED--.pptxRATING  SYSTEMS- IGBC, GRIHA, LEED--.pptx
RATING SYSTEMS- IGBC, GRIHA, LEED--.pptx
 
Fuel Cells and Hydrogen in Transportation - An Introduction
Fuel Cells and Hydrogen in Transportation - An IntroductionFuel Cells and Hydrogen in Transportation - An Introduction
Fuel Cells and Hydrogen in Transportation - An Introduction
 
Presentation: Farmer-led climate adaptation - Project launch and overview by ...
Presentation: Farmer-led climate adaptation - Project launch and overview by ...Presentation: Farmer-led climate adaptation - Project launch and overview by ...
Presentation: Farmer-led climate adaptation - Project launch and overview by ...
 
Call Girls in Tiruppur 9332606886 ust Genuine Escort Model Sevice
Call Girls in Tiruppur  9332606886  ust Genuine Escort Model SeviceCall Girls in Tiruppur  9332606886  ust Genuine Escort Model Sevice
Call Girls in Tiruppur 9332606886 ust Genuine Escort Model Sevice
 
Call Girls Brigade Road ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Brigade Road ( 8250092165 ) Cheap rates call girls | Get low budgetCall Girls Brigade Road ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Brigade Road ( 8250092165 ) Cheap rates call girls | Get low budget
 
A Review on Integrated River Basin Management and Development Master Plan of ...
A Review on Integrated River Basin Management and Development Master Plan of ...A Review on Integrated River Basin Management and Development Master Plan of ...
A Review on Integrated River Basin Management and Development Master Plan of ...
 
Top Call Girls in Dholpur { 9332606886 } VVIP NISHA Call Girls Near 5 Star Hotel
Top Call Girls in Dholpur { 9332606886 } VVIP NISHA Call Girls Near 5 Star HotelTop Call Girls in Dholpur { 9332606886 } VVIP NISHA Call Girls Near 5 Star Hotel
Top Call Girls in Dholpur { 9332606886 } VVIP NISHA Call Girls Near 5 Star Hotel
 
Cyclone Case Study Odisha 1999 Super Cyclone in India.
Cyclone Case Study Odisha 1999 Super Cyclone in India.Cyclone Case Study Odisha 1999 Super Cyclone in India.
Cyclone Case Study Odisha 1999 Super Cyclone in India.
 
Top Call Girls in Bishnupur 9332606886 High Profile Call Girls You Can Get...
Top Call Girls in Bishnupur   9332606886  High Profile Call Girls You Can Get...Top Call Girls in Bishnupur   9332606886  High Profile Call Girls You Can Get...
Top Call Girls in Bishnupur 9332606886 High Profile Call Girls You Can Get...
 
Environmental Topic : Soil Pollution by Afzalul Hoda.pptx
Environmental Topic : Soil Pollution by Afzalul Hoda.pptxEnvironmental Topic : Soil Pollution by Afzalul Hoda.pptx
Environmental Topic : Soil Pollution by Afzalul Hoda.pptx
 
case-study-marcopper-disaster in the philippines.pdf
case-study-marcopper-disaster in the philippines.pdfcase-study-marcopper-disaster in the philippines.pdf
case-study-marcopper-disaster in the philippines.pdf
 

non parametric statistics

  • 1. Non-parametric statistics Anchal, BalRam, Kush Environment Management 2016 USEM
  • 2. Learning objectives Compare and contrast parametric and nonparametric tests Perform and interpret the Mann Whitney U Test Perform and interpret the Sign test and Wilcoxon Signed Rank Test Compare and contrast the Sign test and Wilcoxon Signed Rank Test Perform and interpret the Kruskal Wallis test
  • 4. Parametric Non-parametric Assumed distribution normal any Typical data Ratio or interval Nominal or ordinal Usual central measures mean Median Benefits Can draw many conclusions Simplicity less affected by outliers Tests Independent measures, 2 groups Independent measure t test Mann- whitney test Independent measures, >2 groups One way independent measures ANOVA Kruskal wallis test Repeated measures, 2 conditions Matched pair t-test Wilcoxon test Table: Distinguish between parametric and non parametric statistics
  • 5. Nonparametric tests • Also known as distribution-free tests because they are based on fewer assumptions (e.g., they do not assume that the outcome is approximately normally distributed). • Parametric tests involve specific probability distributions (e.g., the normal distribution) and the tests involve estimation of the key parameters of that distribution (e.g., the mean or difference in means) from the sample data. • There are some situations when it is clear that the outcome does not follow a normal distribution. These include: when the outcome is an ordinal variable or a rank, when there are definite outliers or when the outcome has clear limits of detection.
  • 6. Non-parametric Methods • Sign Test • Wilcoxon Signed-Rank Test • Mann-Whitney-Wilcoxon Test • Kruskal-Wallis Test
  • 7. How to assign the ranks Ordered observed data 0 2 3 5 7 9 ranks 1 2 3 4 5 6 Ordered observed data 0 2 3 7 7 9 ranks 1 2 3 4.5 4.5 6 Ordered observed data 0 2 3 7 7 7 ranks 1 2 3 5 5 5
  • 8. 1. Sign Test • A common application of the sign test involves using a sample of n potential customers to identify a preference for one of two brands of a product. • The objective is to determine whether there is a difference in preference between the two items being compared. • To record the preference data, we use a plus sign if the individual prefers one brand and a minus sign if the individual prefers the other brand. • Because the data are recorded as plus and minus signs, this test is called the sign test.
  • 9. Example: Butter Taste Test • Sign Test: Large-Sample Case As part of a market research study, a sample of 36 consumers were asked to taste two brands of butter and indicate a preference. Do the data shown below indicate a significant difference in the consumer preferences for the two brands? 18 preferred Amul Butter (+ sign recorded) 12 preferred Mother Dairy Butter (_ sign recorded) 6 had no preference The analysis is based on a sample size of 18 + 12 = 30.
  • 10. Example: Butter Taste Test H0: No preference for one brand over the other exists Ha: A preference for one brand over the other exists • Sampling Distribution 2.74 Sampling distribution of the number of “+” values if there is no brand preference  =30/2 =15
  • 11. Example: Butter Taste Test • Rejection Rule Using 0.05 level of significance, Reject H0 if z < -1.96 or z > 1.96 • Test Statistic z = (18 - 15)/2.74 = 3/2.74 = 1.095 • Conclusion Do not reject H0. There is insufficient evidence in the sample to conclude that a difference in preference exists for the two brands of butter.
  • 12. 2. Wilcoxon Signed-Rank Test • The Wilcoxon test is used when we are unwilling to make assumptions about the form of the underlying population probability distributions, • but we want compare paired samples. • Analogous to the dependent t-test we are interested in the difference • in two measurements taken from each person. • The rank sum of the positive (T+) and negative (T−) differences are calculated, the smallest of these is used as the test statistic to test the hypothesis. • Two assumptions underlie the use of this technique. 1. The paired data are selected randomly. 2. The underlying distributions are symmetrical.
  • 13. Small-Sample Case (n ≤15) • When sample size is small, a critical value against which to compare T can be found by table, to determine whether the null hypothesis should be rejected. The critical value is located by using n and α. • If the observed value of T is less than or equal to the critical value of T, the decision is to reject the null hypothesis.
  • 14. Problem: The survey by American Demographics estimated the average annual household spending on healthcare. The U.S. metropolitan average was $1,800. Suppose six families in Pittsburgh, Pennsylvania, are matched demographically with six families in Oakland, California, and their amounts of household spending on healthcare for last year are obtained. The data follow on the next page. A healthcare analyst uses α = 0.05 to test to determine whether there is a significant difference in annual household healthcare spending between these two cities.
  • 15. STEP 1. The following hypotheses are being tested. H0: Md= 0 Ha: Md≠ 0 STEP 2. Because the sample size of pairs is six, the small- sample Wilcoxon matched pairs signed ranks test is appropriate if the underlying distributions are assumed to be symmetrical.
  • 16. Step 3. calculated T = 3 is greater than critical T = 1 (at α=0.05, the decision is to accept the null hypothesis. Conclusion: there is no significant difference in annual household healthcare spending between Pittsburgh and Oakland.
  • 17. Large-Sample Case (n >15) For large samples, the T statistic is approximately normally distributed and a z score can be used as the test statistic.
  • 18. Problem: Suppose a company implemented a quality- control program and has been operating under it for 2 years. The company’s president wants to determine whether worker productivity significantly increased since installation of the program. Use a non parametric statistics for the following data at α=0.01.
  • 19. STEP 1. The following hypotheses are being tested. H0: Md= 0 Ha: Md≠ 0 STEP 2. Wilcoxon matched-pairs signed rank test to be applied on the data to test the difference in productivity from before to after. STEP 3. Computes the difference values and because zero differences are to be eliminated, deletes worker 3 from the study. This reduces n from 20 to 19, then ranks the differences regardless of sign.
  • 20.
  • 21. STEP 4. This test is one tailed. The critical value is z =-2.33. Here calculate z is less than critical z, hence we reject the null hypothesis. Conclusion: The productivity is significantly greater after the implementation of quality control at this company.
  • 22. 3. Mann-Whitney-Wilcoxon Test (U test) • Also known as Wilcoxon rank sum test. • It is a nonparametric counterpart of the t test used to compare the means of two independent populations. • The following assumptions underlie the use of the Mann-Whitney U test. 1. The samples are independent. 2. The level of data is at least ordinal. • The two-tailed hypotheses being tested with the Mann-Whitney U test are as follows. H0: The two populations are identical. Ha: The two populations are not identical.
  • 23. Mann-Whitney-Wilcoxon Test (U test) • Computation of the U test begins by arbitrarily designating two samples as group 1 and group 2. The data from the two groups are combined into one group, with each data value retaining a group identifier of its original group. The pooled values are then ranked from 1 to n. • W 1 and W2 are the sum of the ranks of values from group 1 and group 2 respectively. • Small sample case: when both n1 and n2 ≤10 • Large sample case: when both n1 and n2 >10
  • 24. Small sample case • The test statistic is the smallest of these two U values. • Determine the p-value for a U from the table. • Note: For a two-tailed test, double the p-value shown in the table.
  • 25.
  • 26.
  • 27.
  • 28. • Because U2 is the smaller value of U, we use U = 3 as the test statistic. Because it is the smallest size, let n1= 7; n2= 8. • STEP 7. from table find a p-value of 0.0011. Because this test is two tailed, we double the table p-value, producing a final p-value of .0022. Because the p- value is less than α = 0.05, the null hypothesis is rejected. • The statistical conclusion is that the populations are not identical.
  • 29. Large-Sample Case • For large sample sizes, the value of U is approximately normally distributed and hence compute a z score for the U value. A decision is then made whether to reject the null hypothesis. A z score can be calculated from U by the following formulas.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34. 4. Kruskal Wallis Test • Like the one-way analysis of variance, the Kruskal-Wallis test is used to determine whether c ≥3 samples come from the same or different populations. • The Kruskal-Wallis test is based on the assumption that the c groups are independent and that individual items are selected randomly. The hypotheses tested by the Kruskal-Wallis test follow. H0 :The c populations are identical. Ha: At least one of the c populations is different.
  • 35. Kruskal Wallis Test o This test determines whether all of the groups come from the same or equal populations or whether at least one group comes from a different population. o The process of computing a Kruskal-Wallis K statistic begins with ranking the data in all the groups together, as though they were from one group.
  • 36.
  • 37.
  • 38.
  • 39. Start Are the samples independent? Use Mann whitney U test Are the data atleast interval ? Use sign test Use Wilcoxon signed rank test yes yes no no Deciding which test to use
  • 40. Advantages of Nonparametric Tests • Used with all scales • Easier to compute — Developed originally before wide computer use • Make fewer assumptions • Need not involve population parameters • Results may be as exact as parametric procedures .
  • 41. Disadvantages of Nonparametric Tests • May waste information — If data permit using parametric procedures — Example: converting data from ratio to ordinal scale • Difficult to compute by hand for large samples • Tables not widely available .
  • 42. • Jarkko Isotalo, Basics of Statistics (Available online at: http://www.mv.helsinki.fi/home/jmisotal/BoS.pdf) • Ken Black, 6th edition, Business Statistics For Contemporary Decision Making • Lisa Sullivan, Non parametric statistics, Boston University School of Public Health (available online at: http://sphweb.bumc.bu.edu/otlt/MPHModules/BS/BS704_Nonparametri c/BS704_Nonparametric_print.html) • Arora, P.N and Malhan P.K; Biostatistics, 2009 Edition • http://blog.minitab.com/blog/adventures-in-statistics/choosing- between-a-nonparametric-test-and-a-parametric-test