SlideShare a Scribd company logo
1 of 41
1
Research Methods in Health
Chapter 9. Estimation
Young Moon Chae, Ph.D.
Graduate School of Public Health
Yonsei University, Korea
ymchae@yuhs.ac
2
Correlation
3
Questions
• Why does the maximum
value of r equal 1.0?
• What does it mean when a
correlation is positive?
Negative?
• What is the purpose of the
Fisher r to z transformation?
• What is range restriction?
Range enhancement? What
do they do to r?
• Give an example in which
data properly analyzed by
ANOVA cannot be used to
infer causality.
• Why do we care about the
sampling distribution of the
correlation coefficient?
• What is the effect of reliability
on r?
4
Basic Ideas
• Nominal vs. continuous IV
• Degree (direction) & closeness (magnitude) of linear relations
-Sign (+ or -) for direction
-Absolute value for magnitude
• Pearson product-moment correlation coefficient
N
zz
r
YXå=
5
Illustrations
757269666360
Height
210
180
150
120
90
Weight
Plot of Weight by Height
4003002001000
Study Time
30
20
10
0
Errors
Plot of Errors by Study Time
1.91.81.71.61.5
Toe Size
700
600
500
400
SAT-V
Plot of SAT-V by Toe Size
Positive, negative, zero
6
Graphic Representation
757269666360
Height
210
180
150
120
90
Weight
Plot of Weight by Height
757269666360
Height
Plot of Weight by HeightPlot of Weight by Height
Mean = 66.8 Inches
Mean = 150.7 lbs.
210-1-2
Z-height
2
1
0
-1
-2
Z-weight
Plot of Weight by Height in Z-scores
2
1
0
-1
-2
Z-weight
Plot of Weight by Height in Z-scoresPlot of Weight by Height in Z-scores
+
-
-
+
1. Conversion from raw to z.
2. Points & quadrants. Positive & negative products.
3. Correlation is average of cross products. Sign & magnitude of r
depend on where the points fall.
4. Product at maximum (average =1) when points on line where zX=zY.
7
Correlation Analysis
• It measures the closeness of the relationship between two or more variables
• The degree of association or covariation between variables, no causality
• Measures of Association by Measurement
• Interpretation of Correlation
-T-test
8
Regression
9
Questions
• What are predictors and criteria?
• Write an equation for the linear
regression. Describe each term.
• How do changes in the slope and
intercept affect (move) the
regression line?
• What does it mean to test the
significance of the regression sum
of squares? R-square?
• What is R-square?
• What does it mean to choose a
regression line to satisfy the loss
function of least squares?
• How do we find the slope and
intercept for the regression line with
a single independent variable?
(Either formula for the slope is
acceptable.)
• Why does testing for the regression
sum of squares turn out to have the
same result as testing for R-
square?
10
Basic Ideas
• Jargon
-IV = X = Predictor (pl. predictors)
-DV = Y = Criterion (pl. criteria)
-Regression of Y on X e.g., GPA on SAT
• Linear Model = relations between IV and DV represented by straight line.
• A score on Y has 2 parts – (1) linear function of X and (2) error.
Y Xi i i= + +a b e (population values)
11
Regression Analysis
• It refers to the techniques used to derive an equation that relates the
criterion variable to one or more predictor variables
• Method of least squares
• Standardized coefficients
• Goodness of fit
-F test, t test, Coefficient of Determination
• multicollinearity
12
Multiple linear regression
13
ANOVA as linear regression
14
Results
15
Raw & Standardized Regression Weights
• Each X has a raw score slope, b.
• Slope tells expected change in Y if
X changes 1 unit*.
• Large b weights should indicate
important variables, but b depends
on variance of X.
• A b for height in inches would be
12 times larger than b for height in
feet.
• If we standardize X and Y, all units
of X are the same.
• Relative size of b now meaningful.
*strictly speaking, holding other X variables constant.
16
Tests of R2 vs Tests of b
• Slopes (b) tell about the relation between Y and the unique part of X. R2
tells about proportion of variance in Y accounted for by set of predictors all
together.
• Correlations among X variables increase the standard errors of b weights
but not R2
.
• Possible to get significant R2
, but no or few significant b weights
• Possible but unlikely to have significant b but not significant R2
. Look to R2
first. If it is n.s., avoid interpreting b weights.
17
Testing Incremental R2
You can start regression with a set of one or more variables and then
add predictors 1 or more at a time. When you add predictors, R
2
will
never go down. It usually goes up, and you can test whether the
increment in R
2
is significant or else if likely due to chance.
)1/()1(
)/()(
2
22
---
--
=
LL
SLSL
kNR
kkRR
F
2
LR
2
SR
Sk
Lk
=R-square for the larger model
=R-square for the smaller model
= number of predictors in the larger model
=number of predictors in the smaller model
18
(cont.)
• In regression problems, the most commonly used indices of
importance are the correlation, r, and the increment to R-square
when the variable of interest is considered last. The second is
sometimes called a last-in R-square change. The last-in increment
corresponds to the Type III sums of squares and is closely related to
the b weight.
• The correlation tells about the importance of the variable ignoring
all other predictors.
• The last-in increment tells about the importance of the variable as
a unique contributor to the prediction of Y, above and beyond all
other predictors in the model.
•“Importance” is not well defined statistically when IVs are correlated.
Doesn’t include mediated models (path analysis).
19
Collinearity Defined
• The problem of large correlations among the independent variables
• Within the set of IVs, one or more IVs are (nearly) totally predicted by the
other IVs.
• In such a case, the b or beta weights are poorly estimated.
• Problem of the “Bouncing Betas.”
20
Dealing with Collinearity
• Lump it. Admit ambiguity; SE of b weights. Refer also to correlations.
• Select or combine variables.
• Factor analyze set of IVs.
• Use another type of analysis (e.g., path analysis).
• Use another type of regression (ridge regression).
• Unit weights (no longer regression).
21
Diagnostics
Checking Assumptions and Bad Data
22
Good-Looking Graph
6420-2
X
9
6
3
0
-3
Y
No apparent departures from line.
23
Problem with Linearity
50 100 150 200 250
Horsepower
10
20
30
40
50
MilesperGallon
R Sq Linear = 0.595
24
Outliers
65320-2
X
10
8
6
3
1
-1
Y
Outlier
Outlier = pathological point
25
Non-parametric or Distribution-free Tests
26
Non-parametric or Distribution-free Tests
• Two kinds of assertions in statistical tests: 1. Assertion directly related to the
purpose of investigation, i.e., hypothesis to be tested 2. Assertion to make a
probability statement. Set of all assertions is called the model
• Testing a hypothesis without a model is non-parametric test. That is, tests
which do not make basic assumptions about and without having the
knowledge of the distribution of the population parameters
27
Characteristics
1.Do not depend on any assumptions about properties / parameters of the
parent population, I.e., do not suppose any particular distribution &
consequential assumptions (Parametric tests like ‘t’& ‘F’ tests make
assumption about homogeneity of the variances) & No such assumptions or
less restricting assumptions
2.When measurements are not so accurate, non-parametric tests come very
handy
3.Most non-parametric tests assume only nominal or ordinal data I.e., more
suitable (than parametric tests) for nominal & ordinal (or rated data)
4.Involves few arithmetic computations
28
(cont.)
5.Usually less efficient & powerful than parametric tests as they are based on
no assumption
6.Greater risk of accepting a false hypothesis and committing type II error;
Non-parametric tests require more observations than parametric tests to
achieve the same size of type I and type II errors
7.Null hypothesis is somewhat loosely defined & hence rejection of null
hypothesis may lead to less precise conclusion than parametric tests
8.It is a trade off between loss in sharpness of estimating intervals and gain in
the ability of using less information & to calculate faster
29
Some important applications are
(I)concerning single value for the given data
(II)difference among 2 or more sets of data
(III)relations between variables
(IV)variation in the given data
(V)randomness of a sample
(VI)association or dependency of categorical data
(VII)comparing theoretical population with actual data in categories
30
Typical situation
1.Data not likely to be normally distributed
2.Nominal data from responses to questionnaire
3.Partially filled questions, i.e., to handle incomplete / missing data. to make
necessary adjustments to extract maximum information from average data
4.Reasonably good results from even very small sample but need more
observations than parametric tests to achieve the same size of type I and
type II errors
31
31
32
Mc Nemer Test
•Useful for testing nominal data of two related samples and before –after
measurements of the same subjects with a view to judge the significance
for any observed change after treatment
33
Chi-Square Test
• An important non-parametric test for significance of association as well as
for testing hypothesis regarding (i) goodness of fit and (ii) homogeneity or
significance of population variance
• When responses are classified into two mutually exclusive classes like favor
-not favor, like -dislike, etc.
• To find whether differences exist between observed and expected data
• χ2is not a measure of degree of relationship
• 2. Assumes random observations
• 3. Items in the sample are independent
34
(cont.)
• Constraints are linear, no cell contains less than five as frequency value and
over all no. of items must be reasonably large (Yate’s correction can be
applied to a 2x2 table if cells frequencies are smaller than five); Use
Kolmogorov-Smirnov Test
• PHI Coefficient, φ= √χ2/ N , as a non-parametric measure of coefficient of
correlation helps to estimate the magnitude of association;
• Cramer’s V-measure, V = φ2/ √min. (r-1), (c-1)
• Coefficient of Contingency, C = √χ2/ χ2+ N , also known as coefficient of
mean square contingency, is a non-parametric measure of relationship
useful where contingency tables are higher order than 2x2 and combining
classes is not possible for Yule’s coefficient of association
35
Wilcoxon-Mann-Whitney U-Test
• Most powerful non-parametric test to determine whether two independent
samples have been drawn from the same population. Used as alternative to
t-test both for qualitative and quantitative data
• Both the samples are pooled together and elements arranged in ascending
order to find U
36
Wilcoxon Matched Pair or Signed Rank Test
• Used in the context of two-related samples where we can determine both
direction and magnitude of difference. Examples: wife & husband, subjects
studied before & after experiment, comparing output of two machines, etc.
• As it attaches greater weight to pair which shows a larger difference it is
more powerful test than sign test
• Null hypothesis (Ho) is that there is no difference in the two groups with
respect to characteristics under study
37
K Sample (i.e., more than two sample) Tests
The Kruskal-Wallis Test or H Test:
• Similar to U test;
• H0, ‘K’ individual random samples come from identical universes; does not
require approximation of normal distribution as H follows Chi-square
distribution; use Chi-square table.
38
A few points on K-W
• Calculation of P-values: (avoiding type I errors)
– F statistic: F distribution (requires normality)
– K-W statistic: 2 distribution (requires large samples)
– Either statistic: Permutation tests
• Power: (avoiding type II errors)
– K-W statistic more resistant to outliers
– F statistic more powerful in the case of normality
• K-W statistic: don’t need to worry about transformations
39
Reference
• Cohen, Louis and Manion, Lawrence. Research methods in education.
London: Routledge, 1980.
• Goode, William J and Hatt, Paul K. Methods on social research. London; Mc
GrawHill, 1981.
• Gopal, M.H. An introduction to research procedures in social sciences.
Bombay: Asia Publishing House, 1970.
• Koosis, Donald J. Business statistics. New York: John Wiley,1972.
40
Multivariate Analysis
• Discriminant Analysis
-It joins a nominally scaled criterion or dependent variable with one or
more independent variables that are interval or ratio scaled.
• Multivariate ANOVA
-Assesses the relationship between two or more dependent variables
and classificatory variables or factors
• LISREL (Linear Structural Relationships)
-Measurement and Structural equation model
-Causality testing
41
Interdependency Techniques
•Factor analysis
-A factor is a linear combination of variables
-Construct with a new set of variables based on the
relationships in the correlation matrix
-Factor loading
-Orthogonal or oblique rotaion
•Cluster Analysis
-A set of technique for grouping similar objects or people

More Related Content

What's hot

Generalized Linear Models for Between-Subjects Designs
Generalized Linear Models for Between-Subjects DesignsGeneralized Linear Models for Between-Subjects Designs
Generalized Linear Models for Between-Subjects Designssmackinnon
 
Statistics And Correlation
Statistics And CorrelationStatistics And Correlation
Statistics And Correlationpankaj prabhakar
 
Multiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IMultiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IJames Neill
 
Introduction to Regression Analysis and R
Introduction to Regression Analysis and R   Introduction to Regression Analysis and R
Introduction to Regression Analysis and R Rachana Taneja Bhatia
 
Introduction to Mediation using SPSS
Introduction to Mediation using SPSSIntroduction to Mediation using SPSS
Introduction to Mediation using SPSSsmackinnon
 
Research Methodology (Correlational Research) By Emeral & Sarah
Research Methodology (Correlational Research) By Emeral & SarahResearch Methodology (Correlational Research) By Emeral & Sarah
Research Methodology (Correlational Research) By Emeral & SarahEmeral Djunas
 
Inferential statistics quantitative data - single sample and 2 groups
Inferential statistics   quantitative data - single sample and 2 groupsInferential statistics   quantitative data - single sample and 2 groups
Inferential statistics quantitative data - single sample and 2 groupsDhritiman Chakrabarti
 
Structural equation modeling in amos
Structural equation modeling in amosStructural equation modeling in amos
Structural equation modeling in amosBalaji P
 
A gentle introduction to growth curves using SPSS
A gentle introduction to growth curves using SPSSA gentle introduction to growth curves using SPSS
A gentle introduction to growth curves using SPSSsmackinnon
 

What's hot (19)

Generalized Linear Models for Between-Subjects Designs
Generalized Linear Models for Between-Subjects DesignsGeneralized Linear Models for Between-Subjects Designs
Generalized Linear Models for Between-Subjects Designs
 
Specification Errors | Eonomics
Specification Errors | EonomicsSpecification Errors | Eonomics
Specification Errors | Eonomics
 
Statistics And Correlation
Statistics And CorrelationStatistics And Correlation
Statistics And Correlation
 
Multiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA IMultiple Linear Regression II and ANOVA I
Multiple Linear Regression II and ANOVA I
 
Confirmatory Factor Analysis
Confirmatory Factor AnalysisConfirmatory Factor Analysis
Confirmatory Factor Analysis
 
Inferential statistics correlations
Inferential statistics correlationsInferential statistics correlations
Inferential statistics correlations
 
Regression vs ANOVA
Regression vs ANOVARegression vs ANOVA
Regression vs ANOVA
 
Structural Equation Modelling (SEM) Part 1
Structural Equation Modelling (SEM) Part 1Structural Equation Modelling (SEM) Part 1
Structural Equation Modelling (SEM) Part 1
 
Applied statistics part 4
Applied statistics part  4Applied statistics part  4
Applied statistics part 4
 
Introduction to Regression Analysis and R
Introduction to Regression Analysis and R   Introduction to Regression Analysis and R
Introduction to Regression Analysis and R
 
Sem+Essentials
Sem+EssentialsSem+Essentials
Sem+Essentials
 
Introduction to Mediation using SPSS
Introduction to Mediation using SPSSIntroduction to Mediation using SPSS
Introduction to Mediation using SPSS
 
Correlation
CorrelationCorrelation
Correlation
 
Research Methodology (Correlational Research) By Emeral & Sarah
Research Methodology (Correlational Research) By Emeral & SarahResearch Methodology (Correlational Research) By Emeral & Sarah
Research Methodology (Correlational Research) By Emeral & Sarah
 
Applied statistics part 3
Applied statistics part 3Applied statistics part 3
Applied statistics part 3
 
Sem with amos ii
Sem with amos iiSem with amos ii
Sem with amos ii
 
Inferential statistics quantitative data - single sample and 2 groups
Inferential statistics   quantitative data - single sample and 2 groupsInferential statistics   quantitative data - single sample and 2 groups
Inferential statistics quantitative data - single sample and 2 groups
 
Structural equation modeling in amos
Structural equation modeling in amosStructural equation modeling in amos
Structural equation modeling in amos
 
A gentle introduction to growth curves using SPSS
A gentle introduction to growth curves using SPSSA gentle introduction to growth curves using SPSS
A gentle introduction to growth curves using SPSS
 

Viewers also liked

Health sciences university of mongolia
Health sciences university of mongoliaHealth sciences university of mongolia
Health sciences university of mongolianaranbatn
 
хичээлийн хуваарь 2011-2012 1-р улирал
хичээлийн хуваарь 2011-2012 1-р улиралхичээлийн хуваарь 2011-2012 1-р улирал
хичээлийн хуваарь 2011-2012 1-р улиралnaranbatn
 
хэлэлцүүлэх материалын танилцуулга
хэлэлцүүлэх материалын танилцуулгахэлэлцүүлэх материалын танилцуулга
хэлэлцүүлэх материалын танилцуулгаnaranbatn
 
Магистрын ганцаарчилсан сургалтын төлөвлөгөө
Магистрын ганцаарчилсан сургалтын төлөвлөгөөМагистрын ганцаарчилсан сургалтын төлөвлөгөө
Магистрын ганцаарчилсан сургалтын төлөвлөгөөnaranbatn
 
эрүүл мэндийн шинжлэх ухаан
эрүүл мэндийн шинжлэх ухаанэрүүл мэндийн шинжлэх ухаан
эрүүл мэндийн шинжлэх ухаанnaranbatn
 
Logframme _ strategic options__by dr malik khalid mehmood ph_d
Logframme  _ strategic options__by dr malik khalid mehmood ph_dLogframme  _ strategic options__by dr malik khalid mehmood ph_d
Logframme _ strategic options__by dr malik khalid mehmood ph_dMalik Khalid Mehmood
 
д.амарсайхан захирал
д.амарсайхан захиралд.амарсайхан захирал
д.амарсайхан захиралnaranbatn
 
EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015
EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015
EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015Epic Research Singapore
 
L1 introduction to respiratory medicine
L1 introduction to respiratory medicineL1 introduction to respiratory medicine
L1 introduction to respiratory medicinebilal natiq
 
Unemployment - A Quick View
Unemployment - A Quick ViewUnemployment - A Quick View
Unemployment - A Quick ViewMarco Laera
 
03011181320072 fernando taslim_analisa_variance_dengan_ppt
03011181320072 fernando taslim_analisa_variance_dengan_ppt03011181320072 fernando taslim_analisa_variance_dengan_ppt
03011181320072 fernando taslim_analisa_variance_dengan_pptFernando Taslim
 

Viewers also liked (12)

Health sciences university of mongolia
Health sciences university of mongoliaHealth sciences university of mongolia
Health sciences university of mongolia
 
хичээлийн хуваарь 2011-2012 1-р улирал
хичээлийн хуваарь 2011-2012 1-р улиралхичээлийн хуваарь 2011-2012 1-р улирал
хичээлийн хуваарь 2011-2012 1-р улирал
 
хэлэлцүүлэх материалын танилцуулга
хэлэлцүүлэх материалын танилцуулгахэлэлцүүлэх материалын танилцуулга
хэлэлцүүлэх материалын танилцуулга
 
Магистрын ганцаарчилсан сургалтын төлөвлөгөө
Магистрын ганцаарчилсан сургалтын төлөвлөгөөМагистрын ганцаарчилсан сургалтын төлөвлөгөө
Магистрын ганцаарчилсан сургалтын төлөвлөгөө
 
эрүүл мэндийн шинжлэх ухаан
эрүүл мэндийн шинжлэх ухаанэрүүл мэндийн шинжлэх ухаан
эрүүл мэндийн шинжлэх ухаан
 
Logframme _ strategic options__by dr malik khalid mehmood ph_d
Logframme  _ strategic options__by dr malik khalid mehmood ph_dLogframme  _ strategic options__by dr malik khalid mehmood ph_d
Logframme _ strategic options__by dr malik khalid mehmood ph_d
 
д.амарсайхан захирал
д.амарсайхан захиралд.амарсайхан захирал
д.амарсайхан захирал
 
WTB_Day_MP2010
WTB_Day_MP2010 WTB_Day_MP2010
WTB_Day_MP2010
 
EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015
EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015
EPIC RESEARCH SINGAPORE - Daily SGX Singapore report of 01 2015
 
L1 introduction to respiratory medicine
L1 introduction to respiratory medicineL1 introduction to respiratory medicine
L1 introduction to respiratory medicine
 
Unemployment - A Quick View
Unemployment - A Quick ViewUnemployment - A Quick View
Unemployment - A Quick View
 
03011181320072 fernando taslim_analisa_variance_dengan_ppt
03011181320072 fernando taslim_analisa_variance_dengan_ppt03011181320072 fernando taslim_analisa_variance_dengan_ppt
03011181320072 fernando taslim_analisa_variance_dengan_ppt
 

Similar to Research method ch09 statistical methods 3 estimation np

A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptvigia41
 
Introduction to Business Analytics Course Part 10
Introduction to Business Analytics Course Part 10Introduction to Business Analytics Course Part 10
Introduction to Business Analytics Course Part 10Beamsync
 
regression-linearandlogisitics-220524024037-4221a176 (1).pdf
regression-linearandlogisitics-220524024037-4221a176 (1).pdfregression-linearandlogisitics-220524024037-4221a176 (1).pdf
regression-linearandlogisitics-220524024037-4221a176 (1).pdflisow86669
 
Applied statistics lecture_6
Applied statistics lecture_6Applied statistics lecture_6
Applied statistics lecture_6Daria Bogdanova
 
Parametric tests seminar
Parametric tests seminarParametric tests seminar
Parametric tests seminardrdeepika87
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxkrunal soni
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in StatisticsVikash Keshri
 
Chi square test final
Chi square test finalChi square test final
Chi square test finalHar Jindal
 
Quantitative analysis
Quantitative analysisQuantitative analysis
Quantitative analysisRajesh Mishra
 
Biostatistics lecture notes 7.ppt
Biostatistics lecture notes 7.pptBiostatistics lecture notes 7.ppt
Biostatistics lecture notes 7.pptletayh2016
 
Introduction to measures of relationship: covariance, and Pearson r
Introduction to measures of relationship: covariance, and Pearson rIntroduction to measures of relationship: covariance, and Pearson r
Introduction to measures of relationship: covariance, and Pearson rIvan Jacob Pesigan
 

Similar to Research method ch09 statistical methods 3 estimation np (20)

UNIT 5.pptx
UNIT 5.pptxUNIT 5.pptx
UNIT 5.pptx
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.ppt
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Introduction to Business Analytics Course Part 10
Introduction to Business Analytics Course Part 10Introduction to Business Analytics Course Part 10
Introduction to Business Analytics Course Part 10
 
Correlation & Regression.pptx
Correlation & Regression.pptxCorrelation & Regression.pptx
Correlation & Regression.pptx
 
regression-linearandlogisitics-220524024037-4221a176 (1).pdf
regression-linearandlogisitics-220524024037-4221a176 (1).pdfregression-linearandlogisitics-220524024037-4221a176 (1).pdf
regression-linearandlogisitics-220524024037-4221a176 (1).pdf
 
Linear and Logistics Regression
Linear and Logistics RegressionLinear and Logistics Regression
Linear and Logistics Regression
 
Applied statistics lecture_6
Applied statistics lecture_6Applied statistics lecture_6
Applied statistics lecture_6
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Statistical analysis in SPSS_
Statistical analysis in SPSS_ Statistical analysis in SPSS_
Statistical analysis in SPSS_
 
Parametric tests seminar
Parametric tests seminarParametric tests seminar
Parametric tests seminar
 
Correlation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptxCorrelation _ Regression Analysis statistics.pptx
Correlation _ Regression Analysis statistics.pptx
 
Test of significance in Statistics
Test of significance in StatisticsTest of significance in Statistics
Test of significance in Statistics
 
Chi square test final
Chi square test finalChi square test final
Chi square test final
 
Statistic and orthodontic by almuzian
Statistic and orthodontic by almuzianStatistic and orthodontic by almuzian
Statistic and orthodontic by almuzian
 
Quantitative analysis
Quantitative analysisQuantitative analysis
Quantitative analysis
 
Hm306 week 5
Hm306 week 5Hm306 week 5
Hm306 week 5
 
Hm306 week 5
Hm306 week 5Hm306 week 5
Hm306 week 5
 
Biostatistics lecture notes 7.ppt
Biostatistics lecture notes 7.pptBiostatistics lecture notes 7.ppt
Biostatistics lecture notes 7.ppt
 
Introduction to measures of relationship: covariance, and Pearson r
Introduction to measures of relationship: covariance, and Pearson rIntroduction to measures of relationship: covariance, and Pearson r
Introduction to measures of relationship: covariance, and Pearson r
 

More from naranbatn

Instructions to authors
Instructions to authorsInstructions to authors
Instructions to authorsnaranbatn
 
Төгсөлтийн сургалтыг зохицуулах журам
Төгсөлтийн сургалтыг зохицуулах журамТөгсөлтийн сургалтыг зохицуулах журам
Төгсөлтийн сургалтыг зохицуулах журамnaranbatn
 
ц.лхагвасүрэн захирал
ц.лхагвасүрэн захиралц.лхагвасүрэн захирал
ц.лхагвасүрэн захиралnaranbatn
 
Self eval report english final for printing, 28.09.2011last for printing
Self eval report english final for printing, 28.09.2011last for printingSelf eval report english final for printing, 28.09.2011last for printing
Self eval report english final for printing, 28.09.2011last for printingnaranbatn
 
Бүрдүүлэх материал
Бүрдүүлэх материалБүрдүүлэх материал
Бүрдүүлэх материалnaranbatn
 
мэргэжлийн индекс
мэргэжлийн индексмэргэжлийн индекс
мэргэжлийн индексnaranbatn
 
Germany summer school-2010
Germany summer school-2010Germany summer school-2010
Germany summer school-2010naranbatn
 
Uni sannio courses
Uni sannio coursesUni sannio courses
Uni sannio coursesnaranbatn
 
Germany summer school-2010
Germany summer school-2010Germany summer school-2010
Germany summer school-2010naranbatn
 
Germany international semester
Germany international semesterGermany international semester
Germany international semesternaranbatn
 
Uni sannio courses_faculty_economics
Uni sannio courses_faculty_economicsUni sannio courses_faculty_economics
Uni sannio courses_faculty_economicsnaranbatn
 
Ull ph dproposals
Ull ph dproposalsUll ph dproposals
Ull ph dproposalsnaranbatn
 
Ull ph dproposals
Ull ph dproposalsUll ph dproposals
Ull ph dproposalsnaranbatn
 
Su pd ph-dproposals
Su pd ph-dproposalsSu pd ph-dproposals
Su pd ph-dproposalsnaranbatn
 
Sannio ph dproposals_v4
Sannio ph dproposals_v4Sannio ph dproposals_v4
Sannio ph dproposals_v4naranbatn
 
Postdoc subject
Postdoc subjectPostdoc subject
Postdoc subjectnaranbatn
 
Sampling, master course, 2010
Sampling, master course, 2010Sampling, master course, 2010
Sampling, master course, 2010naranbatn
 
Sampling exercises
Sampling exercisesSampling exercises
Sampling exercisesnaranbatn
 
Research method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anovaResearch method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anovanaranbatn
 

More from naranbatn (20)

Instructions to authors
Instructions to authorsInstructions to authors
Instructions to authors
 
Төгсөлтийн сургалтыг зохицуулах журам
Төгсөлтийн сургалтыг зохицуулах журамТөгсөлтийн сургалтыг зохицуулах журам
Төгсөлтийн сургалтыг зохицуулах журам
 
ц.лхагвасүрэн захирал
ц.лхагвасүрэн захиралц.лхагвасүрэн захирал
ц.лхагвасүрэн захирал
 
Self eval report english final for printing, 28.09.2011last for printing
Self eval report english final for printing, 28.09.2011last for printingSelf eval report english final for printing, 28.09.2011last for printing
Self eval report english final for printing, 28.09.2011last for printing
 
Бүрдүүлэх материал
Бүрдүүлэх материалБүрдүүлэх материал
Бүрдүүлэх материал
 
мэргэжлийн индекс
мэргэжлийн индексмэргэжлийн индекс
мэргэжлийн индекс
 
Germany summer school-2010
Germany summer school-2010Germany summer school-2010
Germany summer school-2010
 
Uni sannio courses
Uni sannio coursesUni sannio courses
Uni sannio courses
 
Germany summer school-2010
Germany summer school-2010Germany summer school-2010
Germany summer school-2010
 
Germany international semester
Germany international semesterGermany international semester
Germany international semester
 
Uni sannio courses_faculty_economics
Uni sannio courses_faculty_economicsUni sannio courses_faculty_economics
Uni sannio courses_faculty_economics
 
Ull ph dproposals
Ull ph dproposalsUll ph dproposals
Ull ph dproposals
 
Ull ph dproposals
Ull ph dproposalsUll ph dproposals
Ull ph dproposals
 
Su pd ph-dproposals
Su pd ph-dproposalsSu pd ph-dproposals
Su pd ph-dproposals
 
Sannio ph dproposals_v4
Sannio ph dproposals_v4Sannio ph dproposals_v4
Sannio ph dproposals_v4
 
Postdoc subject
Postdoc subjectPostdoc subject
Postdoc subject
 
Sampling, master course, 2010
Sampling, master course, 2010Sampling, master course, 2010
Sampling, master course, 2010
 
Sampling exercises
Sampling exercisesSampling exercises
Sampling exercises
 
Havsralt 1
Havsralt 1Havsralt 1
Havsralt 1
 
Research method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anovaResearch method ch08 statistical methods 2 anova
Research method ch08 statistical methods 2 anova
 

Recently uploaded

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Recently uploaded (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Research method ch09 statistical methods 3 estimation np

  • 1. 1 Research Methods in Health Chapter 9. Estimation Young Moon Chae, Ph.D. Graduate School of Public Health Yonsei University, Korea ymchae@yuhs.ac
  • 3. 3 Questions • Why does the maximum value of r equal 1.0? • What does it mean when a correlation is positive? Negative? • What is the purpose of the Fisher r to z transformation? • What is range restriction? Range enhancement? What do they do to r? • Give an example in which data properly analyzed by ANOVA cannot be used to infer causality. • Why do we care about the sampling distribution of the correlation coefficient? • What is the effect of reliability on r?
  • 4. 4 Basic Ideas • Nominal vs. continuous IV • Degree (direction) & closeness (magnitude) of linear relations -Sign (+ or -) for direction -Absolute value for magnitude • Pearson product-moment correlation coefficient N zz r YXå=
  • 5. 5 Illustrations 757269666360 Height 210 180 150 120 90 Weight Plot of Weight by Height 4003002001000 Study Time 30 20 10 0 Errors Plot of Errors by Study Time 1.91.81.71.61.5 Toe Size 700 600 500 400 SAT-V Plot of SAT-V by Toe Size Positive, negative, zero
  • 6. 6 Graphic Representation 757269666360 Height 210 180 150 120 90 Weight Plot of Weight by Height 757269666360 Height Plot of Weight by HeightPlot of Weight by Height Mean = 66.8 Inches Mean = 150.7 lbs. 210-1-2 Z-height 2 1 0 -1 -2 Z-weight Plot of Weight by Height in Z-scores 2 1 0 -1 -2 Z-weight Plot of Weight by Height in Z-scoresPlot of Weight by Height in Z-scores + - - + 1. Conversion from raw to z. 2. Points & quadrants. Positive & negative products. 3. Correlation is average of cross products. Sign & magnitude of r depend on where the points fall. 4. Product at maximum (average =1) when points on line where zX=zY.
  • 7. 7 Correlation Analysis • It measures the closeness of the relationship between two or more variables • The degree of association or covariation between variables, no causality • Measures of Association by Measurement • Interpretation of Correlation -T-test
  • 9. 9 Questions • What are predictors and criteria? • Write an equation for the linear regression. Describe each term. • How do changes in the slope and intercept affect (move) the regression line? • What does it mean to test the significance of the regression sum of squares? R-square? • What is R-square? • What does it mean to choose a regression line to satisfy the loss function of least squares? • How do we find the slope and intercept for the regression line with a single independent variable? (Either formula for the slope is acceptable.) • Why does testing for the regression sum of squares turn out to have the same result as testing for R- square?
  • 10. 10 Basic Ideas • Jargon -IV = X = Predictor (pl. predictors) -DV = Y = Criterion (pl. criteria) -Regression of Y on X e.g., GPA on SAT • Linear Model = relations between IV and DV represented by straight line. • A score on Y has 2 parts – (1) linear function of X and (2) error. Y Xi i i= + +a b e (population values)
  • 11. 11 Regression Analysis • It refers to the techniques used to derive an equation that relates the criterion variable to one or more predictor variables • Method of least squares • Standardized coefficients • Goodness of fit -F test, t test, Coefficient of Determination • multicollinearity
  • 13. 13 ANOVA as linear regression
  • 15. 15 Raw & Standardized Regression Weights • Each X has a raw score slope, b. • Slope tells expected change in Y if X changes 1 unit*. • Large b weights should indicate important variables, but b depends on variance of X. • A b for height in inches would be 12 times larger than b for height in feet. • If we standardize X and Y, all units of X are the same. • Relative size of b now meaningful. *strictly speaking, holding other X variables constant.
  • 16. 16 Tests of R2 vs Tests of b • Slopes (b) tell about the relation between Y and the unique part of X. R2 tells about proportion of variance in Y accounted for by set of predictors all together. • Correlations among X variables increase the standard errors of b weights but not R2 . • Possible to get significant R2 , but no or few significant b weights • Possible but unlikely to have significant b but not significant R2 . Look to R2 first. If it is n.s., avoid interpreting b weights.
  • 17. 17 Testing Incremental R2 You can start regression with a set of one or more variables and then add predictors 1 or more at a time. When you add predictors, R 2 will never go down. It usually goes up, and you can test whether the increment in R 2 is significant or else if likely due to chance. )1/()1( )/()( 2 22 --- -- = LL SLSL kNR kkRR F 2 LR 2 SR Sk Lk =R-square for the larger model =R-square for the smaller model = number of predictors in the larger model =number of predictors in the smaller model
  • 18. 18 (cont.) • In regression problems, the most commonly used indices of importance are the correlation, r, and the increment to R-square when the variable of interest is considered last. The second is sometimes called a last-in R-square change. The last-in increment corresponds to the Type III sums of squares and is closely related to the b weight. • The correlation tells about the importance of the variable ignoring all other predictors. • The last-in increment tells about the importance of the variable as a unique contributor to the prediction of Y, above and beyond all other predictors in the model. •“Importance” is not well defined statistically when IVs are correlated. Doesn’t include mediated models (path analysis).
  • 19. 19 Collinearity Defined • The problem of large correlations among the independent variables • Within the set of IVs, one or more IVs are (nearly) totally predicted by the other IVs. • In such a case, the b or beta weights are poorly estimated. • Problem of the “Bouncing Betas.”
  • 20. 20 Dealing with Collinearity • Lump it. Admit ambiguity; SE of b weights. Refer also to correlations. • Select or combine variables. • Factor analyze set of IVs. • Use another type of analysis (e.g., path analysis). • Use another type of regression (ridge regression). • Unit weights (no longer regression).
  • 23. 23 Problem with Linearity 50 100 150 200 250 Horsepower 10 20 30 40 50 MilesperGallon R Sq Linear = 0.595
  • 26. 26 Non-parametric or Distribution-free Tests • Two kinds of assertions in statistical tests: 1. Assertion directly related to the purpose of investigation, i.e., hypothesis to be tested 2. Assertion to make a probability statement. Set of all assertions is called the model • Testing a hypothesis without a model is non-parametric test. That is, tests which do not make basic assumptions about and without having the knowledge of the distribution of the population parameters
  • 27. 27 Characteristics 1.Do not depend on any assumptions about properties / parameters of the parent population, I.e., do not suppose any particular distribution & consequential assumptions (Parametric tests like ‘t’& ‘F’ tests make assumption about homogeneity of the variances) & No such assumptions or less restricting assumptions 2.When measurements are not so accurate, non-parametric tests come very handy 3.Most non-parametric tests assume only nominal or ordinal data I.e., more suitable (than parametric tests) for nominal & ordinal (or rated data) 4.Involves few arithmetic computations
  • 28. 28 (cont.) 5.Usually less efficient & powerful than parametric tests as they are based on no assumption 6.Greater risk of accepting a false hypothesis and committing type II error; Non-parametric tests require more observations than parametric tests to achieve the same size of type I and type II errors 7.Null hypothesis is somewhat loosely defined & hence rejection of null hypothesis may lead to less precise conclusion than parametric tests 8.It is a trade off between loss in sharpness of estimating intervals and gain in the ability of using less information & to calculate faster
  • 29. 29 Some important applications are (I)concerning single value for the given data (II)difference among 2 or more sets of data (III)relations between variables (IV)variation in the given data (V)randomness of a sample (VI)association or dependency of categorical data (VII)comparing theoretical population with actual data in categories
  • 30. 30 Typical situation 1.Data not likely to be normally distributed 2.Nominal data from responses to questionnaire 3.Partially filled questions, i.e., to handle incomplete / missing data. to make necessary adjustments to extract maximum information from average data 4.Reasonably good results from even very small sample but need more observations than parametric tests to achieve the same size of type I and type II errors
  • 31. 31 31
  • 32. 32 Mc Nemer Test •Useful for testing nominal data of two related samples and before –after measurements of the same subjects with a view to judge the significance for any observed change after treatment
  • 33. 33 Chi-Square Test • An important non-parametric test for significance of association as well as for testing hypothesis regarding (i) goodness of fit and (ii) homogeneity or significance of population variance • When responses are classified into two mutually exclusive classes like favor -not favor, like -dislike, etc. • To find whether differences exist between observed and expected data • χ2is not a measure of degree of relationship • 2. Assumes random observations • 3. Items in the sample are independent
  • 34. 34 (cont.) • Constraints are linear, no cell contains less than five as frequency value and over all no. of items must be reasonably large (Yate’s correction can be applied to a 2x2 table if cells frequencies are smaller than five); Use Kolmogorov-Smirnov Test • PHI Coefficient, φ= √χ2/ N , as a non-parametric measure of coefficient of correlation helps to estimate the magnitude of association; • Cramer’s V-measure, V = φ2/ √min. (r-1), (c-1) • Coefficient of Contingency, C = √χ2/ χ2+ N , also known as coefficient of mean square contingency, is a non-parametric measure of relationship useful where contingency tables are higher order than 2x2 and combining classes is not possible for Yule’s coefficient of association
  • 35. 35 Wilcoxon-Mann-Whitney U-Test • Most powerful non-parametric test to determine whether two independent samples have been drawn from the same population. Used as alternative to t-test both for qualitative and quantitative data • Both the samples are pooled together and elements arranged in ascending order to find U
  • 36. 36 Wilcoxon Matched Pair or Signed Rank Test • Used in the context of two-related samples where we can determine both direction and magnitude of difference. Examples: wife & husband, subjects studied before & after experiment, comparing output of two machines, etc. • As it attaches greater weight to pair which shows a larger difference it is more powerful test than sign test • Null hypothesis (Ho) is that there is no difference in the two groups with respect to characteristics under study
  • 37. 37 K Sample (i.e., more than two sample) Tests The Kruskal-Wallis Test or H Test: • Similar to U test; • H0, ‘K’ individual random samples come from identical universes; does not require approximation of normal distribution as H follows Chi-square distribution; use Chi-square table.
  • 38. 38 A few points on K-W • Calculation of P-values: (avoiding type I errors) – F statistic: F distribution (requires normality) – K-W statistic: 2 distribution (requires large samples) – Either statistic: Permutation tests • Power: (avoiding type II errors) – K-W statistic more resistant to outliers – F statistic more powerful in the case of normality • K-W statistic: don’t need to worry about transformations
  • 39. 39 Reference • Cohen, Louis and Manion, Lawrence. Research methods in education. London: Routledge, 1980. • Goode, William J and Hatt, Paul K. Methods on social research. London; Mc GrawHill, 1981. • Gopal, M.H. An introduction to research procedures in social sciences. Bombay: Asia Publishing House, 1970. • Koosis, Donald J. Business statistics. New York: John Wiley,1972.
  • 40. 40 Multivariate Analysis • Discriminant Analysis -It joins a nominally scaled criterion or dependent variable with one or more independent variables that are interval or ratio scaled. • Multivariate ANOVA -Assesses the relationship between two or more dependent variables and classificatory variables or factors • LISREL (Linear Structural Relationships) -Measurement and Structural equation model -Causality testing
  • 41. 41 Interdependency Techniques •Factor analysis -A factor is a linear combination of variables -Construct with a new set of variables based on the relationships in the correlation matrix -Factor loading -Orthogonal or oblique rotaion •Cluster Analysis -A set of technique for grouping similar objects or people