SlideShare una empresa de Scribd logo
1 de 43
Descargar para leer sin conexión
Statistical Methods in Research
Dr Kiran Gaur
Associate Professor & Head
Department of Statistics, Mathematics & Computer Science
SKN Colloge of Agriculture, Jobner
Statistics
Descriptive statistics – Methods of organizing, summarizing, and
presenting data in an informative way
Inferential statistics – The methods used to determine something about a
population on the basis of a sample
Inference is the process of drawing conclusions or making decisions
about a population based on sample results
Types of variables
Variables
Quantitative
Qualitative
Dichotomic Polynomic Discrete Continuous
Gender, marital
status
Brand of Pc, hair
color
Children in family,
Strokes on a golf
hole
Amount of income
tax paid, weight of a
student
Types of Measurement Scale
Nominal Scale
Colour, Region , gender etc.
Ordinal Scale
Size, grades, SEB etc.
Interval Scale
Temperature, certain size measurement etc.
Ratio Scale
Height, weight, income etc.
Frequency distribution
The frequency with which observations are assigned to each category
or point on a measurement scale.
Most basic form of descriptive statistics
May be expressed as a percentage of the total sample found in
each category
The distribution is “read” differently depending upon the
measurement level
Nominal scales are read as discrete measurements at each level
Ordinal measures show tendencies, but categories should not be
compared
Interval and ratio scales allow for comparison among categories
Cross Tabulation
Chart Guide
Commonly Used Graphs in Business Research
A Taxonomy of Statistics
11
Central Tendency
• Statistical measure that determines a single value that accurately describes
the center of the distribution and represents the entire distribution of
scores.
• By identifying the "average score," central tendency allows
researchers to summarize or condense a large set of data into a single
value.
• In addition, it is possible to compare two (or more) sets of data by
simply comparing the average score (central tendency) for one set
versus the average score for another set.
Measures of central tendency
• These measures give us an idea what the ‘typical’ case in a distribution
• Mean-
• The ‘average’ score—sum of all individual scores divided by the number of scores
• Has a number of useful statistical properties
however, can be sensitive to extreme scores (“outliers”)
• many statistics are based on the mean
• Mode - the most frequent score in a distribution
• good for nominal data
• Median - the midpoint or mid score in a distribution.
• 50% cases above/50% cases below
insensitive to extreme cases
Ordinal or ratio
0
20
40
60
80
100
120
140
160
1
q1
min
median
max
q3
Box- Plot Chart
Dispersion
• Some statistics look at how widely scattered over the scale the
individual scores are
• Groups with identical means can be more or less widely dispersed
• To find out how the group is distributed, we need to know how far
from or close to the mean individual scores are
• Like the mean, these statistics are only meaningful for interval or
ratio-level measures
Estimates of Dispersion
• Range
• Distance between the highest and lowest scores in a distribution;
• sensitive to extreme scores;
• Can compensate by calculating inter quartile range (distance between the 25th and 75th
percentile points) which represents the range of scores for the middle half of a
distribution
Variance (S2)
• Average of squared distances of individual points from the mean
• sample variance
• High variance means that most scores are far away from the mean. Low variance
indicates that most scores cluster tightly about the mean.
• The amount that one score differs from the mean is called its deviation score
(deviate)
• The sum of all deviation scores in a sample is called the sum of squares
Estimates of dispersion
Standard Deviation (SD)
A summary statistic of how much scores vary from the mean
Square root of the Variance
• expressed in the original units of measurement
• Represents the average amount of dispersion in a sample
• Used in a number of inferential statistics
Measures the peackedness of a distribution;
Leptokurtic (positive excess kurtosis, i.e. fatter tails),
Mesokurtic,
Platykurtic (negative excess kurtosis, i.e. thinner tails),
Skewness:
Kurtosis:
Measures the skewness of a distribution;
Positive or Negative skewness
Shape of the Distribution
Negatively
Skewed
Mode
Median
Mean
Symmetric
(Not Skewed)
Mean
Median
Mode
Positively
Skewed
Mode
Median
Mean
Normal distribution
• Many characteristics are distributed through the
population in a ‘normal’ manner
• Normal curves have well-defined statistical properties
• Parametric statistics are based on the assumption that the
variables are distributed normally
Most commonly used statistics
• This is the famous “Bell curve” where many cases fall near
the middle of the distribution and few fall very high or
very low
I.Q. Distribution
Data Transformation
• With skewed data, the mean is not a good measure of central
tendency because it is sensitive to extreme scores
• May need to transform skewed data to make distribution appear
more normal or symmetrical
• Must determine the degree & type of skewness prior to
transformation
Correlation and Regression
Correlation describes the strength of a linear relationship between two variables
Linear means “straight line”
Measures-
Scatter Plot
Karl Pearson Correlation Coefficient
Spearman’s Rank Correlation
Regression tells us how to draw the straight line described by the correlation.
It is the technique concerned with predicting some variables by knowing others i.e
the process of predicting variable Y using variable X
Multiple regression analysis
Multiple regression analysis is a straight forward extension of simple regression
analysis which allows more than one independent variable.
Y = a + b1X1 + b2X2 + …bkXk ;
The b’s are called partial regression coefficients
Statistical Inference
Use a random sample to learn something about a
larger population
Statistical inference: Drawing conclusions about the whole
population on the basis of a sample
Precondition for statistical inference: A sample is randomly
selected from the population (probability sample)
Hypotheses
The null hypothesis, denoted H0, is the claim that is initially assumed to be true. The alternative hypothesis,
denoted by Ha, is the assertion that is contrary to H0. Possible conclusions from hypothesis-testing analysis are
reject H0 or fail to reject H0.
Rules for Hypotheses
H0 is always stated as an equality claim involving parameters.
Ha is an inequality claim that contradicts H0. It may be one-sided (using either > or <) or two-sided (using ≠).
Steps for Hypothesis Testing
Draw Marketing Research Conclusion
Formulate H0 and H1
Select Appropriate Test
Choose Level of Significance
Determine Prob
Assoc with Test Stat
Determine Critical
Value of Test Stat
TSCR
Determine if TSCR
falls into (Non)
Rejection Region
Compare with Level
of Significance, 
Reject/Do not Reject H0
Calculate Test Statistic TSCAL
Choice of an Appropriate Test
What size sample do We need?
The answer to this question is influenced by a number of factors,viz
➢The purpose of the study
➢Population size
➢The risk of selecting a “bad” sample
➢The allowable sampling error
➢Most of all whether undertaking a qualitative or quantitative study
Different approaches for study designs , such as cross section, case-control, cohort
design, longitudinal study, diagnostics test study etc.
Sample Size Determination
Criteria
➢ Level of confidence ( Normally 95%)
➢ Margin of Error (Usually 1%, 3% or 5%)
➢ Degree of variability in the attributes being measured (Prevalence)
More homogeneous population → Smaller sample size
More heterogeneous population → Large sample size for desired precision.
Sample size
Quantitative Qualitative
n =
Z2
σ2
𝑒 2
n =
(Z2σ2𝑁)
e2 𝑁 − 1 + Z2σ2
n =
Z2𝑃𝑄
e2
n =
(Z2𝑃𝑄𝑁)
e2 𝑁−1 +Z2𝑃𝑄
Infinite
Population
Finite
Population
Sample Size Table
Online Sample Size Calculator
https://www.surveysystem.com/sscalc.htm
https://www.calculator.net/sample-size-calculator
http://www.raosoft.com/samplesize.html
https://www.stat.ubc.ca/~rollin/stats/ssize/n2.html
P-Value
Definition: P-value is the probability of obtaining a sample “more extreme” than
one observed from the sample data, if the null hypothesis is true
Understanding P value
Interpreting P-value
Caution : The P-value was never intended to be a substitute for scientific reasoning
Multivariate Analysis Techniques
• Multiple regression
• Canonical correlation
• Discriminant analysis
• Logistic regression
• Survival analysis
• Principal component analysis
• Factor analysis
• Cluster analysis
Thank You…
“All the statistics in the world can’t measure the warmth of a smile.” Chris Hart

Más contenido relacionado

La actualidad más candente

Chapter 6 simple regression and correlation
Chapter 6 simple regression and correlationChapter 6 simple regression and correlation
Chapter 6 simple regression and correlation
Rione Drevale
 
Factor Analysis in Research
Factor Analysis in ResearchFactor Analysis in Research
Factor Analysis in Research
Qasim Raza
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
Aileen Balbido
 
Spss lecture notes
Spss lecture notesSpss lecture notes
Spss lecture notes
David mbwiga
 

La actualidad más candente (20)

Chapter 6 simple regression and correlation
Chapter 6 simple regression and correlationChapter 6 simple regression and correlation
Chapter 6 simple regression and correlation
 
Range
RangeRange
Range
 
Multivariate Analysis Techniques
Multivariate Analysis TechniquesMultivariate Analysis Techniques
Multivariate Analysis Techniques
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 
01 parametric and non parametric statistics
01 parametric and non parametric statistics01 parametric and non parametric statistics
01 parametric and non parametric statistics
 
Multivariate Analysis
Multivariate AnalysisMultivariate Analysis
Multivariate Analysis
 
Measure of Dispersion in statistics
Measure of Dispersion in statisticsMeasure of Dispersion in statistics
Measure of Dispersion in statistics
 
Correlation ppt...
Correlation ppt...Correlation ppt...
Correlation ppt...
 
Factor Analysis in Research
Factor Analysis in ResearchFactor Analysis in Research
Factor Analysis in Research
 
Measures of dispersion
Measures  of  dispersionMeasures  of  dispersion
Measures of dispersion
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Basic Statistics
Basic  StatisticsBasic  Statistics
Basic Statistics
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Measure of Dispersion
Measure of DispersionMeasure of Dispersion
Measure of Dispersion
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
Spss lecture notes
Spss lecture notesSpss lecture notes
Spss lecture notes
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Skewness
SkewnessSkewness
Skewness
 
Variance & standard deviation
Variance & standard deviationVariance & standard deviation
Variance & standard deviation
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
 

Similar a Statistical Methods in Research

Univariate Analysis
 Univariate Analysis Univariate Analysis
Univariate Analysis
Soumya Sahoo
 
Basics in Epidemiology & Biostatistics 2 RSS6 2014
Basics in Epidemiology & Biostatistics 2 RSS6 2014Basics in Epidemiology & Biostatistics 2 RSS6 2014
Basics in Epidemiology & Biostatistics 2 RSS6 2014
RSS6
 

Similar a Statistical Methods in Research (20)

Univariate Analysis
 Univariate Analysis Univariate Analysis
Univariate Analysis
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
 
Res701 research methodology lecture 7 8-devaprakasam
Res701 research methodology lecture 7 8-devaprakasamRes701 research methodology lecture 7 8-devaprakasam
Res701 research methodology lecture 7 8-devaprakasam
 
Introduction to Statistics2312.ppt
Introduction to Statistics2312.pptIntroduction to Statistics2312.ppt
Introduction to Statistics2312.ppt
 
Introduction to Statistics23122223.ppt
Introduction to Statistics23122223.pptIntroduction to Statistics23122223.ppt
Introduction to Statistics23122223.ppt
 
Descriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptxDescriptive_statistics - Sample 1.pptx
Descriptive_statistics - Sample 1.pptx
 
2. chapter ii(analyz)
2. chapter ii(analyz)2. chapter ii(analyz)
2. chapter ii(analyz)
 
Medical Statistics.ppt
Medical Statistics.pptMedical Statistics.ppt
Medical Statistics.ppt
 
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical SciencesExploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
 
Introduction to Statistics53004300.ppt
Introduction to Statistics53004300.pptIntroduction to Statistics53004300.ppt
Introduction to Statistics53004300.ppt
 
Chapter34
Chapter34Chapter34
Chapter34
 
Statistics
StatisticsStatistics
Statistics
 
Descriptive Analysis.pptx
Descriptive Analysis.pptxDescriptive Analysis.pptx
Descriptive Analysis.pptx
 
Bgy5901
Bgy5901Bgy5901
Bgy5901
 
1 introduction to psychological statistics
1 introduction to psychological statistics1 introduction to psychological statistics
1 introduction to psychological statistics
 
Basics in Epidemiology & Biostatistics 2 RSS6 2014
Basics in Epidemiology & Biostatistics 2 RSS6 2014Basics in Epidemiology & Biostatistics 2 RSS6 2014
Basics in Epidemiology & Biostatistics 2 RSS6 2014
 
Review of Chapters 1-5.ppt
Review of Chapters 1-5.pptReview of Chapters 1-5.ppt
Review of Chapters 1-5.ppt
 
Quantitative Research Design.pptx
Quantitative Research Design.pptxQuantitative Research Design.pptx
Quantitative Research Design.pptx
 
Statistics ppt.ppt
Statistics ppt.pptStatistics ppt.ppt
Statistics ppt.ppt
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 

Más de Manoj Sharma

Experimental designs and data analysis in the field of soil science by making...
Experimental designs and data analysis in the field of soil science by making...Experimental designs and data analysis in the field of soil science by making...
Experimental designs and data analysis in the field of soil science by making...
Manoj Sharma
 
Sampling Techniques, Data Collection and tabulation in the field of Social Sc...
Sampling Techniques, Data Collection and tabulation in the field of Social Sc...Sampling Techniques, Data Collection and tabulation in the field of Social Sc...
Sampling Techniques, Data Collection and tabulation in the field of Social Sc...
Manoj Sharma
 
Issues and Challenges of a Community Scientist at KVK and Way Forward
Issues and Challenges of a Community Scientist at KVK and Way ForwardIssues and Challenges of a Community Scientist at KVK and Way Forward
Issues and Challenges of a Community Scientist at KVK and Way Forward
Manoj Sharma
 
Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...
Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...
Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...
Manoj Sharma
 
Technology Assessment and Refinement for Its Adoption
Technology Assessment and  Refinement for Its AdoptionTechnology Assessment and  Refinement for Its Adoption
Technology Assessment and Refinement for Its Adoption
Manoj Sharma
 
Role and Responsibilities of Community Scientist in a KVK
Role and Responsibilities of    Community Scientist in a KVKRole and Responsibilities of    Community Scientist in a KVK
Role and Responsibilities of Community Scientist in a KVK
Manoj Sharma
 
Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...
Manoj Sharma
 

Más de Manoj Sharma (8)

Experimental designs and data analysis in the field of soil science by making...
Experimental designs and data analysis in the field of soil science by making...Experimental designs and data analysis in the field of soil science by making...
Experimental designs and data analysis in the field of soil science by making...
 
Sampling Techniques, Data Collection and tabulation in the field of Social Sc...
Sampling Techniques, Data Collection and tabulation in the field of Social Sc...Sampling Techniques, Data Collection and tabulation in the field of Social Sc...
Sampling Techniques, Data Collection and tabulation in the field of Social Sc...
 
Issues and Challenges of a Community Scientist at KVK and Way Forward
Issues and Challenges of a Community Scientist at KVK and Way ForwardIssues and Challenges of a Community Scientist at KVK and Way Forward
Issues and Challenges of a Community Scientist at KVK and Way Forward
 
Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...
Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...
Biometrical Techniques for Analysis of Genotype x Environment Interactions & ...
 
Technology Assessment and Refinement for Its Adoption
Technology Assessment and  Refinement for Its AdoptionTechnology Assessment and  Refinement for Its Adoption
Technology Assessment and Refinement for Its Adoption
 
Role and Responsibilities of Community Scientist in a KVK
Role and Responsibilities of    Community Scientist in a KVKRole and Responsibilities of    Community Scientist in a KVK
Role and Responsibilities of Community Scientist in a KVK
 
Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...Experimental designs and data analysis in the field of Agronomy science by ma...
Experimental designs and data analysis in the field of Agronomy science by ma...
 
Writing Guidelines for Journal of Krishi Vigyan (www.iskv.in)
Writing Guidelines for Journal of Krishi Vigyan(www.iskv.in)Writing Guidelines for Journal of Krishi Vigyan(www.iskv.in)
Writing Guidelines for Journal of Krishi Vigyan (www.iskv.in)
 

Último

Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
AnaAcapella
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
ssuserdda66b
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Último (20)

Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Spellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please PractiseSpellings Wk 3 English CAPS CARES Please Practise
Spellings Wk 3 English CAPS CARES Please Practise
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Dyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptxDyslexia AI Workshop for Slideshare.pptx
Dyslexia AI Workshop for Slideshare.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 

Statistical Methods in Research

  • 1.
  • 2. Statistical Methods in Research Dr Kiran Gaur Associate Professor & Head Department of Statistics, Mathematics & Computer Science SKN Colloge of Agriculture, Jobner
  • 3. Statistics Descriptive statistics – Methods of organizing, summarizing, and presenting data in an informative way Inferential statistics – The methods used to determine something about a population on the basis of a sample Inference is the process of drawing conclusions or making decisions about a population based on sample results
  • 4. Types of variables Variables Quantitative Qualitative Dichotomic Polynomic Discrete Continuous Gender, marital status Brand of Pc, hair color Children in family, Strokes on a golf hole Amount of income tax paid, weight of a student
  • 5. Types of Measurement Scale Nominal Scale Colour, Region , gender etc. Ordinal Scale Size, grades, SEB etc. Interval Scale Temperature, certain size measurement etc. Ratio Scale Height, weight, income etc.
  • 6. Frequency distribution The frequency with which observations are assigned to each category or point on a measurement scale. Most basic form of descriptive statistics May be expressed as a percentage of the total sample found in each category The distribution is “read” differently depending upon the measurement level Nominal scales are read as discrete measurements at each level Ordinal measures show tendencies, but categories should not be compared Interval and ratio scales allow for comparison among categories
  • 9. Commonly Used Graphs in Business Research
  • 10. A Taxonomy of Statistics
  • 11. 11 Central Tendency • Statistical measure that determines a single value that accurately describes the center of the distribution and represents the entire distribution of scores. • By identifying the "average score," central tendency allows researchers to summarize or condense a large set of data into a single value. • In addition, it is possible to compare two (or more) sets of data by simply comparing the average score (central tendency) for one set versus the average score for another set.
  • 12. Measures of central tendency • These measures give us an idea what the ‘typical’ case in a distribution • Mean- • The ‘average’ score—sum of all individual scores divided by the number of scores • Has a number of useful statistical properties however, can be sensitive to extreme scores (“outliers”) • many statistics are based on the mean • Mode - the most frequent score in a distribution • good for nominal data • Median - the midpoint or mid score in a distribution. • 50% cases above/50% cases below insensitive to extreme cases Ordinal or ratio
  • 14. Dispersion • Some statistics look at how widely scattered over the scale the individual scores are • Groups with identical means can be more or less widely dispersed • To find out how the group is distributed, we need to know how far from or close to the mean individual scores are • Like the mean, these statistics are only meaningful for interval or ratio-level measures
  • 15. Estimates of Dispersion • Range • Distance between the highest and lowest scores in a distribution; • sensitive to extreme scores; • Can compensate by calculating inter quartile range (distance between the 25th and 75th percentile points) which represents the range of scores for the middle half of a distribution
  • 16. Variance (S2) • Average of squared distances of individual points from the mean • sample variance • High variance means that most scores are far away from the mean. Low variance indicates that most scores cluster tightly about the mean. • The amount that one score differs from the mean is called its deviation score (deviate) • The sum of all deviation scores in a sample is called the sum of squares Estimates of dispersion Standard Deviation (SD) A summary statistic of how much scores vary from the mean Square root of the Variance • expressed in the original units of measurement • Represents the average amount of dispersion in a sample • Used in a number of inferential statistics
  • 17. Measures the peackedness of a distribution; Leptokurtic (positive excess kurtosis, i.e. fatter tails), Mesokurtic, Platykurtic (negative excess kurtosis, i.e. thinner tails), Skewness: Kurtosis: Measures the skewness of a distribution; Positive or Negative skewness Shape of the Distribution
  • 19. Normal distribution • Many characteristics are distributed through the population in a ‘normal’ manner • Normal curves have well-defined statistical properties • Parametric statistics are based on the assumption that the variables are distributed normally Most commonly used statistics • This is the famous “Bell curve” where many cases fall near the middle of the distribution and few fall very high or very low
  • 21. Data Transformation • With skewed data, the mean is not a good measure of central tendency because it is sensitive to extreme scores • May need to transform skewed data to make distribution appear more normal or symmetrical • Must determine the degree & type of skewness prior to transformation
  • 22. Correlation and Regression Correlation describes the strength of a linear relationship between two variables Linear means “straight line” Measures- Scatter Plot Karl Pearson Correlation Coefficient Spearman’s Rank Correlation
  • 23.
  • 24. Regression tells us how to draw the straight line described by the correlation. It is the technique concerned with predicting some variables by knowing others i.e the process of predicting variable Y using variable X
  • 25.
  • 26. Multiple regression analysis Multiple regression analysis is a straight forward extension of simple regression analysis which allows more than one independent variable. Y = a + b1X1 + b2X2 + …bkXk ; The b’s are called partial regression coefficients
  • 27. Statistical Inference Use a random sample to learn something about a larger population Statistical inference: Drawing conclusions about the whole population on the basis of a sample Precondition for statistical inference: A sample is randomly selected from the population (probability sample)
  • 28. Hypotheses The null hypothesis, denoted H0, is the claim that is initially assumed to be true. The alternative hypothesis, denoted by Ha, is the assertion that is contrary to H0. Possible conclusions from hypothesis-testing analysis are reject H0 or fail to reject H0. Rules for Hypotheses H0 is always stated as an equality claim involving parameters. Ha is an inequality claim that contradicts H0. It may be one-sided (using either > or <) or two-sided (using ≠).
  • 29. Steps for Hypothesis Testing Draw Marketing Research Conclusion Formulate H0 and H1 Select Appropriate Test Choose Level of Significance Determine Prob Assoc with Test Stat Determine Critical Value of Test Stat TSCR Determine if TSCR falls into (Non) Rejection Region Compare with Level of Significance,  Reject/Do not Reject H0 Calculate Test Statistic TSCAL
  • 30. Choice of an Appropriate Test
  • 31.
  • 32. What size sample do We need? The answer to this question is influenced by a number of factors,viz ➢The purpose of the study ➢Population size ➢The risk of selecting a “bad” sample ➢The allowable sampling error ➢Most of all whether undertaking a qualitative or quantitative study Different approaches for study designs , such as cross section, case-control, cohort design, longitudinal study, diagnostics test study etc.
  • 33. Sample Size Determination Criteria ➢ Level of confidence ( Normally 95%) ➢ Margin of Error (Usually 1%, 3% or 5%) ➢ Degree of variability in the attributes being measured (Prevalence) More homogeneous population → Smaller sample size More heterogeneous population → Large sample size for desired precision.
  • 34. Sample size Quantitative Qualitative n = Z2 σ2 𝑒 2 n = (Z2σ2𝑁) e2 𝑁 − 1 + Z2σ2 n = Z2𝑃𝑄 e2 n = (Z2𝑃𝑄𝑁) e2 𝑁−1 +Z2𝑃𝑄 Infinite Population Finite Population
  • 36. Online Sample Size Calculator https://www.surveysystem.com/sscalc.htm https://www.calculator.net/sample-size-calculator http://www.raosoft.com/samplesize.html https://www.stat.ubc.ca/~rollin/stats/ssize/n2.html
  • 37. P-Value Definition: P-value is the probability of obtaining a sample “more extreme” than one observed from the sample data, if the null hypothesis is true
  • 40.
  • 41. Caution : The P-value was never intended to be a substitute for scientific reasoning
  • 42. Multivariate Analysis Techniques • Multiple regression • Canonical correlation • Discriminant analysis • Logistic regression • Survival analysis • Principal component analysis • Factor analysis • Cluster analysis
  • 43. Thank You… “All the statistics in the world can’t measure the warmth of a smile.” Chris Hart