SlideShare una empresa de Scribd logo
1 de 36
Lecture Series on
  Biostatistics

                       No. Biostat -8
                      Date:25.01.2009


  MEASURES OF DISPERSION,
   RELATIVE STANDING AND
          SHAPE

              Dr. Bijaya Bhusan Nanda,
          M. Sc (Gold Medalist) Ph. D. (Stat.)
       Topper Orissa Statistics & Economics Services, 1988
               bijayabnanda@yahoo.com
CONTENTS
   What is measures of dispersion?
   Why measures of dispersion?
   How measures of dispersions are calculated?
    Range
    Quartile deviation or semi inter-quartile range,
    Mean deviation and
    Standard deviation.
    Methods for detecting outlier
   Measure of Relative Standing
   Measure of shape
LEARNING OBJECTIVE

   They will be able to:
    describe the homogeneity or heterogeneity
      of the distribution,
    understand the reliability of the mean,
    compare the distributions as regards the
      variability.
    describe the relative standing of the data
      and also shape of the distribution.
What is measures of dispersion?
         (Definition)
Central tendency measures do not
 reveal the variability present in the
 data.
Dispersion is the scattered ness of
 the data series around it average.
Dispersion is the extent to which
 values in a distribution differ from the
 average of the distribution.
Why measures of dispersion?
          (Significance)
   Determine the reliability of an
    average
   Serve as a basis for the control
    of the variability
   To compare the variability of
    two or more series and
   Facilitate the use of other
    statistical measures.
Dispersion Example

   Number of minutes 20         X:Mean Time – 14.6
    clients waited to see a
    consulting doctor             minutes
       Consultant Doctor         Y:Mean waiting time
        X           Y             14.6 minutes
    05     15     15     16      What is the difference
    12     03     12     18       in the two series?
    04     19     15     14
    37     11     13     17
    06     34     11     15

X: High variability, Less consistency.
Y: Low variability, More Consistency
Frequency curve of distribution of
           three sets of data


      C

                                    A
B
Characteristics of an Ideal Measure of
               Dispersion
 It should be rigidly defined.
1.

 It should be easy to understand and easy to calculate.
2.

 It should be based on all the observations of the data.
3.

 It should be easily subjected to further mathematical
4.



 It should be least affected by the sampling fluctuation .
5.

 It should not be unduly affected by the extreme values.
6.
How dispersions are measured?
   Measure of dispersion:
   Absolute: Measure the dispersion in the
    original unit of the data.
   Variability in 2 or more distrn can be
    compared provided they are given in the
    same unit and have the same average.
   Relative: Measure of dispersion is free from
    unit of measurement of data.
   It is the ratio of a measaure of absolute
    dispersion to the average, from which
    absolute deviations are measured.
   It is called as co-efficient of dispersion.
How dispersions are measured? Contd.
    The following measures of
     dispersion are used to study the
     variation:
      The range
      The inter quartile range and
        quartile deviation
      The mean deviation or average
        deviation
      The standard deviation
How dispersions are measured? Contd.
Range:
The difference between the values of the two
extreme items of a series.
Example:
Age of a sample of 10 subjects from a population
of 169subjects are:
   X1   X2   X3   X4   X5   X6   X7   X8   X9   X10
   42 28 28 61 31 23 50 34 32 37
 The youngest subject in the sample is
 23years old and the oldest is 61 years, The
 range: R=XL – Xs
              = 61-23 =38
Co-efficient of Range:
R = (XL - XS) / (XL + XS)
  =   (61 -23) / (61 + 23) =38 /84 = 0.452

 Characteristics of Range
  Simplest and most crude measure of
   dispersion
  It is not based on all the observations.
  Unduly affected by the extreme values
   and fluctuations of sampling.
  The range may increase with the size of
   the set of observations though it can
   decrease
  Gives an idea of the variability very
   quickly
Percentiles, Quartiles (Measure of Relative Standing)
                   and Interquartile Range
   Descriptive measures that locate the relative position of an
    observation in relation to the other observations are called
    measures of relative standing.
   They are quartiles, deciles and percentiles
   The quartiles & the median divide the array into four equal parts,
    deciles into ten equal groups, and percentiles into one hundred
    equal groups.
   Given a set of n observations X1, X2, …. Xn, the pth percentile ‘P’ is the
    value of X such that ‘p’ per cent of the observations are less than
    and 100 –p per cent of the observations are greater than P.
   25th percentile = 1st Quartile i.e., Q1
   50th percentile = 2nd Quartile i.e., Q2
   75th percentile = 3rd Quartile i.e., Q3
QL      M      QU
Figure 8.1 Locating of lower, mid and upper quartiles
Percentiles, Quartiles and Interquartile Range Contd.
               n+1
        Q1 =       th ordered observation
                4

        Q2 = 2(n+1) th ordered observation
                4
        Q3 = 3(n+1) th ordered observation
                4
Interquartile Range (IQR): The difference
between the 3rd and 1st quartile.
IQR = Q3 – Q1
Semi Interquartile Range:= (Q3 – Q1)/ 2
Coefficient of quartile deviation:
(Q3 – Q1)/(Q3 + Q1)
Interquartile Range
Merits:
 It is superior to range as a measure of dispersion.
 A special utility in measuring variation in case of open end
  distribution or one which the data may be ranked but measured
  quantitatively.
 Useful in erratic or badly skewed distribution.
 The Quartile deviation is not affected by the presence of
  extreme values.
Limitations:
 As the value of quartile deviation dose not depend upon every
  item of the series it can’t be regarded as a good method of
  measuring dispersion.
 It is not capable of mathematical manipulation.
 Its value is very much affected by sampling fluctuation.
 Another measure of relative standing is the z-score
  for an observation (or standard score).
 It describes how far individual item in a distribution
  departs from the mean of the distribution.
 Standard score gives us the number of standard
  deviations, a particular observation lies below or above
  the mean.
  Standard xscore (or z -score) is defined as follows:
                  z-score=
  For a population:                 X-µ
                               σ
  where X =the observation from the population
  µ the population mean, σ = the population s.d
      For a sample z-score=   X-X
                               s
  where X =the observation from the sample
  X the sample mean, s = the sample s.d
Mean Absolute Deviation (MAD) or Mean
                  Deviation (MD)
   The average of difference of the values of items from some average
    of the series (ignoring negative sign), i.e. the arithmetic mean of the
    absolute differences of the values from their average .

Note:
1. MD is based on all values and hence cannot be calculated for open-
   ended distributions.
2. It uses average but ignores signs and hence appears unmethodical.
3. MD is calculated from mean as well as from median for both
   ungrouped data using direct method and for continuous
   distribution using assumed mean method and short-cut-method.
4. The average used is either the arithmetic mean or median
Computation of Mean absolute Deviation
For individual series: X1, X2, ……… Xn
                       ∑ |Xi -X|
             M.A.D =
                            n
For discrete series: X1, X2, ……… Xn & with
  corresponding frequency f1, f2, ……… fn
                         ∑ fi |Xi -X|
              M.A.D =        ∑ fi
 X: Mean of the data series.
Computation of Mean absolute Deviation:
For continuous grouped data: m1, m2, …… mn are the
class mid points with corresponding class
frequency f1, f2, ……… fn
                           ∑ fi|mi -X|
                   M.A.D =      ∑fi
 X: Mean of the data series.
Coeff. Of MAD: = (MAD /Average)
The average from which the Deviations are
calculated. It is a relative measure of dispersion
and is comparable to similar measure of other
series.
Example:
  Find MAD of Confinement after delivery in the
  following series.
   Days of       No. of          Total days of       Absolute    fi|Xi - X|
 Confinement   patients (f)   confinement of each    Deviation
     ( X)                          group Xf         from mean
                                                      |X - X |
      6             5                 30              1.61         8.05
      7             4                 28              0.61         2.44
      8             4                 32              1.61         6.44
      9             3                 27              2.61         7.83
     10            2                  20              3.61        7.22
    Total          18                137                          31.98

   X = Mean days of confinement = 137 / 18 = 7.61
MAD=31.98 / 18=1.78, Coeff.of MAD= 1.78/7.61=0.233
Problem:
       Find the MAD of weight and coefficient of MAD of
470 infants born in a hospital in one year from following
table.

Weight   2.0-2.4   2.5-2.9   3.0-3.4   3.5-3.9   4.0-4.4   4.5+
in Kg
No. of     17        97       187       135        28       6
infant
Merits and Limitations of MAD
   Simple to understand and easy to compute.
   Based on all observations.
   MAD is less affected by the extreme items than
    the Standard deviation.
   Greatest draw back is that the algebraic signs
    are ignored.
   Not amenable to further mathematical
    treatment.
   MAD gives us best result when deviation is
    taken from median. But median is not
    satisfactory for large variability in the data. If
    MAD is computed from mode, the value of the
    mode can not be determined always.
Standard Deviation (σ)
 It is the positive square root of the average of squares
of deviations of the observations from the mean. This is
        also called root mean squared deviation (σ) .
For individual series: x1, x2, ……… xn
           Σ ( xi–x )2                      ∑xi2 ∑xi 2
 σ=
       √   ------------
                 n
                                 σ=
                                      √     n    -( n )
   For discrete series: X1, X2, ……… Xn & with
     corresponding frequency f1, f2, ……… fn

        Σ fi ( xi–x )2
σ=                                  ∑fixi2        ∑fixi 2
        ------------
              Σ fi
                            σ=
                                      ∑fi       -( ∑ f )
                                                    i
Standard Deviation (σ) Contd.

For continuous grouped series with class
  midpoints : m1, m2, ……… mn & with
  corresponding frequency f1, f2, ……… fn

          Σ fi ( mi–x )2
σ=                               ∑fimi2      ∑fimi 2
      √   ------------
                Σ fi
                           σ=
                                  ∑ fi    -( ∑ f )
                                               i

     Variance: It is the square of the s.d
Coefficient of Variation (CV): Corresponding
Relative measure of dispersion.
                         σ
         CV =         ------- × 100
                         X
Characteristics of Standard Deviation:
   SD is very satisfactory and most widely used
    measure of dispersion
   Amenable for mathematical manipulation
   It is independent of origin, but not of scale
   If SD is small, there is a high probability for
    getting a value close to the mean and if it is large,
    the value is father away from the mean
   Does not ignore the algebraic signs and it is less
    affected by fluctuations of sampling
   SD can be calculated by :
     • Direct method
     • Assumed mean method.
     • Step deviation method.
 It is the average of the distances of the observed
  values from the mean value for a set of data
 Basic rule --More spread will yield a larger SD

Uses of the standard deviation
 The standard deviation enables us to determine,

  with a great deal of accuracy, where the values
  of a frequency distribution are located in relation
  to the mean.
 Chebyshev’s Theorem

   • For any data set with the mean ‘µ’ and the
       standard deviation ‘s’ at least 75% of the
       values will fall within the 2σ interval and at
       least 89% of the values will fall within the 3σ
       interval of the mean
TABLE: Calculation of the standard deviation (σ)
Weights of 265 male students at the university of Washington
 Class-Interval   f   d    fd   fd2
                                                     (Σƒd2)                (Σfd)2
   (Weight)
                                      σ=                              -              ×(i)
                                                      n                       n2
     90-99        1   -5   -5   25
    100-109       1   -4   -4   16                 931                    (99)2
    110-119     9   -3  -27     81    =                           -                 ×(10)
                                                    265                    265
    120-129    30   -2  -60    120
    130-139    42   -1  -42     42
    140-149    66    0    0      0                   (3.5132 – 0.1396) (×10)
                                      =
    150-159    47    1   47     47
    160-169    39    2   78    156    =             (1.8367) (10)
    170-179    15    3   45    135
    180-189    11    4   44    176
                                      =             18.37 or 18.4
                                          d = (Xi –A)/i           n = Σfi
    190-199     1    5    5     25
    200-209     3    6   18    108        .   A = 144.5, i = 10
n =265      Σƒd= 99    Σƒd2 = 931
 Means, standard deviation, and coefficients of variation of the age
  distributions of four groups of mothers who gave birth to one or
  more children in the city of minneapol in: 1931 to 1935. Interprete
  the data
        CLASSIFICATION                   X        σ          CV
Resident married                       28.2      6.0         21.3
Non-resident married                   29.5      6.0        20.3
Resident unmarried                     23.4      5.8        24.8

Non-resident unmarried                 21.7      3.7        17.1


Example: Suppose that each day laboratory technician A completes
40 analyses with a standard deviation of 5. Technician B completes
160 analyses per day with a standard deviation of 15. Which
employee shows less variability?
Uses of Standard deviation
   Uses of the standard deviation
    • The standard deviation enables us to
      determine, with a great deal of accuracy,
      where the values of a frequency
      distribution are located in relation to the
      mean. We can do this according to a
      theorem devised by the Russian
      mathematician P.L. Chebyshev (1821-
      1894).
Measure of Shape
The fourth important numerical characteristic of a
data set is its shape: Skewness and kurtosis.
 Skewness
    • Skewness characterizes the degree of
      asymmetry of a distribution around its
      mean. For a sample data, the
      skewness is defined by the formula:

                                                              3
                                    n          n
                                                  xi − x 
                  Skewness =                ∑ s 
                             (n − 1)(n − 2) i =1         
where n = the number of observations in the sample,
xi= ith observation in the sample, s= standard deviation of
the sample, x = sample mean
Measure of Shape




Figure 8.2 +ve or Right-skewed
           distribution
Kurtosis:
Kurtosis characterizes the relative peakedness or flatness of
a distribution compared with the bell-shaped distribution
(normal distribution).
Kurtosis of a sample data set is calculated by the formula:

                 
                         n(n + 1)         n
                                               xi − x  
                                                        4
                                                           3(n − 1) 2
      Kurtosis =                        ∑  s   − (n − 2)(n − 3)
                  (n − 1)(n − 2)(n − 3) i =1          
                                                         


Positive kurtosis indicates a relatively peaked distribution.
Negative kurtosis indicates a relatively flat distribution.
The distributions with positive and negative kurtosis
are depicted in Figure 8.4 , where the distribution with
null kurtosis is normal distribution.
REFERENCE
1.   Mathematical Statistics- S.P Gupta
2.   Statistics for management- Richard I.
     Levin, David S. Rubin
3.   Biostatistics A foundation for Analysis
     in the Health Sciences.
THANK YOU

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Correlation
CorrelationCorrelation
Correlation
 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation Analysis
 
Parametric tests
Parametric testsParametric tests
Parametric tests
 
Standard error-Biostatistics
Standard error-BiostatisticsStandard error-Biostatistics
Standard error-Biostatistics
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
Non-Parametric Tests
Non-Parametric TestsNon-Parametric Tests
Non-Parametric Tests
 
descriptive and inferential statistics
descriptive and inferential statisticsdescriptive and inferential statistics
descriptive and inferential statistics
 
Correlation
CorrelationCorrelation
Correlation
 
Chi squared test
Chi squared testChi squared test
Chi squared test
 
Type i and type ii errors
Type i and type ii errorsType i and type ii errors
Type i and type ii errors
 
Regression
RegressionRegression
Regression
 
Correlation
CorrelationCorrelation
Correlation
 
Definition of dispersion
Definition of dispersionDefinition of dispersion
Definition of dispersion
 
Skewness
SkewnessSkewness
Skewness
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Measures of Central Tendency - Biostatstics
Measures of Central Tendency - BiostatsticsMeasures of Central Tendency - Biostatstics
Measures of Central Tendency - Biostatstics
 
Central tendency
Central tendencyCentral tendency
Central tendency
 
non parametric statistics
non parametric statisticsnon parametric statistics
non parametric statistics
 
Non parametric tests
Non parametric testsNon parametric tests
Non parametric tests
 
Skewness and kurtosis
Skewness and kurtosisSkewness and kurtosis
Skewness and kurtosis
 

Destacado

Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationVasant Kothari
 
Emotional Disorders Presentation
Emotional Disorders PresentationEmotional Disorders Presentation
Emotional Disorders Presentationmendiolamae
 
Emotional disturbance
Emotional disturbanceEmotional disturbance
Emotional disturbanceellentschopp
 
What is a partial correlation?
What is a partial correlation?What is a partial correlation?
What is a partial correlation?Ken Plummer
 
Partial and multiple correlation and regression
Partial and multiple correlation and regressionPartial and multiple correlation and regression
Partial and multiple correlation and regressionKritika Jain
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Upama Dwivedi
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variabilityMark Santos
 
Normal Curve and Standard Scores
Normal Curve and Standard ScoresNormal Curve and Standard Scores
Normal Curve and Standard ScoresJenewel Azuelo
 
Normal distribution
Normal distributionNormal distribution
Normal distributionSteve Bishop
 
Mean Median Mode
Mean Median ModeMean Median Mode
Mean Median Modehiratufail
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)Harsh Upadhyay
 
Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Ramnath Takiar
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionAzmi Mohd Tamil
 
Mean, median, mode, & range ppt
Mean, median, mode, & range pptMean, median, mode, & range ppt
Mean, median, mode, & range pptmashal2013
 
Statistics
StatisticsStatistics
Statisticspikuoec
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Sneh Kumari
 

Destacado (20)

Variability
VariabilityVariability
Variability
 
Mpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlationMpc 006 - 02-03 partial and multiple correlation
Mpc 006 - 02-03 partial and multiple correlation
 
Statistics report THE RANGE
Statistics report THE RANGEStatistics report THE RANGE
Statistics report THE RANGE
 
Emotional Disorders Presentation
Emotional Disorders PresentationEmotional Disorders Presentation
Emotional Disorders Presentation
 
Emotional disturbance
Emotional disturbanceEmotional disturbance
Emotional disturbance
 
What is a partial correlation?
What is a partial correlation?What is a partial correlation?
What is a partial correlation?
 
Partial and multiple correlation and regression
Partial and multiple correlation and regressionPartial and multiple correlation and regression
Partial and multiple correlation and regression
 
Normal Distribution
Normal DistributionNormal Distribution
Normal Distribution
 
Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)Brm (one tailed and two tailed hypothesis)
Brm (one tailed and two tailed hypothesis)
 
Measures of variability
Measures of variabilityMeasures of variability
Measures of variability
 
Normal Curve and Standard Scores
Normal Curve and Standard ScoresNormal Curve and Standard Scores
Normal Curve and Standard Scores
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Mean Median Mode
Mean Median ModeMean Median Mode
Mean Median Mode
 
Simple linear regression (final)
Simple linear regression (final)Simple linear regression (final)
Simple linear regression (final)
 
Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)Test of significance (t-test, proportion test, chi-square test)
Test of significance (t-test, proportion test, chi-square test)
 
Pearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear RegressionPearson Correlation, Spearman Correlation &Linear Regression
Pearson Correlation, Spearman Correlation &Linear Regression
 
Pearson Correlation
Pearson CorrelationPearson Correlation
Pearson Correlation
 
Mean, median, mode, & range ppt
Mean, median, mode, & range pptMean, median, mode, & range ppt
Mean, median, mode, & range ppt
 
Statistics
StatisticsStatistics
Statistics
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
 

Similar a Measures of dispersion

Lecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptxLecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptxNabeelAli89
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptxVanmala Buchke
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersionDrZahid Khan
 
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Neeraj Bhandari
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
Kwoledge of calculation of mean,median and mode
Kwoledge of calculation of mean,median and modeKwoledge of calculation of mean,median and mode
Kwoledge of calculation of mean,median and modeAarti Vijaykumar
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
Basics of biostatistic
Basics of biostatisticBasics of biostatistic
Basics of biostatisticNeurologyKota
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersionWaqar Abbasi
 
Bio statistics
Bio statisticsBio statistics
Bio statisticsNc Das
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variationRaj Teotia
 
Unit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdf
Unit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdfUnit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdf
Unit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdfAravindS199
 
presentation
presentationpresentation
presentationPwalmiki
 
Student’s presentation
Student’s presentationStudent’s presentation
Student’s presentationPwalmiki
 
Biostatistics
BiostatisticsBiostatistics
Biostatisticspriyarokz
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersionKirti Gupta
 
Measures of Central Tendency and Dispersion (Week-07).pptx
Measures of Central Tendency and Dispersion (Week-07).pptxMeasures of Central Tendency and Dispersion (Week-07).pptx
Measures of Central Tendency and Dispersion (Week-07).pptxSarmadAltafHafizAlta
 

Similar a Measures of dispersion (20)

Lecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptxLecture. Introduction to Statistics (Measures of Dispersion).pptx
Lecture. Introduction to Statistics (Measures of Dispersion).pptx
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
Kwoledge of calculation of mean,median and mode
Kwoledge of calculation of mean,median and modeKwoledge of calculation of mean,median and mode
Kwoledge of calculation of mean,median and mode
 
Statistics
StatisticsStatistics
Statistics
 
MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
Basics of biostatistic
Basics of biostatisticBasics of biostatistic
Basics of biostatistic
 
Measure of dispersion
Measure of dispersionMeasure of dispersion
Measure of dispersion
 
Bio statistics
Bio statisticsBio statistics
Bio statistics
 
Measures of dispersion or variation
Measures of dispersion or variationMeasures of dispersion or variation
Measures of dispersion or variation
 
Unit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdf
Unit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdfUnit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdf
Unit 1 - Measures of Dispersion - 18MAB303T - PPT - Part 2.pdf
 
presentation
presentationpresentation
presentation
 
Student’s presentation
Student’s presentationStudent’s presentation
Student’s presentation
 
Biostatistics
BiostatisticsBiostatistics
Biostatistics
 
5.DATA SUMMERISATION.ppt
5.DATA SUMMERISATION.ppt5.DATA SUMMERISATION.ppt
5.DATA SUMMERISATION.ppt
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersion
 
Measures of Central Tendency and Dispersion (Week-07).pptx
Measures of Central Tendency and Dispersion (Week-07).pptxMeasures of Central Tendency and Dispersion (Week-07).pptx
Measures of Central Tendency and Dispersion (Week-07).pptx
 

Más de Southern Range, Berhampur, Odisha

Understanding Stress, Stressors, Signs and Symptoms of Stress
Understanding Stress, Stressors, Signs and Symptoms of StressUnderstanding Stress, Stressors, Signs and Symptoms of Stress
Understanding Stress, Stressors, Signs and Symptoms of StressSouthern Range, Berhampur, Odisha
 

Más de Southern Range, Berhampur, Odisha (20)

Growth and Instability in Food Grain production in Oisha
Growth and Instability in Food Grain production  in OishaGrowth and Instability in Food Grain production  in Oisha
Growth and Instability in Food Grain production in Oisha
 
Growth and Instability of OIlseeds Production in Odisha
Growth and Instability of OIlseeds Production in OdishaGrowth and Instability of OIlseeds Production in Odisha
Growth and Instability of OIlseeds Production in Odisha
 
Statistical thinking and development planning
Statistical thinking and development planningStatistical thinking and development planning
Statistical thinking and development planning
 
CAN PLEASURE BE A JOURNEY RATHER THAN A STOP?
CAN PLEASURE BE A JOURNEY RATHER THAN A STOP?CAN PLEASURE BE A JOURNEY RATHER THAN A STOP?
CAN PLEASURE BE A JOURNEY RATHER THAN A STOP?
 
Monitoring and Evaluation
Monitoring and EvaluationMonitoring and Evaluation
Monitoring and Evaluation
 
Anger Management
Anger ManagementAnger Management
Anger Management
 
A Simple Promise for Happiness
A Simple Promise for HappinessA Simple Promise for Happiness
A Simple Promise for Happiness
 
Measures of mortality
Measures of mortalityMeasures of mortality
Measures of mortality
 
Understanding data through presentation_contd
Understanding data through presentation_contdUnderstanding data through presentation_contd
Understanding data through presentation_contd
 
Nonparametric and Distribution- Free Statistics _contd
Nonparametric and Distribution- Free Statistics _contdNonparametric and Distribution- Free Statistics _contd
Nonparametric and Distribution- Free Statistics _contd
 
Nonparametric and Distribution- Free Statistics
Nonparametric and Distribution- Free Statistics Nonparametric and Distribution- Free Statistics
Nonparametric and Distribution- Free Statistics
 
Simple and Powerful promise for Peace and Happiness
Simple and Powerful promise for Peace and HappinessSimple and Powerful promise for Peace and Happiness
Simple and Powerful promise for Peace and Happiness
 
Understanding Stress, Stressors, Signs and Symptoms of Stress
Understanding Stress, Stressors, Signs and Symptoms of StressUnderstanding Stress, Stressors, Signs and Symptoms of Stress
Understanding Stress, Stressors, Signs and Symptoms of Stress
 
Chi sqr
Chi sqrChi sqr
Chi sqr
 
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
Inferential statistics-estimation
Inferential statistics-estimationInferential statistics-estimation
Inferential statistics-estimation
 
Sampling methods in medical research
Sampling methods in medical researchSampling methods in medical research
Sampling methods in medical research
 
Probability concept and Probability distribution_Contd
Probability concept and Probability distribution_ContdProbability concept and Probability distribution_Contd
Probability concept and Probability distribution_Contd
 
Probability concept and Probability distribution
Probability concept and Probability distributionProbability concept and Probability distribution
Probability concept and Probability distribution
 

Último

APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfSanaAli374401
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 

Último (20)

APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 

Measures of dispersion

  • 1. Lecture Series on Biostatistics No. Biostat -8 Date:25.01.2009 MEASURES OF DISPERSION, RELATIVE STANDING AND SHAPE Dr. Bijaya Bhusan Nanda, M. Sc (Gold Medalist) Ph. D. (Stat.) Topper Orissa Statistics & Economics Services, 1988 bijayabnanda@yahoo.com
  • 2. CONTENTS  What is measures of dispersion?  Why measures of dispersion?  How measures of dispersions are calculated? Range Quartile deviation or semi inter-quartile range, Mean deviation and Standard deviation. Methods for detecting outlier  Measure of Relative Standing  Measure of shape
  • 3. LEARNING OBJECTIVE  They will be able to: describe the homogeneity or heterogeneity of the distribution, understand the reliability of the mean, compare the distributions as regards the variability. describe the relative standing of the data and also shape of the distribution.
  • 4. What is measures of dispersion? (Definition) Central tendency measures do not reveal the variability present in the data. Dispersion is the scattered ness of the data series around it average. Dispersion is the extent to which values in a distribution differ from the average of the distribution.
  • 5. Why measures of dispersion? (Significance)  Determine the reliability of an average  Serve as a basis for the control of the variability  To compare the variability of two or more series and  Facilitate the use of other statistical measures.
  • 6. Dispersion Example  Number of minutes 20  X:Mean Time – 14.6 clients waited to see a consulting doctor minutes Consultant Doctor  Y:Mean waiting time X Y 14.6 minutes 05 15 15 16  What is the difference 12 03 12 18 in the two series? 04 19 15 14 37 11 13 17 06 34 11 15 X: High variability, Less consistency. Y: Low variability, More Consistency
  • 7. Frequency curve of distribution of three sets of data C A B
  • 8. Characteristics of an Ideal Measure of Dispersion It should be rigidly defined. 1. It should be easy to understand and easy to calculate. 2. It should be based on all the observations of the data. 3. It should be easily subjected to further mathematical 4. It should be least affected by the sampling fluctuation . 5. It should not be unduly affected by the extreme values. 6.
  • 9. How dispersions are measured?  Measure of dispersion:  Absolute: Measure the dispersion in the original unit of the data.  Variability in 2 or more distrn can be compared provided they are given in the same unit and have the same average.  Relative: Measure of dispersion is free from unit of measurement of data.  It is the ratio of a measaure of absolute dispersion to the average, from which absolute deviations are measured.  It is called as co-efficient of dispersion.
  • 10. How dispersions are measured? Contd.  The following measures of dispersion are used to study the variation:  The range  The inter quartile range and quartile deviation  The mean deviation or average deviation  The standard deviation
  • 11. How dispersions are measured? Contd. Range: The difference between the values of the two extreme items of a series. Example: Age of a sample of 10 subjects from a population of 169subjects are: X1 X2 X3 X4 X5 X6 X7 X8 X9 X10 42 28 28 61 31 23 50 34 32 37 The youngest subject in the sample is 23years old and the oldest is 61 years, The range: R=XL – Xs = 61-23 =38
  • 12. Co-efficient of Range: R = (XL - XS) / (XL + XS) = (61 -23) / (61 + 23) =38 /84 = 0.452 Characteristics of Range  Simplest and most crude measure of dispersion  It is not based on all the observations.  Unduly affected by the extreme values and fluctuations of sampling.  The range may increase with the size of the set of observations though it can decrease  Gives an idea of the variability very quickly
  • 13. Percentiles, Quartiles (Measure of Relative Standing) and Interquartile Range  Descriptive measures that locate the relative position of an observation in relation to the other observations are called measures of relative standing.  They are quartiles, deciles and percentiles  The quartiles & the median divide the array into four equal parts, deciles into ten equal groups, and percentiles into one hundred equal groups.  Given a set of n observations X1, X2, …. Xn, the pth percentile ‘P’ is the value of X such that ‘p’ per cent of the observations are less than and 100 –p per cent of the observations are greater than P.  25th percentile = 1st Quartile i.e., Q1  50th percentile = 2nd Quartile i.e., Q2  75th percentile = 3rd Quartile i.e., Q3
  • 14. QL M QU Figure 8.1 Locating of lower, mid and upper quartiles
  • 15. Percentiles, Quartiles and Interquartile Range Contd. n+1 Q1 = th ordered observation 4 Q2 = 2(n+1) th ordered observation 4 Q3 = 3(n+1) th ordered observation 4 Interquartile Range (IQR): The difference between the 3rd and 1st quartile. IQR = Q3 – Q1 Semi Interquartile Range:= (Q3 – Q1)/ 2 Coefficient of quartile deviation: (Q3 – Q1)/(Q3 + Q1)
  • 16. Interquartile Range Merits:  It is superior to range as a measure of dispersion.  A special utility in measuring variation in case of open end distribution or one which the data may be ranked but measured quantitatively.  Useful in erratic or badly skewed distribution.  The Quartile deviation is not affected by the presence of extreme values. Limitations:  As the value of quartile deviation dose not depend upon every item of the series it can’t be regarded as a good method of measuring dispersion.  It is not capable of mathematical manipulation.  Its value is very much affected by sampling fluctuation.
  • 17.  Another measure of relative standing is the z-score for an observation (or standard score).  It describes how far individual item in a distribution departs from the mean of the distribution.  Standard score gives us the number of standard deviations, a particular observation lies below or above the mean. Standard xscore (or z -score) is defined as follows: z-score= For a population: X-µ σ where X =the observation from the population µ the population mean, σ = the population s.d For a sample z-score= X-X s where X =the observation from the sample X the sample mean, s = the sample s.d
  • 18. Mean Absolute Deviation (MAD) or Mean Deviation (MD)  The average of difference of the values of items from some average of the series (ignoring negative sign), i.e. the arithmetic mean of the absolute differences of the values from their average . Note: 1. MD is based on all values and hence cannot be calculated for open- ended distributions. 2. It uses average but ignores signs and hence appears unmethodical. 3. MD is calculated from mean as well as from median for both ungrouped data using direct method and for continuous distribution using assumed mean method and short-cut-method. 4. The average used is either the arithmetic mean or median
  • 19. Computation of Mean absolute Deviation For individual series: X1, X2, ……… Xn ∑ |Xi -X| M.A.D = n For discrete series: X1, X2, ……… Xn & with corresponding frequency f1, f2, ……… fn ∑ fi |Xi -X| M.A.D = ∑ fi X: Mean of the data series.
  • 20. Computation of Mean absolute Deviation: For continuous grouped data: m1, m2, …… mn are the class mid points with corresponding class frequency f1, f2, ……… fn ∑ fi|mi -X| M.A.D = ∑fi X: Mean of the data series. Coeff. Of MAD: = (MAD /Average) The average from which the Deviations are calculated. It is a relative measure of dispersion and is comparable to similar measure of other series.
  • 21. Example: Find MAD of Confinement after delivery in the following series. Days of No. of Total days of Absolute fi|Xi - X| Confinement patients (f) confinement of each Deviation ( X) group Xf from mean |X - X | 6 5 30 1.61 8.05 7 4 28 0.61 2.44 8 4 32 1.61 6.44 9 3 27 2.61 7.83 10 2 20 3.61 7.22 Total 18 137 31.98 X = Mean days of confinement = 137 / 18 = 7.61 MAD=31.98 / 18=1.78, Coeff.of MAD= 1.78/7.61=0.233
  • 22. Problem: Find the MAD of weight and coefficient of MAD of 470 infants born in a hospital in one year from following table. Weight 2.0-2.4 2.5-2.9 3.0-3.4 3.5-3.9 4.0-4.4 4.5+ in Kg No. of 17 97 187 135 28 6 infant
  • 23. Merits and Limitations of MAD  Simple to understand and easy to compute.  Based on all observations.  MAD is less affected by the extreme items than the Standard deviation.  Greatest draw back is that the algebraic signs are ignored.  Not amenable to further mathematical treatment.  MAD gives us best result when deviation is taken from median. But median is not satisfactory for large variability in the data. If MAD is computed from mode, the value of the mode can not be determined always.
  • 24. Standard Deviation (σ) It is the positive square root of the average of squares of deviations of the observations from the mean. This is also called root mean squared deviation (σ) . For individual series: x1, x2, ……… xn Σ ( xi–x )2 ∑xi2 ∑xi 2 σ= √ ------------ n σ= √ n -( n ) For discrete series: X1, X2, ……… Xn & with corresponding frequency f1, f2, ……… fn Σ fi ( xi–x )2 σ= ∑fixi2 ∑fixi 2 ------------ Σ fi σ= ∑fi -( ∑ f ) i
  • 25. Standard Deviation (σ) Contd. For continuous grouped series with class midpoints : m1, m2, ……… mn & with corresponding frequency f1, f2, ……… fn Σ fi ( mi–x )2 σ= ∑fimi2 ∑fimi 2 √ ------------ Σ fi σ= ∑ fi -( ∑ f ) i Variance: It is the square of the s.d Coefficient of Variation (CV): Corresponding Relative measure of dispersion. σ CV = ------- × 100 X
  • 26. Characteristics of Standard Deviation:  SD is very satisfactory and most widely used measure of dispersion  Amenable for mathematical manipulation  It is independent of origin, but not of scale  If SD is small, there is a high probability for getting a value close to the mean and if it is large, the value is father away from the mean  Does not ignore the algebraic signs and it is less affected by fluctuations of sampling  SD can be calculated by : • Direct method • Assumed mean method. • Step deviation method.
  • 27.  It is the average of the distances of the observed values from the mean value for a set of data  Basic rule --More spread will yield a larger SD Uses of the standard deviation  The standard deviation enables us to determine, with a great deal of accuracy, where the values of a frequency distribution are located in relation to the mean.  Chebyshev’s Theorem • For any data set with the mean ‘µ’ and the standard deviation ‘s’ at least 75% of the values will fall within the 2σ interval and at least 89% of the values will fall within the 3σ interval of the mean
  • 28. TABLE: Calculation of the standard deviation (σ) Weights of 265 male students at the university of Washington Class-Interval f d fd fd2 (Σƒd2) (Σfd)2 (Weight) σ= - ×(i) n n2 90-99 1 -5 -5 25 100-109 1 -4 -4 16 931 (99)2 110-119 9 -3 -27 81 = - ×(10) 265 265 120-129 30 -2 -60 120 130-139 42 -1 -42 42 140-149 66 0 0 0 (3.5132 – 0.1396) (×10) = 150-159 47 1 47 47 160-169 39 2 78 156 = (1.8367) (10) 170-179 15 3 45 135 180-189 11 4 44 176 = 18.37 or 18.4 d = (Xi –A)/i n = Σfi 190-199 1 5 5 25 200-209 3 6 18 108 . A = 144.5, i = 10 n =265 Σƒd= 99 Σƒd2 = 931
  • 29.  Means, standard deviation, and coefficients of variation of the age distributions of four groups of mothers who gave birth to one or more children in the city of minneapol in: 1931 to 1935. Interprete the data CLASSIFICATION X σ CV Resident married 28.2 6.0 21.3 Non-resident married 29.5 6.0 20.3 Resident unmarried 23.4 5.8 24.8 Non-resident unmarried 21.7 3.7 17.1 Example: Suppose that each day laboratory technician A completes 40 analyses with a standard deviation of 5. Technician B completes 160 analyses per day with a standard deviation of 15. Which employee shows less variability?
  • 30. Uses of Standard deviation  Uses of the standard deviation • The standard deviation enables us to determine, with a great deal of accuracy, where the values of a frequency distribution are located in relation to the mean. We can do this according to a theorem devised by the Russian mathematician P.L. Chebyshev (1821- 1894).
  • 31. Measure of Shape The fourth important numerical characteristic of a data set is its shape: Skewness and kurtosis.  Skewness • Skewness characterizes the degree of asymmetry of a distribution around its mean. For a sample data, the skewness is defined by the formula: 3 n n  xi − x  Skewness = ∑ s  (n − 1)(n − 2) i =1   where n = the number of observations in the sample, xi= ith observation in the sample, s= standard deviation of the sample, x = sample mean
  • 32. Measure of Shape Figure 8.2 +ve or Right-skewed distribution
  • 33. Kurtosis: Kurtosis characterizes the relative peakedness or flatness of a distribution compared with the bell-shaped distribution (normal distribution). Kurtosis of a sample data set is calculated by the formula:   n(n + 1) n  xi − x   4  3(n − 1) 2 Kurtosis =  ∑  s   − (n − 2)(n − 3)  (n − 1)(n − 2)(n − 3) i =1      Positive kurtosis indicates a relatively peaked distribution. Negative kurtosis indicates a relatively flat distribution.
  • 34. The distributions with positive and negative kurtosis are depicted in Figure 8.4 , where the distribution with null kurtosis is normal distribution.
  • 35. REFERENCE 1. Mathematical Statistics- S.P Gupta 2. Statistics for management- Richard I. Levin, David S. Rubin 3. Biostatistics A foundation for Analysis in the Health Sciences.