SlideShare una empresa de Scribd logo
1 de 73
Introduction to Probability and Statistics
                          10th Week (5/17)



            Estimation Theory
Confidence Intervals

Content of this chapter
 Confidence Intervals for the Population

  Mean, μ
       when Population Standard Deviation σ is Known
       when Population Standard Deviation σ is Unknown
   Confidence Intervals for the Population
    Proportion, p
   Determining the Required Sample Size
Point and Interval Estimates

     A point estimate is a single number,
     a confidence interval provides additional
      information about variability


Lower                                        Upper
Confidence                                   Confidence
                   Point Estimate
Limit                                        Limit

                     Width of
                confidence interval
Point Estimates


  We can estimate a        with a Sample
Population Parameter …        Statistic
                         (a Point Estimate)

  Mean           μ             X
Proportion       π             p
Confidence Intervals

   How much uncertainty is associated with a
    point estimate of a population parameter?
   An interval estimate provides more
    information about a population characteristic
    than does a point estimate
   Such interval estimates are called confidence
    intervals
Confidence Interval Estimate
   An interval gives a range of values:
       Takes into consideration variation in sample
        statistics from sample to sample
       Based on observations from 1 sample
       Gives information about closeness to
        unknown population parameters
       Stated in terms of level of confidence
            Can never be 100% confident
Estimation Process


                 Random Sample   I am 95%
                                 confident that
                                 μ is between
Population          Mean         40 & 60.
(mean, μ, is         X = 50
 unknown)

 Sample
General Formula

      The general formula for all
       confidence intervals is:
Point Estimate ± (Critical Value)(Standard Error)
Confidence Level

   Confidence Level
       Confidence for which the interval
        will contain the unknown
        population parameter
   A percentage (less than 100%)
Confidence Level, (1-α)
                                                     (continued)
   Suppose confidence level = 95%
   Also written (1 - α) = 0.95
   A relative frequency interpretation:
       In the long run, 95% of all the confidence
        intervals that can be constructed will contain the
        unknown true parameter
   A specific interval either will contain or will
    not contain the true parameter
       No probability involved in a specific interval
Estimates
Estimates
Estimates
Efficient estimator: If the sampling distributions of two
statistics have the same mean, the statistic with the smaller
variance is called a more efficient estimator of the mean.

Clearly one would in practice prefer to have estimates that are
both efficient and unbiased, but this is not always possible.
Point/Interval Estimates
•Point estimate of the parameter: An estimate of a population parameter
given by a single number

•Interval estimate of the parameter: An estimate of a population
parameter given by two numbers between which the parameter

•Reliability: A statement of the error or precision of an estimate
Confidence Interval

- A particular kind of interval estimate of a population
  parameter and is used to indicate the reliability of an
  estimate.
- It is an observed interval (i.e. it is calculated from the
  observations), in principle different from sample to sample,
  that frequently includes the parameter of interest, if the
  experiment is repeated.

- How frequently the observed interval contains the
  parameter is determined by the confidence level or
  confidence coefficient.
Confidence Interval
Confidence Intervals
                 Confidence
                  Intervals


    Population                Population
      Mean                    Proportion




σ Known   σ Unknown
Confidence Interval for μ
                  (σ Known)
• Assumptions
   – Population standard deviation σ is known
   – Population is normally distributed
   – If population is not normal, use large sample

• Confidence interval estimate:
                            σ
                        X±Z
                             n
        X

  σ/ n
where     is the point estimate
        Z is the normal distribution critical value for a probability of α/2 in
Finding the Critical Value, Z
• Consider a 95% confidence interval: = ±1.96
                                   Z
                           1− α = 0.95




 α                                                     α
   = 0.025                                               = 0.025
 2                                                     2


Z units:     Z= -1.96            0           Z= 1.96
              Lower                           Upper
X units:      Confidence    Point Estimate    Confidence
              Limit                           Limit
Common Levels of
           Confidence
• Commonly used confidence levels are
  90%, 95%, and 99%
                 Confidence
    Confidence
                 Coefficient,   Z value
      Level
                   1− α
       80%          0.80        1.28
       90%          0.90        1.645
       95%          0.95        1.96
       98%          0.98        2.33
       99%          0.99        2.58
       99.8%        0.998       3.08
       99.9%        0.999       3.27
Intervals and Level of
                   Confidence
              Sampling Distribution of the Mean


                α/2          1− α            α/2
                                                   x
 Intervals                  μx = μ
extend from                          x1
     σ                      x2                     (1-α)x100%
 X+Z                                               of intervals
      n
 to                                                constructed
     σ                                             contain μ;
 X−Z
      n                                            (α)x100% do
                                                   not.
                      Confidence Intervals
Example
• A sample of 11 circuits from a large
  normal population has a mean
  resistance of 2.20 ohms. We know
  from past testing that the population
  standard deviation is 0.35 ohms.


• Determine a 95% confidence interval
  for the true mean resistance of the
  population.
Example
                                           (continued)

• A sample of 11 circuits from a large
  normal population has a mean
  resistance of 2.20 ohms. We know
  from past testing that the population
  standard deviation±is 0.35 ohms.
                   X Z
                         σ
                            n
• Solution:          = 2.20 ± 1.96 (0.35/ 11)
                     = 2.20 ± 0.2068
              1.9932 ≤ µ ≤ 2.4068
Interpretation
• We are 95% confident that the true
  mean resistance is between 1.9932
  and 2.4068 ohms
• Although the true mean may or may
  not be in this interval, 95% of intervals
  formed in this manner will contain the
  true mean
Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals
                 Confidence
                  Intervals


    Population                Population
      Mean                    Proportion




σ Known   σ Unknown
Confidence Interval for μ
               (σ Unknown)
   If the population standard deviation σ is
    unknown, we can substitute the sample
    standard deviation, S
   This introduces extra uncertainty, since
    S is variable from sample to sample
   So we use the t distribution instead of the
    normal distribution
Confidence Interval for μ
                    (σ Unknown)
                                                                        (continued)
   Assumptions
        Population standard deviation is unknown
        Population is normally distributed
        If population is not normal, use large sample
   Use Student’s t Distribution
   Confidence Interval Estimate:
                                                 S
                               X ± t n-1
                                                  n
    (where t is the critical value of the t distribution with n -1 degrees of
    freedom and an area of α/2 in each tail)
Student’s t Distribution

   The t is a family of distributions
   The t value depends on degrees of
    freedom (d.f.)
       Number of observations that are free to vary after
        sample mean has been calculated

                       d.f. = n - 1
Degrees of Freedom (df)
Idea: Number of observations that are free to vary
      after sample mean has been calculated

Example: Suppose the mean of 3 numbers is 8.0

     Let X1 = 7            If the mean of these three
     Let X2 = 8            values is 8.0,
     What is X 3 ?         then X3 must be 9
                           (i.e., X3 is not free to vary)
Here, n = 3, so degrees of freedom = n – 1 = 3 – 1 = 2
(2 values can be any numbers, but the third is not free to vary
for a given mean)
Student’s t Distribution
                   Note: t         Z as n increases

                    Standard
                     Normal
                 (t with df = ∞)

                                           t (df = 13)
t-distributions are bell-
shaped and symmetric, but
have ‘fatter’ tails than the                       t (df = 5)
normal




                                   0                            t
Student’s t Table

     Upper Tail Area
                                Let: n = 3
df    .25    .10      .05       df = n - 1 = 2
                                     α = 0.10
1 1.000 3.078 6.314                α/2 = 0.05

2 0.817 1.886 2.920
3 0.765 1.638 2.353                              α/2 = 0.05

       The body of the table
       contains t values, not         0    2.920 t
       probabilities
t distribution values
         With comparison to the Z value

Confidence    t            t           t         Z
 Level     (10 d.f.)    (20 d.f.)   (30 d.f.)   ____

 0.80        1.372       1.325       1.310      1.28
 0.90        1.812       1.725       1.697      1.645
 0.95        2.228       2.086       2.042      1.96
 0.99        3.169       2.845       2.750      2.58

          Note: t      Z as n increases
Example
A random sample of n = 25 has X = 50 and
S = 8. Form a 95% confidence interval for μ

   d.f. = n – 1 = 24, so   tα/2 , n−1 = t 0.025,24 = 2.0639

The confidence interval is
                     S                  8
    X ± t α/2, n-1      = 50 ± (2.0639)
                      n                 25
              46.698 ≤ μ ≤ 53.302
Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals for Means
Confidence Intervals

                 Confidence
                  Intervals


    Population                Population
      Mean                    Proportion




σ Known     σ Unknown
Confidence Intervals for the
      Population Proportion, π

   An interval estimate for the population
    proportion ( π ) can be calculated by
    adding an allowance for uncertainty to
    the sample proportion ( p )
Confidence Intervals for the
     Population Proportion, π
                                              (continued)
   Recall that the distribution of the sample
    proportion is approximately normal if the
    sample size is large, with standard deviation

                    π (1− π )
               σp =
                        n
   We will estimate this with sample data:

                      p(1− p)
                         n
Confidence Interval Endpoints
   Upper and lower confidence limits for the
    population proportion are calculated with the
    formula

                          p(1− p)
                      p±Z
                             n
   where
       Z is the standard normal value for the level of confidence desired
       p is the sample proportion
       n is the sample size
Example

   A random sample of 100 people
    shows that 25 are left-handed.
   Form a 95% confidence interval for
    the true proportion of left-handers
Example
                                           (continued)
   A random sample of 100 people shows
    that 25 are left-handed. Form a 95%
    confidence interval for the true proportion
    of left-handers.

      p ± Z p(1− p)/n
        = 25/100 ± 1.96 0.25(0.75)/100

       = 0.25 ± 1.96 (0.0433)
         0.1651 ≤ π ≤ 0.3349
Interpretation

   We are 95% confident that the true
    percentage of left-handers in the population
    is between
             16.51% and 33.49%.

   Although the interval from 0.1651 to 0.3349
    may or may not contain the true proportion,
    95% of intervals formed from samples of
    size 100 in this manner will contain the true
    proportion.
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Confidence Intervals for Differences and
                 Sums
Confidence Intervals for Differences and
                 Sums
Confidence Intervals for Differences and
                 Sums
Confidence Intervals for Differences and
                 Sums
Confidence Intervals for the Variance of a
          Normal Distribution
Confidence Intervals for Variance Ratios
Maximum Likelihood Estimates
Determining Sample Size

          Determining
          Sample Size


For the                  For the
 Mean                   Proportion
Sampling Error
   The required sample size can be found to reach
    a desired margin of error (e) with a specified
    level of confidence (1 - α)

   The margin of error is also called sampling error
       the amount of imprecision in the estimate of the
        population parameter
       the amount added and subtracted to the point
        estimate to form the confidence interval
Determining Sample Size

             Determining
             Sample Size


   For the
    Mean             Sampling error
                     (margin of error)
    σ                    σ
X±Z                  e=Z
     n                    n
Determining Sample Size
                                     (continued)

               Determining
               Sample Size


     For the
      Mean


    σ                            2
                                Z σ    2
e=Z        Now solve
           for n to get      n=
     n                           e 2
Determining Sample Size
                                                    (continued)

   To determine the required sample size for the
    mean, you must know:

       The desired level of confidence (1 - α), which
        determines the critical Z value
       The acceptable sampling error, e
       The standard deviation, σ
Required Sample Size Example

If σ = 45, what sample size is needed to
estimate the mean within ± 5 with 90%
confidence?

         2   2          2    2
      Z σ    (1.645) (45)
   n=    2
           =        2
                          = 219.19
       e          5

  So the required sample size is n = 220
                                  (Always round up)
If σ is unknown

   If unknown, σ can be estimated when
    using the required sample size formula
       Use a value for σ that is expected to be
        at least as large as the true σ
       Select a pilot sample and estimate σ with
        the sample standard deviation, S
Determining Sample Size
                                             (continued)

                  Determining
                  Sample Size


                                 For the
                                Proportion



    π (1− π )   Now solve           Z 2 π (1− π )
e=Z             for n to get     n=        2
        n                                e
Determining Sample Size
                                                          (continued)

   To determine the required sample size for the
    proportion, you must know:

       The desired level of confidence (1 - α), which
        determines the critical Z value
       The acceptable sampling error, e
       The true proportion of “successes”, π
         
             π can be estimated with a pilot sample, if
             necessary (or conservatively use π = 0.5)
Required Sample Size Example

How large a sample would be necessary
to estimate the true proportion defective in
a large population within ±3%, with 95%
confidence?
(Assume a pilot sample yields p = 0.12)
Required Sample Size Example
                                               (continued)

    Solution:
    For 95% confidence, use Z = 1.96
    e = 0.03
    p = 0.12, so use this to estimate π

   Z π (1− π ) (1.96) (0.12)(1− 0.12)
     2                  2
n=      2
              =               2
                                      = 450.74
      e                (0.03)
                                          So use n = 451
Ethical Issues

   A confidence interval estimate (reflecting
    sampling error) should always be included
    when reporting a point estimate
   The level of confidence should always be
    reported
   The sample size should be reported
   An interpretation of the confidence interval
    estimate should also be provided

Más contenido relacionado

La actualidad más candente

JM Statr session 13, Jan 11
JM Statr session 13, Jan 11JM Statr session 13, Jan 11
JM Statr session 13, Jan 11Ruru Chowdhury
 
STATISTICS: Normal Distribution
STATISTICS: Normal Distribution STATISTICS: Normal Distribution
STATISTICS: Normal Distribution jundumaug1
 
Normal distribution
Normal distributionNormal distribution
Normal distributionAlok Roy
 
Using Microsoft excel for six sigma
Using Microsoft excel for six sigmaUsing Microsoft excel for six sigma
Using Microsoft excel for six sigmaAbhilash Surendran
 
Statistics-3 : Statistical Inference - Core
Statistics-3 : Statistical Inference - CoreStatistics-3 : Statistical Inference - Core
Statistics-3 : Statistical Inference - CoreGiridhar Chandrasekaran
 
CABT SHS Statistics & Probability - The Standard Normal Distribution
CABT SHS Statistics & Probability - The Standard Normal DistributionCABT SHS Statistics & Probability - The Standard Normal Distribution
CABT SHS Statistics & Probability - The Standard Normal DistributionGilbert Joseph Abueg
 
Review on probability distributions, estimation and hypothesis testing
Review on probability distributions, estimation and hypothesis testingReview on probability distributions, estimation and hypothesis testing
Review on probability distributions, estimation and hypothesis testingMeselu Mellaku
 
Statistics Formulae for School Students
Statistics Formulae for School StudentsStatistics Formulae for School Students
Statistics Formulae for School Studentsdhatiraghu
 
Statistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normalStatistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normalSelvin Hadi
 
Normal distribution and hypothesis testing
Normal distribution and hypothesis testingNormal distribution and hypothesis testing
Normal distribution and hypothesis testingLorelyn Turtosa-Dumaug
 
60 revision points on statistics
60 revision points on statistics60 revision points on statistics
60 revision points on statistics2coolshivani
 
Probability distributions
Probability distributionsProbability distributions
Probability distributionsmvskrishna
 
Week 7 quiz_review_mini_tab_2011
Week 7 quiz_review_mini_tab_2011Week 7 quiz_review_mini_tab_2011
Week 7 quiz_review_mini_tab_2011Brent Heard
 

La actualidad más candente (20)

Normal distribution
Normal distributionNormal distribution
Normal distribution
 
JM Statr session 13, Jan 11
JM Statr session 13, Jan 11JM Statr session 13, Jan 11
JM Statr session 13, Jan 11
 
STATISTICS: Normal Distribution
STATISTICS: Normal Distribution STATISTICS: Normal Distribution
STATISTICS: Normal Distribution
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Msb12e ppt ch06
Msb12e ppt ch06Msb12e ppt ch06
Msb12e ppt ch06
 
Week 5 ppt.
Week 5 ppt.Week 5 ppt.
Week 5 ppt.
 
Using Microsoft excel for six sigma
Using Microsoft excel for six sigmaUsing Microsoft excel for six sigma
Using Microsoft excel for six sigma
 
Chapter09
Chapter09Chapter09
Chapter09
 
Statistics-3 : Statistical Inference - Core
Statistics-3 : Statistical Inference - CoreStatistics-3 : Statistical Inference - Core
Statistics-3 : Statistical Inference - Core
 
ECO202 formulae
ECO202 formulaeECO202 formulae
ECO202 formulae
 
CABT SHS Statistics & Probability - The Standard Normal Distribution
CABT SHS Statistics & Probability - The Standard Normal DistributionCABT SHS Statistics & Probability - The Standard Normal Distribution
CABT SHS Statistics & Probability - The Standard Normal Distribution
 
Review on probability distributions, estimation and hypothesis testing
Review on probability distributions, estimation and hypothesis testingReview on probability distributions, estimation and hypothesis testing
Review on probability distributions, estimation and hypothesis testing
 
Talk 3
Talk 3Talk 3
Talk 3
 
Statistics Formulae for School Students
Statistics Formulae for School StudentsStatistics Formulae for School Students
Statistics Formulae for School Students
 
Statistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normalStatistik 1 6 distribusi probabilitas normal
Statistik 1 6 distribusi probabilitas normal
 
Normal distribution and hypothesis testing
Normal distribution and hypothesis testingNormal distribution and hypothesis testing
Normal distribution and hypothesis testing
 
Estimation by c.i
Estimation by c.iEstimation by c.i
Estimation by c.i
 
60 revision points on statistics
60 revision points on statistics60 revision points on statistics
60 revision points on statistics
 
Probability distributions
Probability distributionsProbability distributions
Probability distributions
 
Week 7 quiz_review_mini_tab_2011
Week 7 quiz_review_mini_tab_2011Week 7 quiz_review_mini_tab_2011
Week 7 quiz_review_mini_tab_2011
 

Similar a 10주차

Estimating population values ppt @ bec doms
Estimating population values ppt @ bec domsEstimating population values ppt @ bec doms
Estimating population values ppt @ bec domsBabasab Patil
 
Statistical Analysis-Confidence Interval_Session 5.pptx
Statistical Analysis-Confidence Interval_Session 5.pptxStatistical Analysis-Confidence Interval_Session 5.pptx
Statistical Analysis-Confidence Interval_Session 5.pptxmaruco1
 
Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciSelvin Hadi
 
Estimation&ci (assignebt )
Estimation&ci (assignebt )Estimation&ci (assignebt )
Estimation&ci (assignebt )Mmedsc Hahm
 
Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)jillmitchell8778
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervalsLayal Fahad
 
Lect w4 Lect w3 estimation
Lect w4 Lect w3 estimationLect w4 Lect w3 estimation
Lect w4 Lect w3 estimationRione Drevale
 
Ch4 Confidence Interval
Ch4 Confidence IntervalCh4 Confidence Interval
Ch4 Confidence IntervalFarhan Alfin
 
Bca admission in india
Bca admission in indiaBca admission in india
Bca admission in indiaEdhole.com
 
Business Analytics _ Confidence Interval
Business Analytics _ Confidence IntervalBusiness Analytics _ Confidence Interval
Business Analytics _ Confidence IntervalRavindra Nath Shukla
 
unit-2.2 and 2.3.pptx
unit-2.2 and 2.3.pptxunit-2.2 and 2.3.pptx
unit-2.2 and 2.3.pptxvishnupavan8
 
Montgomery
Montgomery Montgomery
Montgomery bazz8
 
Point Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsPoint Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsUniversity of Salerno
 
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docxHW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docxwilcockiris
 

Similar a 10주차 (20)

Inferential statistics-estimation
Inferential statistics-estimationInferential statistics-estimation
Inferential statistics-estimation
 
Estimating population values ppt @ bec doms
Estimating population values ppt @ bec domsEstimating population values ppt @ bec doms
Estimating population values ppt @ bec doms
 
Statistical Analysis-Confidence Interval_Session 5.pptx
Statistical Analysis-Confidence Interval_Session 5.pptxStatistical Analysis-Confidence Interval_Session 5.pptx
Statistical Analysis-Confidence Interval_Session 5.pptx
 
Statistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ciStatistik 1 7 estimasi & ci
Statistik 1 7 estimasi & ci
 
Estimation&ci (assignebt )
Estimation&ci (assignebt )Estimation&ci (assignebt )
Estimation&ci (assignebt )
 
Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)Statistics lecture 8 (chapter 7)
Statistics lecture 8 (chapter 7)
 
Confidence Intervals
Confidence IntervalsConfidence Intervals
Confidence Intervals
 
Presentation_advance_1n.pptx
Presentation_advance_1n.pptxPresentation_advance_1n.pptx
Presentation_advance_1n.pptx
 
L10 confidence intervals
L10 confidence intervalsL10 confidence intervals
L10 confidence intervals
 
Lect w4 Lect w3 estimation
Lect w4 Lect w3 estimationLect w4 Lect w3 estimation
Lect w4 Lect w3 estimation
 
Ch4 Confidence Interval
Ch4 Confidence IntervalCh4 Confidence Interval
Ch4 Confidence Interval
 
Sampling Theory Part 3
Sampling Theory Part 3Sampling Theory Part 3
Sampling Theory Part 3
 
Bca admission in india
Bca admission in indiaBca admission in india
Bca admission in india
 
Business Analytics _ Confidence Interval
Business Analytics _ Confidence IntervalBusiness Analytics _ Confidence Interval
Business Analytics _ Confidence Interval
 
unit-2.2 and 2.3.pptx
unit-2.2 and 2.3.pptxunit-2.2 and 2.3.pptx
unit-2.2 and 2.3.pptx
 
L estimation
L estimationL estimation
L estimation
 
Montgomery
Montgomery Montgomery
Montgomery
 
Point Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsPoint Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis tests
 
lecture-2.ppt
lecture-2.pptlecture-2.ppt
lecture-2.ppt
 
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docxHW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
 

Más de Kookmin University (9)

13주차
13주차13주차
13주차
 
12주차
12주차12주차
12주차
 
11주차
11주차11주차
11주차
 
9주차
9주차9주차
9주차
 
7주차
7주차7주차
7주차
 
6주차
6주차6주차
6주차
 
5주차
5주차5주차
5주차
 
4주차
4주차4주차
4주차
 
2주차
2주차2주차
2주차
 

10주차

  • 1. Introduction to Probability and Statistics 10th Week (5/17) Estimation Theory
  • 2. Confidence Intervals Content of this chapter  Confidence Intervals for the Population Mean, μ  when Population Standard Deviation σ is Known  when Population Standard Deviation σ is Unknown  Confidence Intervals for the Population Proportion, p  Determining the Required Sample Size
  • 3. Point and Interval Estimates  A point estimate is a single number,  a confidence interval provides additional information about variability Lower Upper Confidence Confidence Point Estimate Limit Limit Width of confidence interval
  • 4. Point Estimates We can estimate a with a Sample Population Parameter … Statistic (a Point Estimate) Mean μ X Proportion π p
  • 5. Confidence Intervals  How much uncertainty is associated with a point estimate of a population parameter?  An interval estimate provides more information about a population characteristic than does a point estimate  Such interval estimates are called confidence intervals
  • 6. Confidence Interval Estimate  An interval gives a range of values:  Takes into consideration variation in sample statistics from sample to sample  Based on observations from 1 sample  Gives information about closeness to unknown population parameters  Stated in terms of level of confidence  Can never be 100% confident
  • 7. Estimation Process Random Sample I am 95% confident that μ is between Population Mean 40 & 60. (mean, μ, is X = 50 unknown) Sample
  • 8. General Formula  The general formula for all confidence intervals is: Point Estimate ± (Critical Value)(Standard Error)
  • 9. Confidence Level  Confidence Level  Confidence for which the interval will contain the unknown population parameter  A percentage (less than 100%)
  • 10. Confidence Level, (1-α) (continued)  Suppose confidence level = 95%  Also written (1 - α) = 0.95  A relative frequency interpretation:  In the long run, 95% of all the confidence intervals that can be constructed will contain the unknown true parameter  A specific interval either will contain or will not contain the true parameter  No probability involved in a specific interval
  • 14.
  • 15. Efficient estimator: If the sampling distributions of two statistics have the same mean, the statistic with the smaller variance is called a more efficient estimator of the mean. Clearly one would in practice prefer to have estimates that are both efficient and unbiased, but this is not always possible.
  • 16. Point/Interval Estimates •Point estimate of the parameter: An estimate of a population parameter given by a single number •Interval estimate of the parameter: An estimate of a population parameter given by two numbers between which the parameter •Reliability: A statement of the error or precision of an estimate
  • 17. Confidence Interval - A particular kind of interval estimate of a population parameter and is used to indicate the reliability of an estimate. - It is an observed interval (i.e. it is calculated from the observations), in principle different from sample to sample, that frequently includes the parameter of interest, if the experiment is repeated. - How frequently the observed interval contains the parameter is determined by the confidence level or confidence coefficient.
  • 19. Confidence Intervals Confidence Intervals Population Population Mean Proportion σ Known σ Unknown
  • 20. Confidence Interval for μ (σ Known) • Assumptions – Population standard deviation σ is known – Population is normally distributed – If population is not normal, use large sample • Confidence interval estimate: σ X±Z n X σ/ n where is the point estimate Z is the normal distribution critical value for a probability of α/2 in
  • 21. Finding the Critical Value, Z • Consider a 95% confidence interval: = ±1.96 Z 1− α = 0.95 α α = 0.025 = 0.025 2 2 Z units: Z= -1.96 0 Z= 1.96 Lower Upper X units: Confidence Point Estimate Confidence Limit Limit
  • 22. Common Levels of Confidence • Commonly used confidence levels are 90%, 95%, and 99% Confidence Confidence Coefficient, Z value Level 1− α 80% 0.80 1.28 90% 0.90 1.645 95% 0.95 1.96 98% 0.98 2.33 99% 0.99 2.58 99.8% 0.998 3.08 99.9% 0.999 3.27
  • 23. Intervals and Level of Confidence Sampling Distribution of the Mean α/2 1− α α/2 x Intervals μx = μ extend from x1 σ x2 (1-α)x100% X+Z of intervals n to constructed σ contain μ; X−Z n (α)x100% do not. Confidence Intervals
  • 24. Example • A sample of 11 circuits from a large normal population has a mean resistance of 2.20 ohms. We know from past testing that the population standard deviation is 0.35 ohms. • Determine a 95% confidence interval for the true mean resistance of the population.
  • 25. Example (continued) • A sample of 11 circuits from a large normal population has a mean resistance of 2.20 ohms. We know from past testing that the population standard deviation±is 0.35 ohms. X Z σ n • Solution: = 2.20 ± 1.96 (0.35/ 11) = 2.20 ± 0.2068 1.9932 ≤ µ ≤ 2.4068
  • 26. Interpretation • We are 95% confident that the true mean resistance is between 1.9932 and 2.4068 ohms • Although the true mean may or may not be in this interval, 95% of intervals formed in this manner will contain the true mean
  • 31. Confidence Intervals Confidence Intervals Population Population Mean Proportion σ Known σ Unknown
  • 32. Confidence Interval for μ (σ Unknown)  If the population standard deviation σ is unknown, we can substitute the sample standard deviation, S  This introduces extra uncertainty, since S is variable from sample to sample  So we use the t distribution instead of the normal distribution
  • 33. Confidence Interval for μ (σ Unknown) (continued)  Assumptions  Population standard deviation is unknown  Population is normally distributed  If population is not normal, use large sample  Use Student’s t Distribution  Confidence Interval Estimate: S X ± t n-1 n (where t is the critical value of the t distribution with n -1 degrees of freedom and an area of α/2 in each tail)
  • 34. Student’s t Distribution  The t is a family of distributions  The t value depends on degrees of freedom (d.f.)  Number of observations that are free to vary after sample mean has been calculated d.f. = n - 1
  • 35. Degrees of Freedom (df) Idea: Number of observations that are free to vary after sample mean has been calculated Example: Suppose the mean of 3 numbers is 8.0 Let X1 = 7 If the mean of these three Let X2 = 8 values is 8.0, What is X 3 ? then X3 must be 9 (i.e., X3 is not free to vary) Here, n = 3, so degrees of freedom = n – 1 = 3 – 1 = 2 (2 values can be any numbers, but the third is not free to vary for a given mean)
  • 36. Student’s t Distribution Note: t Z as n increases Standard Normal (t with df = ∞) t (df = 13) t-distributions are bell- shaped and symmetric, but have ‘fatter’ tails than the t (df = 5) normal 0 t
  • 37. Student’s t Table Upper Tail Area Let: n = 3 df .25 .10 .05 df = n - 1 = 2 α = 0.10 1 1.000 3.078 6.314 α/2 = 0.05 2 0.817 1.886 2.920 3 0.765 1.638 2.353 α/2 = 0.05 The body of the table contains t values, not 0 2.920 t probabilities
  • 38. t distribution values With comparison to the Z value Confidence t t t Z Level (10 d.f.) (20 d.f.) (30 d.f.) ____ 0.80 1.372 1.325 1.310 1.28 0.90 1.812 1.725 1.697 1.645 0.95 2.228 2.086 2.042 1.96 0.99 3.169 2.845 2.750 2.58 Note: t Z as n increases
  • 39. Example A random sample of n = 25 has X = 50 and S = 8. Form a 95% confidence interval for μ  d.f. = n – 1 = 24, so tα/2 , n−1 = t 0.025,24 = 2.0639 The confidence interval is S 8 X ± t α/2, n-1 = 50 ± (2.0639) n 25 46.698 ≤ μ ≤ 53.302
  • 44. Confidence Intervals Confidence Intervals Population Population Mean Proportion σ Known σ Unknown
  • 45. Confidence Intervals for the Population Proportion, π  An interval estimate for the population proportion ( π ) can be calculated by adding an allowance for uncertainty to the sample proportion ( p )
  • 46. Confidence Intervals for the Population Proportion, π (continued)  Recall that the distribution of the sample proportion is approximately normal if the sample size is large, with standard deviation π (1− π ) σp = n  We will estimate this with sample data: p(1− p) n
  • 47. Confidence Interval Endpoints  Upper and lower confidence limits for the population proportion are calculated with the formula p(1− p) p±Z n  where  Z is the standard normal value for the level of confidence desired  p is the sample proportion  n is the sample size
  • 48. Example  A random sample of 100 people shows that 25 are left-handed.  Form a 95% confidence interval for the true proportion of left-handers
  • 49. Example (continued)  A random sample of 100 people shows that 25 are left-handed. Form a 95% confidence interval for the true proportion of left-handers. p ± Z p(1− p)/n = 25/100 ± 1.96 0.25(0.75)/100 = 0.25 ± 1.96 (0.0433) 0.1651 ≤ π ≤ 0.3349
  • 50. Interpretation  We are 95% confident that the true percentage of left-handers in the population is between 16.51% and 33.49%.  Although the interval from 0.1651 to 0.3349 may or may not contain the true proportion, 95% of intervals formed from samples of size 100 in this manner will contain the true proportion.
  • 55. Confidence Intervals for Differences and Sums
  • 56. Confidence Intervals for Differences and Sums
  • 57. Confidence Intervals for Differences and Sums
  • 58. Confidence Intervals for Differences and Sums
  • 59. Confidence Intervals for the Variance of a Normal Distribution
  • 60. Confidence Intervals for Variance Ratios
  • 62. Determining Sample Size Determining Sample Size For the For the Mean Proportion
  • 63. Sampling Error  The required sample size can be found to reach a desired margin of error (e) with a specified level of confidence (1 - α)  The margin of error is also called sampling error  the amount of imprecision in the estimate of the population parameter  the amount added and subtracted to the point estimate to form the confidence interval
  • 64. Determining Sample Size Determining Sample Size For the Mean Sampling error (margin of error) σ σ X±Z e=Z n n
  • 65. Determining Sample Size (continued) Determining Sample Size For the Mean σ 2 Z σ 2 e=Z Now solve for n to get n= n e 2
  • 66. Determining Sample Size (continued)  To determine the required sample size for the mean, you must know:  The desired level of confidence (1 - α), which determines the critical Z value  The acceptable sampling error, e  The standard deviation, σ
  • 67. Required Sample Size Example If σ = 45, what sample size is needed to estimate the mean within ± 5 with 90% confidence? 2 2 2 2 Z σ (1.645) (45) n= 2 = 2 = 219.19 e 5 So the required sample size is n = 220 (Always round up)
  • 68. If σ is unknown  If unknown, σ can be estimated when using the required sample size formula  Use a value for σ that is expected to be at least as large as the true σ  Select a pilot sample and estimate σ with the sample standard deviation, S
  • 69. Determining Sample Size (continued) Determining Sample Size For the Proportion π (1− π ) Now solve Z 2 π (1− π ) e=Z for n to get n= 2 n e
  • 70. Determining Sample Size (continued)  To determine the required sample size for the proportion, you must know:  The desired level of confidence (1 - α), which determines the critical Z value  The acceptable sampling error, e  The true proportion of “successes”, π  π can be estimated with a pilot sample, if necessary (or conservatively use π = 0.5)
  • 71. Required Sample Size Example How large a sample would be necessary to estimate the true proportion defective in a large population within ±3%, with 95% confidence? (Assume a pilot sample yields p = 0.12)
  • 72. Required Sample Size Example (continued) Solution: For 95% confidence, use Z = 1.96 e = 0.03 p = 0.12, so use this to estimate π Z π (1− π ) (1.96) (0.12)(1− 0.12) 2 2 n= 2 = 2 = 450.74 e (0.03) So use n = 451
  • 73. Ethical Issues  A confidence interval estimate (reflecting sampling error) should always be included when reporting a point estimate  The level of confidence should always be reported  The sample size should be reported  An interpretation of the confidence interval estimate should also be provided