SlideShare a Scribd company logo
1 of 39
Business Statistics: Communicating with Numbers
By Sanjiv Jaggia and Alison Kelly
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
7-2
Chapter 7 Learning Objectives (LOs)
LO 7.1: Differentiate between a population
parameter and a sample statistic.
LO 7.5: Describe the properties of the sampling
distribution of the sample mean.
7-3
Chapter 7 Learning Objectives (LOs)
LO 7.6: Explain the importance of the central
limit theorem.
LO 7.7: Describe the properties of the sampling
distribution of the sample proportion.
LO 7.8: Use a finite population correction
factor.
7-4
7.1 Sampling
LO 7.1 Differentiate between a population parameter and sample statistic.
 Population—consists of all items of interest
in a statistical problem.
 Population Parameter is unknown.
 Sample—a subset of the population.
 Sample Statistic is calculated from sample and
used to make inferences about the population.
 Bias—the tendency of a sample statistic to
systematically over- or underestimate a
population parameter.
7-5
7.1 Sampling
 Classic Case of a “Bad” Sample: The Literary
Digest Debacle of 1936
 During the1936 presidential election, the Literary Digest
predicted a landslide victory for Alf Landon over Franklin
D. Roosevelt (FDR) with only a 1% margin of error.
 They were wrong! FDR won in a landslide election.
 The Literary Digest had committed selection bias by
randomly sampling from their own subscriber/
membership lists, etc.
 In addition, with only a 24% response rate, the Literary
Digest had a great deal of non-response bias.
LO 7.2 Explain common sample biases.
7-6
7.1 SamplingLO 7.2
 Selection bias—a systematic exclusion of certain
groups from consideration for the sample.
 The Literary Digest committed selection bias by excluding
a large portion of the population (e.g., lower income
voters).
 Nonresponse bias—a systematic difference in
preferences between respondents and non-
respondents to a survey or a poll.
 The Literary Digest had only a 24% response rate. This
indicates that only those who cared a great deal about the
election took the time to respond to the survey. These
respondents may be atypical of the population as a whole.
7-7
7.1 Sampling
 Sampling Methods
 Simple random sample is a sample of n
observations which has the same probability of
being selected from the population as any other
sample of n observations.
 Most statistical methods presume simple
random samples.
 However, in some situations other sampling
methods have an advantage over simple
random samples.
LO 7.3 Describe simple random sampling.
7-8
7.1 SamplingLO 7.3
 Example: In 1961, students invested 24 hours per
week in their academic pursuits, whereas today’s
students study an average of 14 hours per week.
 A dean at a large university in California wonders if this
trend is reflective of the students at her university. The
university has 20,000 students and the dean would like a
sample of 100. Use Excel to draw a simple random
sample of 100 students.
 In Excel, choose
Formulas > Insert function >
RANDBETWEEN and input
the values shown here.
7-9
Simple Random Sample: Using Table of
Random Numbers
A population consists of 845 employees of Nitra Industries. A
sample of 52 employees is to be selected from that
population.
A more convenient method of selecting a random sample is
to use the identification number of each employee and a
table of random numbers such as the one in Appendix B.6.
7-10
Systematic Random Sampling
EXAMPLE
A population consists of 845 employees of Nitra Industries. A sample
of 52 employees is to be selected from that population.
First, k is calculated as the population size divided by the sample
size. For Nitra Industries, we would select every 16th (845/52)
employee list. If k is not a whole number, then round down.
Random sampling is used in the selection of the first name. Then,
select every 16th name on the list thereafter.
Systematic Random Sampling: The items or individuals of the
population are arranged in some order. A random starting point is
selected and then every kth member of the population is selected for
the sample.
7-11
7.1 Sampling
 Stratified Random Sampling
 Divide the population into mutually exclusive and
collectively exhaustive groups, called strata.
 Randomly select observations from each stratum,
which are proportional to the stratum’s size.
 Advantages:
 Guarantees that the each population subdivision is
represented in the sample.
 Parameter estimates have greater precision than those
estimated from simple random sampling.
LO 7.4 Distinguish between stratified random sampling and cluster sampling.
7-12
Stratified Random Sampling
Stratified Random Sampling: A population is first divided into
subgroups, called strata, and a sample is selected from each stratum.
Useful when a population can be clearly divided in groups based on
some characteristics
Suppose we want to study the advertising
expenditures for the 352 largest companies
in the United States to determine whether
firms with high returns on equity (a measure
of profitability) spent more of each sales
dollar on advertising than firms with a low
return or deficit.
To make sure that the sample is a fair
representation of the 352 companies, the
companies are grouped on percent return
on equity and a sample proportional to the
relative size of the group is randomly
selected.
7-13
7.1 SamplingLO 7.4
 Cluster Sampling
 Divide population into mutually exclusive and
collectively exhaustive groups, called clusters.
 Randomly select clusters.
 Sample every observation in those randomly
selected clusters.
 Advantages and disadvantages:
 Less expensive than other sampling methods.
 Less precision than simple random sampling or
stratified sampling.
 Useful when clusters occur naturally in the population.
7-14
Cluster Sampling
Cluster Sampling: A population is divided into clusters using naturally
occurring geographic or other boundaries. Then, clusters are
randomly selected and a sample is collected by randomly selecting
from each cluster.
Suppose you want to determine the views
of residents in Oregon about state and
federal environmental protection policies.
Cluster sampling can be used by
subdividing the state into small units—
either counties or regions, select at
random say 4 regions, then take samples
of the residents in each of these regions
and interview them. (Note that this is a
combination of cluster sampling and
simple random sampling.)
7-15
7.1 SamplingLO 7.4
 Stratified versus Cluster Sampling
 Stratified Sampling
 Sample consists of
elements from
each group.
 Preferred when the
objective is to
increase precision.
 Cluster Sampling
 Sample consists of
elements from the
selected groups.
 Preferred when
the objective is to
reduce costs.
7-16
7.2 The Sampling Distribution of the Means
 Population is described by parameters.
 A parameter is a constant, whose value may be unknown.
 Only one population.
 Sample is described by statistics.
 A statistic is a random variable whose value depends on
the chosen random sample.
 Statistics are used to make inferences about the
population parameters.
 Can draw multiple random samples of size n.
LO 7.5 Describe the properties of the sampling distribution of the
sample mean.
7-17
Methods of Probability Sampling
 In nonprobability sample inclusion in the
sample is based on the judgment of the
person selecting the sample.
 The sampling error is the difference
between a sample statistic and its
corresponding population parameter.
7-18
7.2 The Sampling Distribution of the
Sample Mean
 Estimator
 A statistic that is used to estimate a population
parameter.
 For example, , the mean of the sample, is an
estimator of m, the mean of the population.
 Estimate
 A particular value of the estimator.
 For example, the mean of the sample is an
estimate of m, the mean of the population.
LO 7.5
X
x
7-19
7.2 The Sampling Distribution of the
Sample Mean
 Sampling Distribution of the Mean
 Each random sample of size n drawn from the
population provides an estimate of m—the sample
mean .
 The sampling distribution of the sample mean is a
probability distribution consisting of all possible
sample means of a given sample size selected
from a population.
LO 7.5
X
x
7-20
7.2 The Sampling Distribution of the
Sample Mean
 Example
LO 7.5
Random Variable
X1 X2 X3 X4
Mean of
X
6 10 8 4 5.57
5 10 4 3 5.71
1 8 4 3 6.36
4 1 6 2 4.07
6 6 8 4
7 7 8 6
1 5 10 5
5 5 9 1
4 6 4 2
7 4 9 5
8 5 8 6
9 2 7 7
9 1 2 3
6 10 2 6
Means 5.57 5.71 6.36 4.07 5.43
One simple random sample
drawn from the population—a
single distribution of values of X.
Means from each
distribution (random
draw) from the
population.
A distribution of
means from each
random draw from
the population.
7-21
7.2 The Sampling Distribution of the
Sample Mean
 Example: Draw a sampling distribution of
the sample mean from a population of
X={1,2,3,4,5,6} from a sample of n=3
without replacement.
 Process:
 List all possible samples
 Calculate each mean of all possible samples
 Construct the distribution of the sample means
LO 7.5
7-22
7.2 The Sampling Distribution of the
Sample Mean
 The Expected Value and Standard Deviation
of the Sample Mean
 Expected Value
 The expected value of X,
 The expected value of the sample mean,
LO 7.5
 E X m
   E X E X m 
7-23
7.2 The Sampling Distribution of the
Sample Mean
 The Expected Value and Standard Deviation
of the Sample Mean
 Variance of X
 Standard Deviation
 of X
 of
LO 7.5
  2
Var X 
X
Where n is the sample size.
Also known as the Standard
Error of the Mean.
  2
SD X   
 SD X
n


7-24
7.2 The Sampling Distribution of the
Sample Mean
 The Central Limit Theorem
 For any population X with expected value m and standard
deviation , the sampling distribution of X will be
approximately normal if the sample size n is sufficiently
large.
 As a general guideline, the normal distribution
approximation is justified when n > 30.
 As before, if is approximately
normal, then we can transform it to
LO 7.6 Explain the importance of the central limit theorem.
X X
Z
n
m



7-25
Central Limit Theorem
 If the population follows a normal probability distribution, then for
any sample size the sampling distribution of the sample mean will
also be normal.
 If the population distribution is symmetrical (but not normal), the
normal shape of the distribution of the sample mean emerge with
samples as small as 10.
 If a distribution that is skewed or has thick tails, it may require
samples of 30 or more to observe the normality feature.
 The mean of the sampling distribution equal to μ and the variance
equal to σ2/n.
CENTRAL LIMIT THEOREM If all samples of a particular size are
selected from any population, the sampling distribution of the sample
mean is approximately a normal distribution. This approximation
improves with larger samples.
7-26
7-27
Using the Sampling
Distribution of the Sample Mean (Sigma Known)
 If a population follows the normal distribution, the
sampling distribution of the sample mean will also
follow the normal distribution.
 If the shape is known to be nonnormal, but the sample
contains at least 30 observations, the central limit
theorem guarantees the sampling distribution of the
mean follows a normal distribution.
 To determine the probability a sample mean falls within
a particular region, use:
n
X
z

m

7-28
 If the population does not follow the normal
distribution, but the sample is of at least 30
observations, the sample means will follow
the normal distribution.
 To determine the probability a sample mean
falls within a particular region, use:
ns
X
t
m

Using the Sampling
Distribution of the Sample Mean (Sigma Unknown)
7-29
7.2 The Sampling Distribution of the
Sample Mean
 Example: Given that m = 16 inches and  = 0.8
inches, determine the following:
 What is the expected value and the standard
deviation of the sample mean derived from a
random sample of
 2 pizzas
 4 pizzas
LO 7.5
  16E X m    0.8
0.57
2
SD X
n

  
  16E X m    0.8
0.40
4
SD X
n

  
7-30
7.2 The Sampling Distribution of the
Sample Mean
 Sampling from a Normal Distribution
 For any sample size n, the sampling distribution
of is normal if the population X from which the
sample is drawn is normally distributed.
 If X is normal, then we can transform it into the
standard normal random variable as:
LO 7.5
X
 
 
X E X X
Z
SD X n
m

 
 
For a sampling
distribution.
 
 
x E X x
Z
SD X
m

 
 
For a distribution of
the values of X.
7-31
7.2 The Sampling Distribution of the
Sample Mean
LO 7.5
 Example: Given that m = 16 inches and  = 0.8
inches, determine the following:
 What is the probability that a randomly selected pizza is
less than 15.5 inches?

 What is the probability that 2 randomly selected pizzas
average less than 15.5 inches?

15.5 16
0.63
0.8
x
Z
m

 
   
( 15.5) ( 0.63)
0.2643 or 26.43%
P X P Z   

( 15.5) ( 0.88)
0.1894 or 18.94%
P X P Z   

15.5 16
0.88
0.8 2
x
Z
n
m

 
   
7-32
 Example: From the introductory case, Anne wants
to determine if the marketing campaign has had a
lingering effect on the amount of money customers
spend on iced coffee.
 Before the campaign, m = $4.18 and  = $0.84. Based on 50
customers sampled after the campaign, m = $4.26.
 Let’s find . Since n > 30, the central limit
theorem states that is approximately normal. So,
 4.26P X 
X
7.2 The Sampling Distribution of the
Sample Mean
LO 7.6
 
 
4.26 4.18
4.26
0.84 50
0.67 1 0.7486 0.2514
X
P X P Z P Z
n
P Z
m

    
       
   
    
7-33
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7 Describe the properties of the sampling distribution of the sample
proportion.
 Estimator
 Sample proportion is used to estimate the
population parameter p.
 Estimate
 A particular value of the estimator .
P
p
7-34
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7
 The Expected Value and Standard
Deviation of the Sample Proportion
 Expected Value
 The expected value of ,
 The standard deviation of ,
 E P p
P
P
   1p p
SD P
n


7-35
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7
 The Central Limit Theorem for the Sample
Proportion
 For any population proportion p, the sampling
distribution of is approximately normal if the
sample size n is sufficiently large .
 As a general guideline, the normal distribution
approximation is justified when
np > 5 and n(1  p) > 5.
P
7-36
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7
 The Central Limit Theorem for the Sample
Proportion
 If is normal, we can transform it into the
standard normal random variable as
 Therefore any value on
has a corresponding value
z on Z given by
P
 
   1
P E P P p
Z
SD P p p
n
 
 

Pp
 


1
p p
Z
p p
n
7-37
 The Central Limit Theorem for the Sample
Proportion
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7
Sampling distribution of
when the population proportion
is p = 0.10.
P Sampling distribution of
when the population proportion
is p = 0.30.
P
7-38
 Example: From the introductory case, Anne wants
to determine if the marketing campaign has had a
lingering effect on the proportion of customers who
are women and teenage girls.
 Before the campaign, p = 0.43 for women and p = 0.21 for
teenage girls. Based on 50 customers sampled after the
campaign, p = 0.46 and p = 0.34, respectively.
 Let’s find . Since n > 30, the central limit
theorem states that is approximately normal.
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7
 0.46P P 
P
7-39
7.3 The Sampling Distribution of the
Sample Proportion
LO 7.7
 
   
 
0.46 0.43
0.46
1 0.43 1 0.43
50
0.43 1 0.6664 0.3336
p p
P P P Z P Z
p p
n
P Z
  
  
       
   
  
   
    

More Related Content

What's hot

Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypothesesRajThakuri
 
Discrete Probability Distributions
Discrete Probability DistributionsDiscrete Probability Distributions
Discrete Probability Distributionsmandalina landy
 
Discrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec domsDiscrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec domsBabasab Patil
 
F Distribution
F  DistributionF  Distribution
F Distributionjravish
 
Simple linear regression and correlation
Simple linear regression and correlationSimple linear regression and correlation
Simple linear regression and correlationShakeel Nouman
 
Introduction and crd
Introduction and crdIntroduction and crd
Introduction and crdRione Drevale
 
Hypergeometric probability distribution
Hypergeometric probability distributionHypergeometric probability distribution
Hypergeometric probability distributionNadeem Uddin
 
Hypergeometric distribution
Hypergeometric distributionHypergeometric distribution
Hypergeometric distributionmohammad nouman
 
6.5 central limit
6.5 central limit6.5 central limit
6.5 central limitleblance
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variablesnszakir
 
The Wishart and inverse-wishart distribution
 The Wishart and inverse-wishart distribution The Wishart and inverse-wishart distribution
The Wishart and inverse-wishart distributionPankaj Das
 
Central limit theorem
Central limit theoremCentral limit theorem
Central limit theoremNadeem Uddin
 

What's hot (20)

Testing of hypotheses
Testing of hypothesesTesting of hypotheses
Testing of hypotheses
 
Discrete Probability Distributions
Discrete Probability DistributionsDiscrete Probability Distributions
Discrete Probability Distributions
 
The Central Limit Theorem
The Central Limit TheoremThe Central Limit Theorem
The Central Limit Theorem
 
Discrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec domsDiscrete and continuous probability distributions ppt @ bec doms
Discrete and continuous probability distributions ppt @ bec doms
 
F Distribution
F  DistributionF  Distribution
F Distribution
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Basics of Hypothesis Testing
Basics of Hypothesis Testing  Basics of Hypothesis Testing
Basics of Hypothesis Testing
 
Sampling Distribution
Sampling DistributionSampling Distribution
Sampling Distribution
 
Simple linear regression and correlation
Simple linear regression and correlationSimple linear regression and correlation
Simple linear regression and correlation
 
Introduction and crd
Introduction and crdIntroduction and crd
Introduction and crd
 
Hypergeometric probability distribution
Hypergeometric probability distributionHypergeometric probability distribution
Hypergeometric probability distribution
 
Hypergeometric distribution
Hypergeometric distributionHypergeometric distribution
Hypergeometric distribution
 
Point Estimation
Point EstimationPoint Estimation
Point Estimation
 
Z test, f-test,etc
Z test, f-test,etcZ test, f-test,etc
Z test, f-test,etc
 
Probability Distribution
Probability DistributionProbability Distribution
Probability Distribution
 
Sampling Distribution
Sampling DistributionSampling Distribution
Sampling Distribution
 
6.5 central limit
6.5 central limit6.5 central limit
6.5 central limit
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variables
 
The Wishart and inverse-wishart distribution
 The Wishart and inverse-wishart distribution The Wishart and inverse-wishart distribution
The Wishart and inverse-wishart distribution
 
Central limit theorem
Central limit theoremCentral limit theorem
Central limit theorem
 

Similar to Sampling distribution

Similar to Sampling distribution (20)

IPPTCh008.pptx
IPPTCh008.pptxIPPTCh008.pptx
IPPTCh008.pptx
 
sampling methods
sampling methodssampling methods
sampling methods
 
SAMPLING.pptx
SAMPLING.pptxSAMPLING.pptx
SAMPLING.pptx
 
Common sampling techniques
Common sampling techniquesCommon sampling techniques
Common sampling techniques
 
Brm sampling techniques
Brm sampling techniquesBrm sampling techniques
Brm sampling techniques
 
Chap008
Chap008Chap008
Chap008
 
Statistical Techniques in Business & Economics (McGRAV-HILL) 12 Edt. Chapter ...
Statistical Techniques in Business & Economics (McGRAV-HILL) 12 Edt. Chapter ...Statistical Techniques in Business & Economics (McGRAV-HILL) 12 Edt. Chapter ...
Statistical Techniques in Business & Economics (McGRAV-HILL) 12 Edt. Chapter ...
 
Chap008.ppt
Chap008.pptChap008.ppt
Chap008.ppt
 
Chapter-7-Sampling & sampling Distributions.pdf
Chapter-7-Sampling & sampling Distributions.pdfChapter-7-Sampling & sampling Distributions.pdf
Chapter-7-Sampling & sampling Distributions.pdf
 
sampling and statiscal inference
sampling and statiscal inferencesampling and statiscal inference
sampling and statiscal inference
 
Sampling
SamplingSampling
Sampling
 
Population & sample lecture 04
Population & sample lecture 04Population & sample lecture 04
Population & sample lecture 04
 
sampling
samplingsampling
sampling
 
43911
4391143911
43911
 
Characteristics of a good sample design & types of sample design
Characteristics of a good sample design & types of sample designCharacteristics of a good sample design & types of sample design
Characteristics of a good sample design & types of sample design
 
Inferential Statistics
Inferential StatisticsInferential Statistics
Inferential Statistics
 
Sampling techniques 2
Sampling techniques 2Sampling techniques 2
Sampling techniques 2
 
Sampling
Sampling Sampling
Sampling
 
Sampling Design.pptx
Sampling Design.pptxSampling Design.pptx
Sampling Design.pptx
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 

More from Danu Saputra

Statekbis - Pendugaan Interval
Statekbis - Pendugaan IntervalStatekbis - Pendugaan Interval
Statekbis - Pendugaan IntervalDanu Saputra
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distributionDanu Saputra
 
Pendugaan interval
Pendugaan intervalPendugaan interval
Pendugaan intervalDanu Saputra
 
Pengujian hipotesis
Pengujian hipotesisPengujian hipotesis
Pengujian hipotesisDanu Saputra
 
Pendugaan interval
Pendugaan intervalPendugaan interval
Pendugaan intervalDanu Saputra
 
Pengujian hipotesis
Pengujian hipotesisPengujian hipotesis
Pengujian hipotesisDanu Saputra
 
Pendugaan interval
Pendugaan intervalPendugaan interval
Pendugaan intervalDanu Saputra
 

More from Danu Saputra (11)

Statekbis - Pendugaan Interval
Statekbis - Pendugaan IntervalStatekbis - Pendugaan Interval
Statekbis - Pendugaan Interval
 
Sampling distribution
Sampling distributionSampling distribution
Sampling distribution
 
Trend dan vm
Trend dan vmTrend dan vm
Trend dan vm
 
Distribusi normal
Distribusi normalDistribusi normal
Distribusi normal
 
Pendugaan interval
Pendugaan intervalPendugaan interval
Pendugaan interval
 
Pengujian hipotesis
Pengujian hipotesisPengujian hipotesis
Pengujian hipotesis
 
Distribusi normal
Distribusi normalDistribusi normal
Distribusi normal
 
Trend dan vm
Trend dan vmTrend dan vm
Trend dan vm
 
Pendugaan interval
Pendugaan intervalPendugaan interval
Pendugaan interval
 
Pengujian hipotesis
Pengujian hipotesisPengujian hipotesis
Pengujian hipotesis
 
Pendugaan interval
Pendugaan intervalPendugaan interval
Pendugaan interval
 

Recently uploaded

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 

Recently uploaded (20)

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 

Sampling distribution

  • 1. Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
  • 2. 7-2 Chapter 7 Learning Objectives (LOs) LO 7.1: Differentiate between a population parameter and a sample statistic. LO 7.5: Describe the properties of the sampling distribution of the sample mean.
  • 3. 7-3 Chapter 7 Learning Objectives (LOs) LO 7.6: Explain the importance of the central limit theorem. LO 7.7: Describe the properties of the sampling distribution of the sample proportion. LO 7.8: Use a finite population correction factor.
  • 4. 7-4 7.1 Sampling LO 7.1 Differentiate between a population parameter and sample statistic.  Population—consists of all items of interest in a statistical problem.  Population Parameter is unknown.  Sample—a subset of the population.  Sample Statistic is calculated from sample and used to make inferences about the population.  Bias—the tendency of a sample statistic to systematically over- or underestimate a population parameter.
  • 5. 7-5 7.1 Sampling  Classic Case of a “Bad” Sample: The Literary Digest Debacle of 1936  During the1936 presidential election, the Literary Digest predicted a landslide victory for Alf Landon over Franklin D. Roosevelt (FDR) with only a 1% margin of error.  They were wrong! FDR won in a landslide election.  The Literary Digest had committed selection bias by randomly sampling from their own subscriber/ membership lists, etc.  In addition, with only a 24% response rate, the Literary Digest had a great deal of non-response bias. LO 7.2 Explain common sample biases.
  • 6. 7-6 7.1 SamplingLO 7.2  Selection bias—a systematic exclusion of certain groups from consideration for the sample.  The Literary Digest committed selection bias by excluding a large portion of the population (e.g., lower income voters).  Nonresponse bias—a systematic difference in preferences between respondents and non- respondents to a survey or a poll.  The Literary Digest had only a 24% response rate. This indicates that only those who cared a great deal about the election took the time to respond to the survey. These respondents may be atypical of the population as a whole.
  • 7. 7-7 7.1 Sampling  Sampling Methods  Simple random sample is a sample of n observations which has the same probability of being selected from the population as any other sample of n observations.  Most statistical methods presume simple random samples.  However, in some situations other sampling methods have an advantage over simple random samples. LO 7.3 Describe simple random sampling.
  • 8. 7-8 7.1 SamplingLO 7.3  Example: In 1961, students invested 24 hours per week in their academic pursuits, whereas today’s students study an average of 14 hours per week.  A dean at a large university in California wonders if this trend is reflective of the students at her university. The university has 20,000 students and the dean would like a sample of 100. Use Excel to draw a simple random sample of 100 students.  In Excel, choose Formulas > Insert function > RANDBETWEEN and input the values shown here.
  • 9. 7-9 Simple Random Sample: Using Table of Random Numbers A population consists of 845 employees of Nitra Industries. A sample of 52 employees is to be selected from that population. A more convenient method of selecting a random sample is to use the identification number of each employee and a table of random numbers such as the one in Appendix B.6.
  • 10. 7-10 Systematic Random Sampling EXAMPLE A population consists of 845 employees of Nitra Industries. A sample of 52 employees is to be selected from that population. First, k is calculated as the population size divided by the sample size. For Nitra Industries, we would select every 16th (845/52) employee list. If k is not a whole number, then round down. Random sampling is used in the selection of the first name. Then, select every 16th name on the list thereafter. Systematic Random Sampling: The items or individuals of the population are arranged in some order. A random starting point is selected and then every kth member of the population is selected for the sample.
  • 11. 7-11 7.1 Sampling  Stratified Random Sampling  Divide the population into mutually exclusive and collectively exhaustive groups, called strata.  Randomly select observations from each stratum, which are proportional to the stratum’s size.  Advantages:  Guarantees that the each population subdivision is represented in the sample.  Parameter estimates have greater precision than those estimated from simple random sampling. LO 7.4 Distinguish between stratified random sampling and cluster sampling.
  • 12. 7-12 Stratified Random Sampling Stratified Random Sampling: A population is first divided into subgroups, called strata, and a sample is selected from each stratum. Useful when a population can be clearly divided in groups based on some characteristics Suppose we want to study the advertising expenditures for the 352 largest companies in the United States to determine whether firms with high returns on equity (a measure of profitability) spent more of each sales dollar on advertising than firms with a low return or deficit. To make sure that the sample is a fair representation of the 352 companies, the companies are grouped on percent return on equity and a sample proportional to the relative size of the group is randomly selected.
  • 13. 7-13 7.1 SamplingLO 7.4  Cluster Sampling  Divide population into mutually exclusive and collectively exhaustive groups, called clusters.  Randomly select clusters.  Sample every observation in those randomly selected clusters.  Advantages and disadvantages:  Less expensive than other sampling methods.  Less precision than simple random sampling or stratified sampling.  Useful when clusters occur naturally in the population.
  • 14. 7-14 Cluster Sampling Cluster Sampling: A population is divided into clusters using naturally occurring geographic or other boundaries. Then, clusters are randomly selected and a sample is collected by randomly selecting from each cluster. Suppose you want to determine the views of residents in Oregon about state and federal environmental protection policies. Cluster sampling can be used by subdividing the state into small units— either counties or regions, select at random say 4 regions, then take samples of the residents in each of these regions and interview them. (Note that this is a combination of cluster sampling and simple random sampling.)
  • 15. 7-15 7.1 SamplingLO 7.4  Stratified versus Cluster Sampling  Stratified Sampling  Sample consists of elements from each group.  Preferred when the objective is to increase precision.  Cluster Sampling  Sample consists of elements from the selected groups.  Preferred when the objective is to reduce costs.
  • 16. 7-16 7.2 The Sampling Distribution of the Means  Population is described by parameters.  A parameter is a constant, whose value may be unknown.  Only one population.  Sample is described by statistics.  A statistic is a random variable whose value depends on the chosen random sample.  Statistics are used to make inferences about the population parameters.  Can draw multiple random samples of size n. LO 7.5 Describe the properties of the sampling distribution of the sample mean.
  • 17. 7-17 Methods of Probability Sampling  In nonprobability sample inclusion in the sample is based on the judgment of the person selecting the sample.  The sampling error is the difference between a sample statistic and its corresponding population parameter.
  • 18. 7-18 7.2 The Sampling Distribution of the Sample Mean  Estimator  A statistic that is used to estimate a population parameter.  For example, , the mean of the sample, is an estimator of m, the mean of the population.  Estimate  A particular value of the estimator.  For example, the mean of the sample is an estimate of m, the mean of the population. LO 7.5 X x
  • 19. 7-19 7.2 The Sampling Distribution of the Sample Mean  Sampling Distribution of the Mean  Each random sample of size n drawn from the population provides an estimate of m—the sample mean .  The sampling distribution of the sample mean is a probability distribution consisting of all possible sample means of a given sample size selected from a population. LO 7.5 X x
  • 20. 7-20 7.2 The Sampling Distribution of the Sample Mean  Example LO 7.5 Random Variable X1 X2 X3 X4 Mean of X 6 10 8 4 5.57 5 10 4 3 5.71 1 8 4 3 6.36 4 1 6 2 4.07 6 6 8 4 7 7 8 6 1 5 10 5 5 5 9 1 4 6 4 2 7 4 9 5 8 5 8 6 9 2 7 7 9 1 2 3 6 10 2 6 Means 5.57 5.71 6.36 4.07 5.43 One simple random sample drawn from the population—a single distribution of values of X. Means from each distribution (random draw) from the population. A distribution of means from each random draw from the population.
  • 21. 7-21 7.2 The Sampling Distribution of the Sample Mean  Example: Draw a sampling distribution of the sample mean from a population of X={1,2,3,4,5,6} from a sample of n=3 without replacement.  Process:  List all possible samples  Calculate each mean of all possible samples  Construct the distribution of the sample means LO 7.5
  • 22. 7-22 7.2 The Sampling Distribution of the Sample Mean  The Expected Value and Standard Deviation of the Sample Mean  Expected Value  The expected value of X,  The expected value of the sample mean, LO 7.5  E X m    E X E X m 
  • 23. 7-23 7.2 The Sampling Distribution of the Sample Mean  The Expected Value and Standard Deviation of the Sample Mean  Variance of X  Standard Deviation  of X  of LO 7.5   2 Var X  X Where n is the sample size. Also known as the Standard Error of the Mean.   2 SD X     SD X n  
  • 24. 7-24 7.2 The Sampling Distribution of the Sample Mean  The Central Limit Theorem  For any population X with expected value m and standard deviation , the sampling distribution of X will be approximately normal if the sample size n is sufficiently large.  As a general guideline, the normal distribution approximation is justified when n > 30.  As before, if is approximately normal, then we can transform it to LO 7.6 Explain the importance of the central limit theorem. X X Z n m   
  • 25. 7-25 Central Limit Theorem  If the population follows a normal probability distribution, then for any sample size the sampling distribution of the sample mean will also be normal.  If the population distribution is symmetrical (but not normal), the normal shape of the distribution of the sample mean emerge with samples as small as 10.  If a distribution that is skewed or has thick tails, it may require samples of 30 or more to observe the normality feature.  The mean of the sampling distribution equal to μ and the variance equal to σ2/n. CENTRAL LIMIT THEOREM If all samples of a particular size are selected from any population, the sampling distribution of the sample mean is approximately a normal distribution. This approximation improves with larger samples.
  • 26. 7-26
  • 27. 7-27 Using the Sampling Distribution of the Sample Mean (Sigma Known)  If a population follows the normal distribution, the sampling distribution of the sample mean will also follow the normal distribution.  If the shape is known to be nonnormal, but the sample contains at least 30 observations, the central limit theorem guarantees the sampling distribution of the mean follows a normal distribution.  To determine the probability a sample mean falls within a particular region, use: n X z  m 
  • 28. 7-28  If the population does not follow the normal distribution, but the sample is of at least 30 observations, the sample means will follow the normal distribution.  To determine the probability a sample mean falls within a particular region, use: ns X t m  Using the Sampling Distribution of the Sample Mean (Sigma Unknown)
  • 29. 7-29 7.2 The Sampling Distribution of the Sample Mean  Example: Given that m = 16 inches and  = 0.8 inches, determine the following:  What is the expected value and the standard deviation of the sample mean derived from a random sample of  2 pizzas  4 pizzas LO 7.5   16E X m    0.8 0.57 2 SD X n       16E X m    0.8 0.40 4 SD X n    
  • 30. 7-30 7.2 The Sampling Distribution of the Sample Mean  Sampling from a Normal Distribution  For any sample size n, the sampling distribution of is normal if the population X from which the sample is drawn is normally distributed.  If X is normal, then we can transform it into the standard normal random variable as: LO 7.5 X     X E X X Z SD X n m      For a sampling distribution.     x E X x Z SD X m      For a distribution of the values of X.
  • 31. 7-31 7.2 The Sampling Distribution of the Sample Mean LO 7.5  Example: Given that m = 16 inches and  = 0.8 inches, determine the following:  What is the probability that a randomly selected pizza is less than 15.5 inches?   What is the probability that 2 randomly selected pizzas average less than 15.5 inches?  15.5 16 0.63 0.8 x Z m        ( 15.5) ( 0.63) 0.2643 or 26.43% P X P Z     ( 15.5) ( 0.88) 0.1894 or 18.94% P X P Z     15.5 16 0.88 0.8 2 x Z n m       
  • 32. 7-32  Example: From the introductory case, Anne wants to determine if the marketing campaign has had a lingering effect on the amount of money customers spend on iced coffee.  Before the campaign, m = $4.18 and  = $0.84. Based on 50 customers sampled after the campaign, m = $4.26.  Let’s find . Since n > 30, the central limit theorem states that is approximately normal. So,  4.26P X  X 7.2 The Sampling Distribution of the Sample Mean LO 7.6     4.26 4.18 4.26 0.84 50 0.67 1 0.7486 0.2514 X P X P Z P Z n P Z m                       
  • 33. 7-33 7.3 The Sampling Distribution of the Sample Proportion LO 7.7 Describe the properties of the sampling distribution of the sample proportion.  Estimator  Sample proportion is used to estimate the population parameter p.  Estimate  A particular value of the estimator . P p
  • 34. 7-34 7.3 The Sampling Distribution of the Sample Proportion LO 7.7  The Expected Value and Standard Deviation of the Sample Proportion  Expected Value  The expected value of ,  The standard deviation of ,  E P p P P    1p p SD P n  
  • 35. 7-35 7.3 The Sampling Distribution of the Sample Proportion LO 7.7  The Central Limit Theorem for the Sample Proportion  For any population proportion p, the sampling distribution of is approximately normal if the sample size n is sufficiently large .  As a general guideline, the normal distribution approximation is justified when np > 5 and n(1  p) > 5. P
  • 36. 7-36 7.3 The Sampling Distribution of the Sample Proportion LO 7.7  The Central Limit Theorem for the Sample Proportion  If is normal, we can transform it into the standard normal random variable as  Therefore any value on has a corresponding value z on Z given by P      1 P E P P p Z SD P p p n      Pp     1 p p Z p p n
  • 37. 7-37  The Central Limit Theorem for the Sample Proportion 7.3 The Sampling Distribution of the Sample Proportion LO 7.7 Sampling distribution of when the population proportion is p = 0.10. P Sampling distribution of when the population proportion is p = 0.30. P
  • 38. 7-38  Example: From the introductory case, Anne wants to determine if the marketing campaign has had a lingering effect on the proportion of customers who are women and teenage girls.  Before the campaign, p = 0.43 for women and p = 0.21 for teenage girls. Based on 50 customers sampled after the campaign, p = 0.46 and p = 0.34, respectively.  Let’s find . Since n > 30, the central limit theorem states that is approximately normal. 7.3 The Sampling Distribution of the Sample Proportion LO 7.7  0.46P P  P
  • 39. 7-39 7.3 The Sampling Distribution of the Sample Proportion LO 7.7         0.46 0.43 0.46 1 0.43 1 0.43 50 0.43 1 0.6664 0.3336 p p P P P Z P Z p p n P Z                              