SlideShare una empresa de Scribd logo
1 de 21
Elementary Statistics
Chapter 3:
Describing, Exploring,
and Comparing Data
3.3 Measures of
Relative Standing and
Boxplots
1
Chapter 3:
Describing, Exploring, and Comparing Data
3.1 Measures of Center
3.2 Measures of Variation
3.3 Measures of Relative Standing and Boxplots
2
Objectives:
1. Summarize data, using measures of central tendency, such as the mean, median, mode,
and midrange.
2. Describe data, using measures of variation, such as the range, variance, and standard
deviation.
3. Identify the position of a data value in a data set, using various measures of position,
such as percentiles, deciles, and quartiles.
4. Use the techniques of exploratory data analysis, including boxplots and five-number
summaries, to discover various aspects of data
Recall: 3.1 Measures of Center
Measure of Center (Central Tendency)
A measure of center is a value at the center or
middle of a data set.
1. Mean: 𝑥 =
𝑥
𝑛
, 𝜇 =
𝑥
𝑁
, 𝑥 =
𝑓∙𝑥 𝑚
𝑛
2. Median: The middle value of ranked data
3. Mode: The value(s) that occur(s) with the
greatest frequency.
4. Midrange: 𝑀𝑟 =
𝑀𝑖𝑛+𝑀𝑎𝑥
2
5. Weighted Mean: 𝑥 =
𝑤∙𝑥
𝑤
3
Key Concept: Variation is the single most important topic in statistics.
This section presents three important measures of variation: range, standard
deviation, and variance.
Recall: 3.2 Measures of Variation
4
1. Range = Max - Min
2. Variance
3. Standard Deviation
4. Coefficient of Variation
5. Chebyshev’s Theorem
6. Empirical Rule (Normal)
7. Range Rule of Thumb for
Understanding Standard Deviation
𝑠 ≈
𝑅𝑎𝑛𝑔𝑒
4
& µ ± 2σ
1 – 1/k2
Use CVAR to compare
variabiity when the units are
different.
100%
s
CVAR
X
 
5
Recall: 3.2 Measures of Variation
The variance is the average of the squares of the distance
each value is from the mean.
The standard deviation is the square root of the variance.
The standard deviation is a measure of how spread out
your data are and how much data values deviate away from the
mean.
Notation
s = sample standard deviation
σ = population standard deviation
Usage & properties:
1. To determine the spread of the data.
2. To determine the consistency of a variable.
3. To determine the number of data values that fall within a specified
interval in a distribution (Chebyshev’s Theorem: 1 – 1/k2).
4. Used in inferential statistics.
5. The value of the standard deviation s is never negative. It is zero
only when all of the data values are exactly the same.
6. Larger values of s indicate greater amounts of variation.
𝑅 = 𝑀𝑎𝑥 − 𝑀𝑖𝑛
𝑠 ≈
𝑅
4
𝐶𝑉 =
𝑠
𝑥
(100%)
𝑥 ± 2𝑠
𝑥 =
𝑥
𝑛
𝜇 =
𝑥
𝑁
𝑠 =
(𝑥 − 𝑥)2
𝑛 − 1
𝜎 =
(𝑥 − 𝜇)2
𝑁
 
2
2
Population Variance:
X
N





 
2
Population Standard Deviation:
X
N





 
 
 
2
2
2
2
Sample Variance:
1
1
X X
X X
s
n
n
n n






 

 
 
 
2
2
2
Sample Standard Deviation
1
1
:
X X
X X
s
n
n
n n






 

TI Calculator:
How to enter data:
1. Stat
2. Edi
3. Highlight & Clear
4. Type in your data in L1, ..
TI Calculator:
Mean, SD, 5-number
summary
1. Stat
2. Calc
3. Select 1 for 1 variable
4. Type: L1 (second 1)
5. Scroll down for 5-
number summary
Key Concept: This section introduces measures of relative standing, which are
numbers showing the location of data values relative to the other values within
the same data set.
3.3 Measures of Relative Standing and Boxplots
Measures of Relative Standing (Position)
z-score
Percentile
Quartile
Outlier
Boxplot
6
x x
z
s


 # of values below
100%
total # of values
X
Percentile  
100 100
k P
L n n   
Quartiles separate the data set into 4
equal groups. Q1=P25, Q2=MD, Q3=P75
𝐼𝑄𝑅 = 𝑄3 − 𝑄1,
Outlier: A value < 𝑄1 − 1.5 𝐼𝑄𝑅
A value > 𝑄3+1.5 𝐼𝑄𝑅
Boxplot: 5- Number summary
z Scores
A z score (or standard score or standardized value) is the number of standard
deviations that a given value x is above or below the mean. The z score is calculated by
using one of the following:
A z-score or standard score for a value is obtained by subtracting the mean from
the value and dividing the result by the standard deviation.
Important Properties of z Scores
1. A z score is the number of standard deviations that a given value x is above or
below the mean.
2. z scores are expressed as numbers with no units of measurement.
3. A data value is significantly low if its z score is less than or equal to −2 or the
value is significantly high if its z score is greater than or equal to +2.
4. If an individual data value is less than the mean, its corresponding z score is a
negative number.
Sample
Population
3.3 Measures of Relative Standing and Boxplots
X X
z
s


X
z




Values:
z scores ≤ −2.00 or
z scores ≥ 2.00
7
Example 1
8
Which of the following two data values is more extreme relative to the data
set from which it came?
x x
z
s


4000 3152
693.4

 1.22
99 398.20
0.62

 1.29
Temperature
The 4000 g weight of a baby (n = 400, 𝑥 = 3152.0𝑔, 𝑠 = 693.4𝑔)
The 990
𝐹 temperature of an adult (n = 106, 𝑥 = 98.200
𝐹, 𝑠 = 0.620
𝐹)
x x
z
s


Example 2
9
A student scored 65 on a calculus test that had a mean of 50 and a standard
deviation of 10; she scored 30 on a history test with a mean of 25 and a
standard deviation of 5. Compare her relative positions on the two tests.
She has a higher relative position in the Calculus class.
30 25
History:
5
z


65 50
Calculus:
10
z

 1.5
1.0
x x
z
s


x
z




x
z




Example 3
10
75 239.4
64.2

 2.56 
Interpretation: 𝑧 = −2.56 < −2 → the platelet count of 75 is significantly low.
Platelets clump together and form clots to stop the bleeding during injury.
The lowest platelet count in a dataset is 75 (platelet counts are measured in
1000 cells/𝜇𝐿), is this significantly low? ( 𝑥 = 239.4, 𝑠 = 64.2)
x x
z
s


x x
z
s


Percentiles are measures of location, denoted P1, P2, . . . , P99, which divide a
set of data into 100 groups with about 1% of the values in each group.
(Percentiles separate the data set into 100 equal groups.)
A percentile rank for a datum represents the percentage of data values
below the datum (round the result to the nearest whole number):
3.3 Measures of Relative Standing and Boxplots
 # of values below
100%
total # of values
X
Percentile  
 # of values below 0.5
Some Texts : 100%
total # of values
X
Percentile

 
n total number of values in the data set
k percentile being used (Example: For the 25th percentile, k = 25.)
L locator that gives the position of a value (Example: For the 12th
value in the sorted list, L = 12.)
Pk kth percentile (Example: P25 is the 25th percentile.)
11100 100
k P
L n n   
Example 4
12
Fifty (50) cell phone data speeds listed below are arranged in increasing order.
Find the percentile for the data speed of 11.8 Mbps.
 # of values below
100%
total # of values
X
Percentile  
100
n k
L


0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2
6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6
11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1
15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4
23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8
 # of values below 11.8
100%
total # of values
Percentile  
20
100%
50
  40%
Interpretation:
A data speed of 11.8 Mbps is in the 40th percentile and separates the lowest
40% of values from the highest 60% of values. We have P40 = 11.8 Mbps.
Example 5
13
For the 50 cell phone data speeds listed, find the
20th percentile, denoted by P20.
0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2
6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6
11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1
15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4
23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8
Converting a Percentile to a Data Value
100 100
k P
L n n   
Solution: k = 20, n = 50
100 100
k P
L n n   
20
50
100
  10
Whole Number: The value of the 20th percentile is
between the Lth (10th ) value and the L + 1st (11st )value.
The 20th percentile is P20 = (6.2 + 6.5) / 2 = 6.35 Mbps.
Example 6
14
For these cell phone data speeds listed, find the 87th
percentile, denoted by P87.
0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2
6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6
11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1
15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4
23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8
Converting a Percentile to a Data Value
100 100
k P
L n n   
Solution: k = 87, n = 50
100 100
k P
L n n   
87
50
100
  43.5
Not a Whole Number: Round up ALWAYS
The value of the 87th percentile is the 44th value.
P87 = 28.5 Mbps
Quartiles
Quartiles are measures of location, denoted Q1, Q2, and Q3, which divide a set of data into four groups with
about 25% of the values in each group. (Quartiles separate the data set into 4 equal groups. Q1=P25, Q2=MD,
Q3=P75 )
Q1 (First quartile, or P25) It separates the bottom 25% of the sorted values from the top 75%.
Q2 (Second quartile, or P50 ) and same as the median. It separates the bottom 50% of the sorted values from the top
50%.
Q3 (Third quartile): Same as P75. It separates the bottom 75% of the sorted values from the top 25%.
Caution Just as there is not universal agreement on a procedure for finding percentiles, there is not universal
agreement on a single procedure for calculating quartiles, and different technologies often yield different results.
Deciles separate the data set into 10 equal groups. D1=P10, D4=P40
The Interquartile Range, IQR = Q3 – Q1.
Step 1 Arrange the data in order from lowest to highest.
Step 2 Find the median of the data values. This is the value for Q2.
Step 3 Find the median of the data values that fall below Q2. This is the value for Q1.
Step 4 Find the median of the data values that fall above Q2. This is the value for Q3.
15
3.3 Measures of Relative Standing and Boxplots
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
𝑆𝑒𝑚𝑖 𝐼𝑄𝑅 = (𝑄3−𝑄1)/2
𝑀𝑖𝑑 𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑟𝑎𝑛𝑔𝑒: = (𝑄3+𝑄1)/2
10-90 𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑟𝑎𝑛𝑔𝑒: = 𝑃90 − 𝑃10
Example 7
16
Given the fifty (50) data speeds listed, find the
5-number summary.
0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2
6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6
11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1
15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4
23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8
5-Number Summary
Min = 0.8 Mbps & Max = 77.8 Mbps
The median is equal to MD = Q2 = (13.7 + 14.1) / 2 = 13.9 Mbps.
5-Number Summary
Consists of:
1. Minimum
2. First quartile, Q1
3. Second quartile, Q2
(same as the median)
4. Third quartile, Q3
5. Maximum
Q1 = 7.9 Mbps (Median of the bottom half)
Q3 = 21.5 Mbps (Median of the top half)
Example 8
17
Given the fifty (50) data speeds listed, construct a
boxplot. Identify Outliers if any.
0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2
6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6
11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1
15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4
23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8
Boxplots A boxplot (or box-and-
whisker diagram) is a
graph that consists of a
line extending from the
min to the max value,
and a box with lines
drawn at the first
quartile Q1, the median,
and the third quartile Q3.
Min = 0.8, Q1 = 7.9, Q2 = 13.9, Q3 = 21.5, Max = 77.8
Constructing a Boxplot
1. Find the 5-number
summary (Min, Q1, Q2,
Q3, Max).
2. Construct a line segment
extending from the
minimum to the maximum
data value.
3. Construct a box
(rectangle) extending from
Q1 to Q3, and draw a line
in the box at the value of
Q2 (median).
IQR = Q3 - Q1 = 21.5 - 7.9 = 13.6, Q1 - 1.5 IQR = -12.5 & Q3 +1.5 IQR = 41.9
Any numbers smaller than -12.5 and larger than 41.9 is considered an outlier.
43.0, 55.6, 71.3 & 77.8
Skewness
A boxplot can often be used to identify skewness. A distribution of data is skewed if it is not
symmetric and extends more to one side than to the other.
Modified Boxplots
A modified boxplot is a regular boxplot constructed with these modifications:
1. A special symbol (such as an asterisk or point) is used to identify outliers as defined above,
and
2. the solid horizontal line extends only as far as the minimum data value that is not an outlier
and the maximum data value that is not an outlier.
Identifying Outliers (An outlier is a value that lies very far away from the vast majority of the other
values in a data set.) for Modified Boxplots
1. Find the quartiles Q1, Q2, and Q3.
2. Find the interquartile range (IQR), where IQR = Q3 − Q1.
3. Evaluate 1.5 × IQR.
4. In a modified boxplot, a data value is an outlier if it is above Q3, by an amount greater than 1.5 ×
IQR or below Q1, by an amount greater than 1.5 × IQR.
18
3.3 Measures of Relative Standing and Boxplots
Boxplots
Example 9
19
Given the fifty (50) data speeds listed, find the
40th percentile, denoted by P40.
0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2
6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6
11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1
15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4
23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8
Converting a Percentile to a Data Value (No need)
100 100
k P
L n n   
Solution: k = 40, n = 50
100 100
k P
L n n   
40
50
100
  20
Whole Number: The value of the 40th percentile is
between the Lth (20th ) value and the 21st value.
The 40th percentile is P40 = 11.7 Mbps.
Example 10
20
A teacher gives a 20-point test to 10 students. Find the percentile rank of a score of
12.
18, 15, 12, 6, 8, 2, 3, 5, 20, 10
100
n k
L

Sort in ascending order.
2, 3, 5, 6, 8, 10, 12, 15, 18, 20
Interpretation:
A student whose score was 12 did better than 65% of the class.
 # of values below 0.5
100%
total # of values
X
Percentile

 
6 values
6 0.5
100%
10
65%

 

 # of values below 0.5
Some Texts : 100%
total # of values
X
Percentile

 
Example 11
21
A teacher gives a 20-point test to 10 students. Find the value corresponding to the
25th percentile.
18, 15, 12, 6, 8, 2, 3, 5, 20, 10
100
n k
L

Sort in ascending order.
2, 3, 5, 6, 8, 10, 12, 15, 18, 20
Interpretation:
The value 5 corresponds to the 25th percentile.
100
n p
c


10 25
2.5
100

  3

Más contenido relacionado

La actualidad más candente

Stem and-leaf plots
Stem and-leaf plotsStem and-leaf plots
Stem and-leaf plotsValPatton
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Modelnszakir
 
The sampling distribution
The sampling distributionThe sampling distribution
The sampling distributionHarve Abella
 
Place value with decimals
Place value with decimalsPlace value with decimals
Place value with decimalsDeepak Kumar
 
Introduction to Statistics - Basic concepts
Introduction to Statistics - Basic conceptsIntroduction to Statistics - Basic concepts
Introduction to Statistics - Basic conceptsDocIbrahimAbdelmonaem
 
Chapter 1 : SIGNIFICANT FIGURES AND STANDARD FORM
Chapter 1 : SIGNIFICANT FIGURES AND STANDARD FORMChapter 1 : SIGNIFICANT FIGURES AND STANDARD FORM
Chapter 1 : SIGNIFICANT FIGURES AND STANDARD FORMNurul Ainn
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statisticsAndi Koentary
 
Measures of Variability
Measures of VariabilityMeasures of Variability
Measures of Variabilityjasondroesch
 
Completing the square
Completing the squareCompleting the square
Completing the squareDon Simmons
 
Confidence Interval Estimation
Confidence Interval EstimationConfidence Interval Estimation
Confidence Interval EstimationYesica Adicondro
 
Interval Estimation & Estimation Of Proportion
Interval Estimation & Estimation Of ProportionInterval Estimation & Estimation Of Proportion
Interval Estimation & Estimation Of ProportionDataminingTools Inc
 
2.3 Graphs that enlighten and graphs that deceive
2.3 Graphs that enlighten and graphs that deceive2.3 Graphs that enlighten and graphs that deceive
2.3 Graphs that enlighten and graphs that deceiveLong Beach City College
 

La actualidad más candente (20)

Stem and-leaf plots
Stem and-leaf plotsStem and-leaf plots
Stem and-leaf plots
 
Chapter 4 part1-Probability Model
Chapter 4 part1-Probability ModelChapter 4 part1-Probability Model
Chapter 4 part1-Probability Model
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
The sampling distribution
The sampling distributionThe sampling distribution
The sampling distribution
 
Place value with decimals
Place value with decimalsPlace value with decimals
Place value with decimals
 
Introduction to Statistics - Basic concepts
Introduction to Statistics - Basic conceptsIntroduction to Statistics - Basic concepts
Introduction to Statistics - Basic concepts
 
Chapter 1 : SIGNIFICANT FIGURES AND STANDARD FORM
Chapter 1 : SIGNIFICANT FIGURES AND STANDARD FORMChapter 1 : SIGNIFICANT FIGURES AND STANDARD FORM
Chapter 1 : SIGNIFICANT FIGURES AND STANDARD FORM
 
Boxplot
BoxplotBoxplot
Boxplot
 
lfstat3e_ppt_01_rev.ppt
lfstat3e_ppt_01_rev.pptlfstat3e_ppt_01_rev.ppt
lfstat3e_ppt_01_rev.ppt
 
Introduction to statistics
Introduction to statisticsIntroduction to statistics
Introduction to statistics
 
Measures of Variability
Measures of VariabilityMeasures of Variability
Measures of Variability
 
Completing the square
Completing the squareCompleting the square
Completing the square
 
Confidence Interval Estimation
Confidence Interval EstimationConfidence Interval Estimation
Confidence Interval Estimation
 
Les5e ppt 04
Les5e ppt 04Les5e ppt 04
Les5e ppt 04
 
Interval Estimation & Estimation Of Proportion
Interval Estimation & Estimation Of ProportionInterval Estimation & Estimation Of Proportion
Interval Estimation & Estimation Of Proportion
 
2.3 Graphs that enlighten and graphs that deceive
2.3 Graphs that enlighten and graphs that deceive2.3 Graphs that enlighten and graphs that deceive
2.3 Graphs that enlighten and graphs that deceive
 
Measures of dispersion discuss 2.2
Measures of dispersion discuss 2.2Measures of dispersion discuss 2.2
Measures of dispersion discuss 2.2
 
Data Visualisation
Data VisualisationData Visualisation
Data Visualisation
 
Estimating a Population Mean
Estimating a Population Mean  Estimating a Population Mean
Estimating a Population Mean
 
Frequency distribution
Frequency distributionFrequency distribution
Frequency distribution
 

Similar a 3.3 Measures of relative standing and boxplots

Measures of Relative Standing and Boxplots
Measures of Relative Standing and BoxplotsMeasures of Relative Standing and Boxplots
Measures of Relative Standing and BoxplotsLong Beach City College
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsBabasab Patil
 
3. Descriptive statistics.pdf
3. Descriptive statistics.pdf3. Descriptive statistics.pdf
3. Descriptive statistics.pdfYomifDeksisaHerpa
 
Penggambaran Data Secara Numerik
Penggambaran Data Secara NumerikPenggambaran Data Secara Numerik
Penggambaran Data Secara Numerikanom1392
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of DispersionKainatIqbal7
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptxVanmala Buchke
 
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Neeraj Bhandari
 
3Measurements of health and disease_MCTD.pdf
3Measurements of health and disease_MCTD.pdf3Measurements of health and disease_MCTD.pdf
3Measurements of health and disease_MCTD.pdfAmanuelDina
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceLong Beach City College
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Long Beach City College
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxAnusuya123
 

Similar a 3.3 Measures of relative standing and boxplots (20)

Measures of Relative Standing and Boxplots
Measures of Relative Standing and BoxplotsMeasures of Relative Standing and Boxplots
Measures of Relative Standing and Boxplots
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec doms
 
3.2 Measures of variation
3.2 Measures of variation3.2 Measures of variation
3.2 Measures of variation
 
Measures of Variation
Measures of Variation Measures of Variation
Measures of Variation
 
3. Descriptive statistics.pdf
3. Descriptive statistics.pdf3. Descriptive statistics.pdf
3. Descriptive statistics.pdf
 
Penggambaran Data Secara Numerik
Penggambaran Data Secara NumerikPenggambaran Data Secara Numerik
Penggambaran Data Secara Numerik
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 
Statistics 3, 4
Statistics 3, 4Statistics 3, 4
Statistics 3, 4
 
Mod mean quartile
Mod mean quartileMod mean quartile
Mod mean quartile
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of Dispersion
 
Measures of Dispersion.pptx
Measures of Dispersion.pptxMeasures of Dispersion.pptx
Measures of Dispersion.pptx
 
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
Measure of dispersion by Neeraj Bhandari ( Surkhet.Nepal )
 
Measures of dispersion
Measures of dispersionMeasures of dispersion
Measures of dispersion
 
3Measurements of health and disease_MCTD.pdf
3Measurements of health and disease_MCTD.pdf3Measurements of health and disease_MCTD.pdf
3Measurements of health and disease_MCTD.pdf
 
2 UNIT-DSP.pptx
2 UNIT-DSP.pptx2 UNIT-DSP.pptx
2 UNIT-DSP.pptx
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or VarianceEstimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance Estimating a Population Standard Deviation or Variance
Estimating a Population Standard Deviation or Variance
 
Statistics.pdf
Statistics.pdfStatistics.pdf
Statistics.pdf
 
Basic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptxBasic Statistical Descriptions of Data.pptx
Basic Statistical Descriptions of Data.pptx
 

Más de Long Beach City College

Practice test ch 9 inferences from two samples
Practice test ch 9 inferences from two samplesPractice test ch 9 inferences from two samples
Practice test ch 9 inferences from two samplesLong Beach City College
 
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anovaSolution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anovaLong Beach City College
 
Practice test ch 10 correlation reg ch 11 gof ch12 anova
Practice test ch 10 correlation reg ch 11 gof ch12 anovaPractice test ch 10 correlation reg ch 11 gof ch12 anova
Practice test ch 10 correlation reg ch 11 gof ch12 anovaLong Beach City College
 
Practice test ch 8 hypothesis testing ch 9 two populations
Practice test ch 8 hypothesis testing ch 9 two populationsPractice test ch 8 hypothesis testing ch 9 two populations
Practice test ch 8 hypothesis testing ch 9 two populationsLong Beach City College
 
Solution to the practice test ch 8 hypothesis testing ch 9 two populations
Solution to the practice test ch 8 hypothesis testing ch 9 two populationsSolution to the practice test ch 8 hypothesis testing ch 9 two populations
Solution to the practice test ch 8 hypothesis testing ch 9 two populationsLong Beach City College
 
Solution to the Practice Test 3A, Chapter 6 Normal Probability Distribution
Solution to the Practice Test 3A, Chapter 6 Normal Probability DistributionSolution to the Practice Test 3A, Chapter 6 Normal Probability Distribution
Solution to the Practice Test 3A, Chapter 6 Normal Probability DistributionLong Beach City College
 
Practice Test Chapter 6 (Normal Probability Distributions)
Practice Test Chapter 6 (Normal Probability Distributions)Practice Test Chapter 6 (Normal Probability Distributions)
Practice Test Chapter 6 (Normal Probability Distributions)Long Beach City College
 

Más de Long Beach City College (20)

Practice test ch 9 inferences from two samples
Practice test ch 9 inferences from two samplesPractice test ch 9 inferences from two samples
Practice test ch 9 inferences from two samples
 
Practice Test Ch 8 Hypothesis Testing
Practice Test Ch 8 Hypothesis TestingPractice Test Ch 8 Hypothesis Testing
Practice Test Ch 8 Hypothesis Testing
 
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anovaSolution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
Solution to the practice test ch 10 correlation reg ch 11 gof ch12 anova
 
Practice test ch 10 correlation reg ch 11 gof ch12 anova
Practice test ch 10 correlation reg ch 11 gof ch12 anovaPractice test ch 10 correlation reg ch 11 gof ch12 anova
Practice test ch 10 correlation reg ch 11 gof ch12 anova
 
Practice test ch 8 hypothesis testing ch 9 two populations
Practice test ch 8 hypothesis testing ch 9 two populationsPractice test ch 8 hypothesis testing ch 9 two populations
Practice test ch 8 hypothesis testing ch 9 two populations
 
Solution to the practice test ch 8 hypothesis testing ch 9 two populations
Solution to the practice test ch 8 hypothesis testing ch 9 two populationsSolution to the practice test ch 8 hypothesis testing ch 9 two populations
Solution to the practice test ch 8 hypothesis testing ch 9 two populations
 
Solution to the Practice Test 3A, Chapter 6 Normal Probability Distribution
Solution to the Practice Test 3A, Chapter 6 Normal Probability DistributionSolution to the Practice Test 3A, Chapter 6 Normal Probability Distribution
Solution to the Practice Test 3A, Chapter 6 Normal Probability Distribution
 
Practice Test Chapter 6 (Normal Probability Distributions)
Practice Test Chapter 6 (Normal Probability Distributions)Practice Test Chapter 6 (Normal Probability Distributions)
Practice Test Chapter 6 (Normal Probability Distributions)
 
Practice Test 2 Solutions
Practice Test 2  SolutionsPractice Test 2  Solutions
Practice Test 2 Solutions
 
Practice Test 2 Probability
Practice Test 2 ProbabilityPractice Test 2 Probability
Practice Test 2 Probability
 
Practice Test 1 solutions
Practice Test 1 solutions  Practice Test 1 solutions
Practice Test 1 solutions
 
Practice Test 1
Practice Test 1Practice Test 1
Practice Test 1
 
Stat sample test ch 12 solution
Stat sample test ch 12 solutionStat sample test ch 12 solution
Stat sample test ch 12 solution
 
Stat sample test ch 12
Stat sample test ch 12Stat sample test ch 12
Stat sample test ch 12
 
Stat sample test ch 11
Stat sample test ch 11Stat sample test ch 11
Stat sample test ch 11
 
Stat sample test ch 10
Stat sample test ch 10Stat sample test ch 10
Stat sample test ch 10
 
Two-Way ANOVA
Two-Way ANOVATwo-Way ANOVA
Two-Way ANOVA
 
One-Way ANOVA
One-Way ANOVAOne-Way ANOVA
One-Way ANOVA
 
Contingency Tables
Contingency TablesContingency Tables
Contingency Tables
 
Goodness of Fit Notation
Goodness of Fit NotationGoodness of Fit Notation
Goodness of Fit Notation
 

Último

Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinojohnmickonozaleda
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 

Último (20)

Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
FILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipinoFILIPINO PSYCHology sikolohiyang pilipino
FILIPINO PSYCHology sikolohiyang pilipino
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 

3.3 Measures of relative standing and boxplots

  • 1. Elementary Statistics Chapter 3: Describing, Exploring, and Comparing Data 3.3 Measures of Relative Standing and Boxplots 1
  • 2. Chapter 3: Describing, Exploring, and Comparing Data 3.1 Measures of Center 3.2 Measures of Variation 3.3 Measures of Relative Standing and Boxplots 2 Objectives: 1. Summarize data, using measures of central tendency, such as the mean, median, mode, and midrange. 2. Describe data, using measures of variation, such as the range, variance, and standard deviation. 3. Identify the position of a data value in a data set, using various measures of position, such as percentiles, deciles, and quartiles. 4. Use the techniques of exploratory data analysis, including boxplots and five-number summaries, to discover various aspects of data
  • 3. Recall: 3.1 Measures of Center Measure of Center (Central Tendency) A measure of center is a value at the center or middle of a data set. 1. Mean: 𝑥 = 𝑥 𝑛 , 𝜇 = 𝑥 𝑁 , 𝑥 = 𝑓∙𝑥 𝑚 𝑛 2. Median: The middle value of ranked data 3. Mode: The value(s) that occur(s) with the greatest frequency. 4. Midrange: 𝑀𝑟 = 𝑀𝑖𝑛+𝑀𝑎𝑥 2 5. Weighted Mean: 𝑥 = 𝑤∙𝑥 𝑤 3
  • 4. Key Concept: Variation is the single most important topic in statistics. This section presents three important measures of variation: range, standard deviation, and variance. Recall: 3.2 Measures of Variation 4 1. Range = Max - Min 2. Variance 3. Standard Deviation 4. Coefficient of Variation 5. Chebyshev’s Theorem 6. Empirical Rule (Normal) 7. Range Rule of Thumb for Understanding Standard Deviation 𝑠 ≈ 𝑅𝑎𝑛𝑔𝑒 4 & µ ± 2σ 1 – 1/k2 Use CVAR to compare variabiity when the units are different. 100% s CVAR X  
  • 5. 5 Recall: 3.2 Measures of Variation The variance is the average of the squares of the distance each value is from the mean. The standard deviation is the square root of the variance. The standard deviation is a measure of how spread out your data are and how much data values deviate away from the mean. Notation s = sample standard deviation σ = population standard deviation Usage & properties: 1. To determine the spread of the data. 2. To determine the consistency of a variable. 3. To determine the number of data values that fall within a specified interval in a distribution (Chebyshev’s Theorem: 1 – 1/k2). 4. Used in inferential statistics. 5. The value of the standard deviation s is never negative. It is zero only when all of the data values are exactly the same. 6. Larger values of s indicate greater amounts of variation. 𝑅 = 𝑀𝑎𝑥 − 𝑀𝑖𝑛 𝑠 ≈ 𝑅 4 𝐶𝑉 = 𝑠 𝑥 (100%) 𝑥 ± 2𝑠 𝑥 = 𝑥 𝑛 𝜇 = 𝑥 𝑁 𝑠 = (𝑥 − 𝑥)2 𝑛 − 1 𝜎 = (𝑥 − 𝜇)2 𝑁   2 2 Population Variance: X N        2 Population Standard Deviation: X N            2 2 2 2 Sample Variance: 1 1 X X X X s n n n n                2 2 2 Sample Standard Deviation 1 1 : X X X X s n n n n          TI Calculator: How to enter data: 1. Stat 2. Edi 3. Highlight & Clear 4. Type in your data in L1, .. TI Calculator: Mean, SD, 5-number summary 1. Stat 2. Calc 3. Select 1 for 1 variable 4. Type: L1 (second 1) 5. Scroll down for 5- number summary
  • 6. Key Concept: This section introduces measures of relative standing, which are numbers showing the location of data values relative to the other values within the same data set. 3.3 Measures of Relative Standing and Boxplots Measures of Relative Standing (Position) z-score Percentile Quartile Outlier Boxplot 6 x x z s    # of values below 100% total # of values X Percentile   100 100 k P L n n    Quartiles separate the data set into 4 equal groups. Q1=P25, Q2=MD, Q3=P75 𝐼𝑄𝑅 = 𝑄3 − 𝑄1, Outlier: A value < 𝑄1 − 1.5 𝐼𝑄𝑅 A value > 𝑄3+1.5 𝐼𝑄𝑅 Boxplot: 5- Number summary
  • 7. z Scores A z score (or standard score or standardized value) is the number of standard deviations that a given value x is above or below the mean. The z score is calculated by using one of the following: A z-score or standard score for a value is obtained by subtracting the mean from the value and dividing the result by the standard deviation. Important Properties of z Scores 1. A z score is the number of standard deviations that a given value x is above or below the mean. 2. z scores are expressed as numbers with no units of measurement. 3. A data value is significantly low if its z score is less than or equal to −2 or the value is significantly high if its z score is greater than or equal to +2. 4. If an individual data value is less than the mean, its corresponding z score is a negative number. Sample Population 3.3 Measures of Relative Standing and Boxplots X X z s   X z     Values: z scores ≤ −2.00 or z scores ≥ 2.00 7
  • 8. Example 1 8 Which of the following two data values is more extreme relative to the data set from which it came? x x z s   4000 3152 693.4   1.22 99 398.20 0.62   1.29 Temperature The 4000 g weight of a baby (n = 400, 𝑥 = 3152.0𝑔, 𝑠 = 693.4𝑔) The 990 𝐹 temperature of an adult (n = 106, 𝑥 = 98.200 𝐹, 𝑠 = 0.620 𝐹) x x z s  
  • 9. Example 2 9 A student scored 65 on a calculus test that had a mean of 50 and a standard deviation of 10; she scored 30 on a history test with a mean of 25 and a standard deviation of 5. Compare her relative positions on the two tests. She has a higher relative position in the Calculus class. 30 25 History: 5 z   65 50 Calculus: 10 z   1.5 1.0 x x z s   x z     x z    
  • 10. Example 3 10 75 239.4 64.2   2.56  Interpretation: 𝑧 = −2.56 < −2 → the platelet count of 75 is significantly low. Platelets clump together and form clots to stop the bleeding during injury. The lowest platelet count in a dataset is 75 (platelet counts are measured in 1000 cells/𝜇𝐿), is this significantly low? ( 𝑥 = 239.4, 𝑠 = 64.2) x x z s   x x z s  
  • 11. Percentiles are measures of location, denoted P1, P2, . . . , P99, which divide a set of data into 100 groups with about 1% of the values in each group. (Percentiles separate the data set into 100 equal groups.) A percentile rank for a datum represents the percentage of data values below the datum (round the result to the nearest whole number): 3.3 Measures of Relative Standing and Boxplots  # of values below 100% total # of values X Percentile    # of values below 0.5 Some Texts : 100% total # of values X Percentile    n total number of values in the data set k percentile being used (Example: For the 25th percentile, k = 25.) L locator that gives the position of a value (Example: For the 12th value in the sorted list, L = 12.) Pk kth percentile (Example: P25 is the 25th percentile.) 11100 100 k P L n n   
  • 12. Example 4 12 Fifty (50) cell phone data speeds listed below are arranged in increasing order. Find the percentile for the data speed of 11.8 Mbps.  # of values below 100% total # of values X Percentile   100 n k L   0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8  # of values below 11.8 100% total # of values Percentile   20 100% 50   40% Interpretation: A data speed of 11.8 Mbps is in the 40th percentile and separates the lowest 40% of values from the highest 60% of values. We have P40 = 11.8 Mbps.
  • 13. Example 5 13 For the 50 cell phone data speeds listed, find the 20th percentile, denoted by P20. 0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8 Converting a Percentile to a Data Value 100 100 k P L n n    Solution: k = 20, n = 50 100 100 k P L n n    20 50 100   10 Whole Number: The value of the 20th percentile is between the Lth (10th ) value and the L + 1st (11st )value. The 20th percentile is P20 = (6.2 + 6.5) / 2 = 6.35 Mbps.
  • 14. Example 6 14 For these cell phone data speeds listed, find the 87th percentile, denoted by P87. 0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8 Converting a Percentile to a Data Value 100 100 k P L n n    Solution: k = 87, n = 50 100 100 k P L n n    87 50 100   43.5 Not a Whole Number: Round up ALWAYS The value of the 87th percentile is the 44th value. P87 = 28.5 Mbps
  • 15. Quartiles Quartiles are measures of location, denoted Q1, Q2, and Q3, which divide a set of data into four groups with about 25% of the values in each group. (Quartiles separate the data set into 4 equal groups. Q1=P25, Q2=MD, Q3=P75 ) Q1 (First quartile, or P25) It separates the bottom 25% of the sorted values from the top 75%. Q2 (Second quartile, or P50 ) and same as the median. It separates the bottom 50% of the sorted values from the top 50%. Q3 (Third quartile): Same as P75. It separates the bottom 75% of the sorted values from the top 25%. Caution Just as there is not universal agreement on a procedure for finding percentiles, there is not universal agreement on a single procedure for calculating quartiles, and different technologies often yield different results. Deciles separate the data set into 10 equal groups. D1=P10, D4=P40 The Interquartile Range, IQR = Q3 – Q1. Step 1 Arrange the data in order from lowest to highest. Step 2 Find the median of the data values. This is the value for Q2. Step 3 Find the median of the data values that fall below Q2. This is the value for Q1. Step 4 Find the median of the data values that fall above Q2. This is the value for Q3. 15 3.3 Measures of Relative Standing and Boxplots 𝐼𝑄𝑅 = 𝑄3 − 𝑄1 𝑆𝑒𝑚𝑖 𝐼𝑄𝑅 = (𝑄3−𝑄1)/2 𝑀𝑖𝑑 𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑟𝑎𝑛𝑔𝑒: = (𝑄3+𝑄1)/2 10-90 𝑄𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑟𝑎𝑛𝑔𝑒: = 𝑃90 − 𝑃10
  • 16. Example 7 16 Given the fifty (50) data speeds listed, find the 5-number summary. 0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8 5-Number Summary Min = 0.8 Mbps & Max = 77.8 Mbps The median is equal to MD = Q2 = (13.7 + 14.1) / 2 = 13.9 Mbps. 5-Number Summary Consists of: 1. Minimum 2. First quartile, Q1 3. Second quartile, Q2 (same as the median) 4. Third quartile, Q3 5. Maximum Q1 = 7.9 Mbps (Median of the bottom half) Q3 = 21.5 Mbps (Median of the top half)
  • 17. Example 8 17 Given the fifty (50) data speeds listed, construct a boxplot. Identify Outliers if any. 0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8 Boxplots A boxplot (or box-and- whisker diagram) is a graph that consists of a line extending from the min to the max value, and a box with lines drawn at the first quartile Q1, the median, and the third quartile Q3. Min = 0.8, Q1 = 7.9, Q2 = 13.9, Q3 = 21.5, Max = 77.8 Constructing a Boxplot 1. Find the 5-number summary (Min, Q1, Q2, Q3, Max). 2. Construct a line segment extending from the minimum to the maximum data value. 3. Construct a box (rectangle) extending from Q1 to Q3, and draw a line in the box at the value of Q2 (median). IQR = Q3 - Q1 = 21.5 - 7.9 = 13.6, Q1 - 1.5 IQR = -12.5 & Q3 +1.5 IQR = 41.9 Any numbers smaller than -12.5 and larger than 41.9 is considered an outlier. 43.0, 55.6, 71.3 & 77.8
  • 18. Skewness A boxplot can often be used to identify skewness. A distribution of data is skewed if it is not symmetric and extends more to one side than to the other. Modified Boxplots A modified boxplot is a regular boxplot constructed with these modifications: 1. A special symbol (such as an asterisk or point) is used to identify outliers as defined above, and 2. the solid horizontal line extends only as far as the minimum data value that is not an outlier and the maximum data value that is not an outlier. Identifying Outliers (An outlier is a value that lies very far away from the vast majority of the other values in a data set.) for Modified Boxplots 1. Find the quartiles Q1, Q2, and Q3. 2. Find the interquartile range (IQR), where IQR = Q3 − Q1. 3. Evaluate 1.5 × IQR. 4. In a modified boxplot, a data value is an outlier if it is above Q3, by an amount greater than 1.5 × IQR or below Q1, by an amount greater than 1.5 × IQR. 18 3.3 Measures of Relative Standing and Boxplots Boxplots
  • 19. Example 9 19 Given the fifty (50) data speeds listed, find the 40th percentile, denoted by P40. 0.8 1.4 1.8 1.9 3.2 3.6 4.5 4.5 4.6 6.2 6.5 7.7 7.9 9.9 10.2 10.3 10.9 11.1 11.1 11.6 11.8 12.0 13.1 13.5 13.7 14.1 14.2 14.7 15.0 15.1 15.5 15.8 16.0 17.5 18.2 20.2 21.1 21.5 22.2 22.4 23.1 24.5 25.7 28.5 34.6 38.5 43.0 55.6 71.3 77.8 Converting a Percentile to a Data Value (No need) 100 100 k P L n n    Solution: k = 40, n = 50 100 100 k P L n n    40 50 100   20 Whole Number: The value of the 40th percentile is between the Lth (20th ) value and the 21st value. The 40th percentile is P40 = 11.7 Mbps.
  • 20. Example 10 20 A teacher gives a 20-point test to 10 students. Find the percentile rank of a score of 12. 18, 15, 12, 6, 8, 2, 3, 5, 20, 10 100 n k L  Sort in ascending order. 2, 3, 5, 6, 8, 10, 12, 15, 18, 20 Interpretation: A student whose score was 12 did better than 65% of the class.  # of values below 0.5 100% total # of values X Percentile    6 values 6 0.5 100% 10 65%      # of values below 0.5 Some Texts : 100% total # of values X Percentile   
  • 21. Example 11 21 A teacher gives a 20-point test to 10 students. Find the value corresponding to the 25th percentile. 18, 15, 12, 6, 8, 2, 3, 5, 20, 10 100 n k L  Sort in ascending order. 2, 3, 5, 6, 8, 10, 12, 15, 18, 20 Interpretation: The value 5 corresponds to the 25th percentile. 100 n p c   10 25 2.5 100    3