SlideShare una empresa de Scribd logo
1 de 20
Descargar para leer sin conexión
The 5-number summary,
     boxplots and outliers




9/4/2011                     Slide 1
The 5-number summary
•      An example 5-number summary:
                  5 12 14 17 21
•      5 is the minimum value in the set
•      12 is the first quartile
•      14 is the median
•      17 is the third quartile
•      21 is the maximum value
9/4/2011                        Slide 2
The 5-number summary
• Quartiles?
     – The 1st quartile is the point at which 25% of
       the data is below and 75% above.
     – The 2nd quartile (the MEDIAN) is the point at
       which 50% of the data is below and 50%
       above.
     – The 3rd quartile is the point at which 75% of
       the data is below and 25% above

9/4/2011                           Slide 3
The 5-number summary
•        Back to the example
                           5 12 14 17 21
•        So, if these are a scores on a 22 point quiz from a
         class…
     –     The lowest score in the class was 5 points
     –     25% of students earned 12 or fewer points (75% earned 12 or
           more)
     –     50% of students earned 14 or fewer points (50% earned 14 or
           more) – the median
     –     75% of students earned 17 or fewer points (25% earned 17 or
           more)
     –     The highest score in the class was 21 points

9/4/2011                                      Slide 4
The 5-number summary
• Finding the minimum and maximum is
  pretty simple
• Finding the median was discussed in
  lesson 1.6
• So to find the quartiles…




9/4/2011                  Slide 5
The 5-number summary
• Finding the quartiles
     – To find the 1st quartile, simply find the median
       of the lower half of the data (the lower half of
       the data does NOT include the median of the
       data).
     – To find the 3rd quartile, simply find the median
       of the upper half of the data (the upper half of
       the data does NOT include the median of the
       data).

9/4/2011                            Slide 6
The 5-number summary
                                                     2
• Example of finding
                                                     3    Median of lower
  the quartiles          Lower Half of Data          3    half is 1st
                                                          quartile = 3
                                                     5
                Median is (8+9)/2 or 8.5             8
                                                     9
                                                     9
                                                          Median of
                         Upper Half of Data          12   upper half is 3rd
                                                          quartile = 12
                                                     13
                                                     13



9/4/2011                                   Slide 7
The 5-number summary
• Find the 5-number         17.1        2.1
  summary of the            5.8         2.0
  following data (which
  are salaries (in          5.0         1.0
  millions) of an NBA       4.5         1.0
  team:
                            4.3         0.8
                            4.2         0.7
• Go to the next slide to
  check your work.          3.1         0.3
9/4/2011                      Slide 8
The 5-number summary
• Your answer should be:
                0.3 1.0 2.6 4.5 17.1

• Another measure of spread is found from the 5-number
  summary, the interquartile range (or IQR).
• The IQR is simply the 3rd quartile (Q3) minus the 1st
  quartile (Q1).
• So, IQR = Q3 – Q1
• This is simply a variation on the definition of range.



9/4/2011                             Slide 9
Outliers and extreme values
• The 17.1 million dollar salary is quite high.
• Is it an outlier among the data?

• Tukey’s rule: A data point is an outlier if it
  falls more than 1.5 IQR below the 1st
  quartile OR 1.5 IQR above the 3rd quartile.


9/4/2011                      Slide 10
Outliers and extreme values
• Recall the summary: 0.3 1.0 2.6 4.5 17.1
• The IQR = 4.5 – 1.0 = 3.5
• Check for the high outlier:
     –     Is 17.1 more than 1.5IQR above the 3rd quartile (4.5)?
     –     Is 17.1 > 1.5(3.5) + 4.5 ?
     –     Is 17.1 > 9.75 ?
     –     Yes, so the $17.1 million salary is an outlier on the
           team’s payroll.


9/4/2011                                   Slide 11
Boxplots
• The boxplot displays the 5-number
  summary: minimum, lower quartile (Q1),
  median upper quartile (Q3), maximum.
• It also shows the Inter-quartile range (IQR)
  and outliers.
• It also gives us information about the
  symmetry of the distribution.

9/4/2011                     Slide 12
Example Boxplot
                                                                   The largest
                                                                   observation
                                                                   (max) is
                                                                   approximately
                                                                   4.

The smallest
observation (min)
is approximately
0.


                    The first          The second             The third quartile
                    quartile (Q1) is   quartile (Q2) is       (Q3) is approximately
                    approximately      approximately          2.7.
  9/4/2011          1.                 1.6.        Slide 13
Example Boxplot (modified)
                    Outliers show up as
                    circles. In this case, it
                    is now the max.



                    This is the largest
                    observation that is
                    NOT an out outlier.
9/4/2011                  Slide 14
Boxplots
• Example
     – Verbal GMAT scores of 12 students: 10, 22, 24, 27,
       31, 33, 39, 40, 42, 43, 44, 45
     – The 5-number summary is:
                     10 25.5 36 42.5 45
     – Now the box-plot is constructed as follows:
           •   The line inside the box indicates the median.
           •   The left side of this box indicates the lower quartile (Q1).
           •   The right side of this box indicates the upper quartile (Q3).
           •   A straight line is then drawn from the lowest value of this
               distribution to the box (at Q1) and another straight line from
               the box (at Q3) to the highest value of this distribution.


9/4/2011                                            Slide 15
Boxplot




9/4/2011             Slide 16
Modified Boxplot
• Consider the 5-number summary of NBA
  salaries: 0.3 1.0 2.6 4.5 17.1
• The modified boxplot shows outliers
  separately.
• On the next slide, note how the outlier
  17.1 is plotted separately.
• The line only goes to the maximum (or
  minimum) point that is NOT an outlier.

9/4/2011                   Slide 17
Modified Boxplot




9/4/2011               Slide 18
Using Technology to Make
                   Boxplots
• Use StatCrunch
• On your calculator:
     – To create a boxplot:
           • Enter your list of data.
           • Now select STAT PLOT by pressing the 2nd key followed by
             the key labeled Y=.
           • Press the ENTER key, then select the option of ON, and select
             the boxplot type that shows outliers (as dots)
           • The Xlist should indicate the name of your list (L1 or other),
             and the Freq value should be set to 1.
           • Now press the ZOOM key and select option 9 (ZoomStat).
           • Press the ENTER key and the boxplot should be displayed.
           • You can use the TRACE key along with the arrow keys in order
             to read the values of the five number summary (Min, Q1, Med,
             Q3, Max).

9/4/2011                                         Slide 19
The 5-number summary, boxplots
           and outliers
• This concludes the presentation.




9/4/2011                    Slide 20

Más contenido relacionado

La actualidad más candente

frequency distribution
 frequency distribution frequency distribution
frequency distributionUnsa Shakir
 
Applications of mean ,mode & median
Applications of mean ,mode & medianApplications of mean ,mode & median
Applications of mean ,mode & medianAnagha Deshpande
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statisticsAjendra Sharma
 
Chapter 2: Frequency Distribution and Graphs
Chapter 2: Frequency Distribution and GraphsChapter 2: Frequency Distribution and Graphs
Chapter 2: Frequency Distribution and GraphsMong Mara
 
Lesson 3 - matrix multiplication
Lesson 3 - matrix multiplicationLesson 3 - matrix multiplication
Lesson 3 - matrix multiplicationJonathan Templin
 
Microsoft Office Excel 2003 Sorting And Filtering
Microsoft Office Excel 2003 Sorting And FilteringMicrosoft Office Excel 2003 Sorting And Filtering
Microsoft Office Excel 2003 Sorting And FilteringMarc Morgenstern
 
Dr digs central tendency
Dr digs central tendencyDr digs central tendency
Dr digs central tendencydrdig
 
Mean, median, and mode
Mean, median, and modeMean, median, and mode
Mean, median, and modeguest455435
 
Frequency Distribution
Frequency DistributionFrequency Distribution
Frequency DistributionTeacherMariza
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendencyMmedsc Hahm
 
Frequency distribution 6
Frequency distribution 6Frequency distribution 6
Frequency distribution 6NadeemShoukat3
 
The sampling distribution
The sampling distributionThe sampling distribution
The sampling distributionHarve Abella
 
Numerical Descriptive Measures
Numerical Descriptive MeasuresNumerical Descriptive Measures
Numerical Descriptive MeasuresYesica Adicondro
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis Misab P.T
 

La actualidad más candente (20)

frequency distribution
 frequency distribution frequency distribution
frequency distribution
 
Applications of mean ,mode & median
Applications of mean ,mode & medianApplications of mean ,mode & median
Applications of mean ,mode & median
 
Frequency Distributions
Frequency DistributionsFrequency Distributions
Frequency Distributions
 
Basic Descriptive statistics
Basic Descriptive statisticsBasic Descriptive statistics
Basic Descriptive statistics
 
Chapter 2: Frequency Distribution and Graphs
Chapter 2: Frequency Distribution and GraphsChapter 2: Frequency Distribution and Graphs
Chapter 2: Frequency Distribution and Graphs
 
Lesson 3 - matrix multiplication
Lesson 3 - matrix multiplicationLesson 3 - matrix multiplication
Lesson 3 - matrix multiplication
 
Microsoft Office Excel 2003 Sorting And Filtering
Microsoft Office Excel 2003 Sorting And FilteringMicrosoft Office Excel 2003 Sorting And Filtering
Microsoft Office Excel 2003 Sorting And Filtering
 
Dr digs central tendency
Dr digs central tendencyDr digs central tendency
Dr digs central tendency
 
Mean, median, and mode
Mean, median, and modeMean, median, and mode
Mean, median, and mode
 
Frequency Distribution
Frequency DistributionFrequency Distribution
Frequency Distribution
 
Introduction to Descriptive Statistics
Introduction to Descriptive StatisticsIntroduction to Descriptive Statistics
Introduction to Descriptive Statistics
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Frequency distribution 6
Frequency distribution 6Frequency distribution 6
Frequency distribution 6
 
The sampling distribution
The sampling distributionThe sampling distribution
The sampling distribution
 
Sorting
SortingSorting
Sorting
 
Quartile
QuartileQuartile
Quartile
 
Numerical Descriptive Measures
Numerical Descriptive MeasuresNumerical Descriptive Measures
Numerical Descriptive Measures
 
Correlation analysis
Correlation analysis Correlation analysis
Correlation analysis
 
Variability
VariabilityVariability
Variability
 
Ppt on ms excel
Ppt on ms excelPpt on ms excel
Ppt on ms excel
 

Destacado

Lesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number SummaryLesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number Summarychrismac47
 
Finding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram PolygonFinding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram PolygonMoonie Kim
 
Finding the Mean from Dot Plot
Finding the Mean from Dot PlotFinding the Mean from Dot Plot
Finding the Mean from Dot PlotMoonie Kim
 
Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1Moonie Kim
 
Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2Moonie Kim
 
Box and whiskers power point
Box and whiskers power pointBox and whiskers power point
Box and whiskers power pointmanswag123
 
Inter quartile range
Inter quartile rangeInter quartile range
Inter quartile rangeKen Plummer
 
Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1Moonie Kim
 
Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2Moonie Kim
 
Further3 summarising univariate data
Further3  summarising univariate dataFurther3  summarising univariate data
Further3 summarising univariate datakmcmullen
 
Further4 box plots, 5 number summary and outliers
Further4  box plots, 5 number summary and outliersFurther4  box plots, 5 number summary and outliers
Further4 box plots, 5 number summary and outlierskmcmullen
 
Finding the Mean Introduction
Finding the Mean IntroductionFinding the Mean Introduction
Finding the Mean IntroductionMoonie Kim
 
Managing Investment in Employees Strategically
Managing Investment in Employees StrategicallyManaging Investment in Employees Strategically
Managing Investment in Employees StrategicallyPat Wright
 
Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent CBIZ, Inc.
 
Stem and-leaf plots
Stem and-leaf plotsStem and-leaf plots
Stem and-leaf plotsValPatton
 
Stem and leaf plot
Stem and leaf plotStem and leaf plot
Stem and leaf plotbbeiers
 

Destacado (20)

Lesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number SummaryLesson 1-4 -- Five Number Summary
Lesson 1-4 -- Five Number Summary
 
Finding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram PolygonFinding Interquartile Range from Cumulative Frequency Histogram Polygon
Finding Interquartile Range from Cumulative Frequency Histogram Polygon
 
Finding the Mean from Dot Plot
Finding the Mean from Dot PlotFinding the Mean from Dot Plot
Finding the Mean from Dot Plot
 
Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1Finding Interquartile Range from Stem-Leaf Plot 1
Finding Interquartile Range from Stem-Leaf Plot 1
 
Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2Finding Interquartile Range from Stem-Leaf Plot 2
Finding Interquartile Range from Stem-Leaf Plot 2
 
Box and whiskers power point
Box and whiskers power pointBox and whiskers power point
Box and whiskers power point
 
Inter quartile range
Inter quartile rangeInter quartile range
Inter quartile range
 
Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1Finding Interquartile Range from Dot Plot 1
Finding Interquartile Range from Dot Plot 1
 
Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2Finding Interquartile Range from Dot Plot 2
Finding Interquartile Range from Dot Plot 2
 
HISTOGRAMS
HISTOGRAMSHISTOGRAMS
HISTOGRAMS
 
Further3 summarising univariate data
Further3  summarising univariate dataFurther3  summarising univariate data
Further3 summarising univariate data
 
Further4 box plots, 5 number summary and outliers
Further4  box plots, 5 number summary and outliersFurther4  box plots, 5 number summary and outliers
Further4 box plots, 5 number summary and outliers
 
Finding the Mean Introduction
Finding the Mean IntroductionFinding the Mean Introduction
Finding the Mean Introduction
 
Managing Investment in Employees Strategically
Managing Investment in Employees StrategicallyManaging Investment in Employees Strategically
Managing Investment in Employees Strategically
 
Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent Creative Compensation Strategies to Maintain Morale & Retain Talent
Creative Compensation Strategies to Maintain Morale & Retain Talent
 
Stem and-leaf plots
Stem and-leaf plotsStem and-leaf plots
Stem and-leaf plots
 
Stem and leaf plot
Stem and leaf plotStem and leaf plot
Stem and leaf plot
 
Biostatics ppt
Biostatics pptBiostatics ppt
Biostatics ppt
 
Stats
StatsStats
Stats
 
Cost curve
Cost curveCost curve
Cost curve
 

Similar a 5-Number Summary, Boxplots and Outliers Analysis

Box and whisker plots with five number summary
Box and whisker plots with five number summaryBox and whisker plots with five number summary
Box and whisker plots with five number summaryLearnbay Datascience
 
quartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptquartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptSyedSaifUrRehman3
 
Lecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxLecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxABCraftsman
 
Revisionf2
Revisionf2Revisionf2
Revisionf2wind12
 
Statistics and probability lec006 part 1
Statistics and probability lec006 part 1Statistics and probability lec006 part 1
Statistics and probability lec006 part 1TieeTiee
 
Rt graphical representation
Rt graphical representationRt graphical representation
Rt graphical representationRinchen
 
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docxTSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docxnanamonkton
 
Statistics Slides.pdf
Statistics Slides.pdfStatistics Slides.pdf
Statistics Slides.pdfYasirAli74993
 
ap_stat_1.3.ppt
ap_stat_1.3.pptap_stat_1.3.ppt
ap_stat_1.3.pptfghgjd
 
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptxCHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptxBeverlyAmoraSerada
 
De vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 novemberDe vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 novemberlenasour
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)CS, NcState
 
analytical representation of data
 analytical representation of data analytical representation of data
analytical representation of dataUnsa Shakir
 
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptxmeasures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptxRonnelLozano
 
De vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 novemberDe vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 novemberlenasour
 
Presentationofdata 120111034007-phpapp02
Presentationofdata 120111034007-phpapp02Presentationofdata 120111034007-phpapp02
Presentationofdata 120111034007-phpapp02Alex Robianes Hernandez
 

Similar a 5-Number Summary, Boxplots and Outliers Analysis (20)

Bab 4.ppt
Bab 4.pptBab 4.ppt
Bab 4.ppt
 
Chap004
Chap004Chap004
Chap004
 
Chap004
Chap004Chap004
Chap004
 
Box and whisker plots with five number summary
Box and whisker plots with five number summaryBox and whisker plots with five number summary
Box and whisker plots with five number summary
 
quartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.pptquartiles,deciles,percentiles.ppt
quartiles,deciles,percentiles.ppt
 
Lecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptxLecture 1 Descriptives.pptx
Lecture 1 Descriptives.pptx
 
Revisionf2
Revisionf2Revisionf2
Revisionf2
 
Statistics and probability lec006 part 1
Statistics and probability lec006 part 1Statistics and probability lec006 part 1
Statistics and probability lec006 part 1
 
Rt graphical representation
Rt graphical representationRt graphical representation
Rt graphical representation
 
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docxTSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
TSTD 6251  Fall 2014SPSS Exercise and Assignment 120 PointsI.docx
 
Statistics Slides.pdf
Statistics Slides.pdfStatistics Slides.pdf
Statistics Slides.pdf
 
Chap004.ppt
Chap004.pptChap004.ppt
Chap004.ppt
 
ap_stat_1.3.ppt
ap_stat_1.3.pptap_stat_1.3.ppt
ap_stat_1.3.ppt
 
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptxCHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
CHAPTER 3: FREQUENCY DISTRIBUTION ..pptx
 
De vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 novemberDe vry math 221 all ilabs latest 2016 november
De vry math 221 all ilabs latest 2016 november
 
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)Next Generation “Treatment Learning” (finding the diamonds in the dust)
Next Generation “Treatment Learning” (finding the diamonds in the dust)
 
analytical representation of data
 analytical representation of data analytical representation of data
analytical representation of data
 
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptxmeasures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
measures-of-position-for-ungrouped-data_MAth 10_Part1.pptx
 
De vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 novemberDe vry math221 all ilabs latest 2016 november
De vry math221 all ilabs latest 2016 november
 
Presentationofdata 120111034007-phpapp02
Presentationofdata 120111034007-phpapp02Presentationofdata 120111034007-phpapp02
Presentationofdata 120111034007-phpapp02
 

Último

Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 

Último (20)

Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 

5-Number Summary, Boxplots and Outliers Analysis

  • 1. The 5-number summary, boxplots and outliers 9/4/2011 Slide 1
  • 2. The 5-number summary • An example 5-number summary: 5 12 14 17 21 • 5 is the minimum value in the set • 12 is the first quartile • 14 is the median • 17 is the third quartile • 21 is the maximum value 9/4/2011 Slide 2
  • 3. The 5-number summary • Quartiles? – The 1st quartile is the point at which 25% of the data is below and 75% above. – The 2nd quartile (the MEDIAN) is the point at which 50% of the data is below and 50% above. – The 3rd quartile is the point at which 75% of the data is below and 25% above 9/4/2011 Slide 3
  • 4. The 5-number summary • Back to the example 5 12 14 17 21 • So, if these are a scores on a 22 point quiz from a class… – The lowest score in the class was 5 points – 25% of students earned 12 or fewer points (75% earned 12 or more) – 50% of students earned 14 or fewer points (50% earned 14 or more) – the median – 75% of students earned 17 or fewer points (25% earned 17 or more) – The highest score in the class was 21 points 9/4/2011 Slide 4
  • 5. The 5-number summary • Finding the minimum and maximum is pretty simple • Finding the median was discussed in lesson 1.6 • So to find the quartiles… 9/4/2011 Slide 5
  • 6. The 5-number summary • Finding the quartiles – To find the 1st quartile, simply find the median of the lower half of the data (the lower half of the data does NOT include the median of the data). – To find the 3rd quartile, simply find the median of the upper half of the data (the upper half of the data does NOT include the median of the data). 9/4/2011 Slide 6
  • 7. The 5-number summary 2 • Example of finding 3 Median of lower the quartiles Lower Half of Data 3 half is 1st quartile = 3 5 Median is (8+9)/2 or 8.5 8 9 9 Median of Upper Half of Data 12 upper half is 3rd quartile = 12 13 13 9/4/2011 Slide 7
  • 8. The 5-number summary • Find the 5-number 17.1 2.1 summary of the 5.8 2.0 following data (which are salaries (in 5.0 1.0 millions) of an NBA 4.5 1.0 team: 4.3 0.8 4.2 0.7 • Go to the next slide to check your work. 3.1 0.3 9/4/2011 Slide 8
  • 9. The 5-number summary • Your answer should be: 0.3 1.0 2.6 4.5 17.1 • Another measure of spread is found from the 5-number summary, the interquartile range (or IQR). • The IQR is simply the 3rd quartile (Q3) minus the 1st quartile (Q1). • So, IQR = Q3 – Q1 • This is simply a variation on the definition of range. 9/4/2011 Slide 9
  • 10. Outliers and extreme values • The 17.1 million dollar salary is quite high. • Is it an outlier among the data? • Tukey’s rule: A data point is an outlier if it falls more than 1.5 IQR below the 1st quartile OR 1.5 IQR above the 3rd quartile. 9/4/2011 Slide 10
  • 11. Outliers and extreme values • Recall the summary: 0.3 1.0 2.6 4.5 17.1 • The IQR = 4.5 – 1.0 = 3.5 • Check for the high outlier: – Is 17.1 more than 1.5IQR above the 3rd quartile (4.5)? – Is 17.1 > 1.5(3.5) + 4.5 ? – Is 17.1 > 9.75 ? – Yes, so the $17.1 million salary is an outlier on the team’s payroll. 9/4/2011 Slide 11
  • 12. Boxplots • The boxplot displays the 5-number summary: minimum, lower quartile (Q1), median upper quartile (Q3), maximum. • It also shows the Inter-quartile range (IQR) and outliers. • It also gives us information about the symmetry of the distribution. 9/4/2011 Slide 12
  • 13. Example Boxplot The largest observation (max) is approximately 4. The smallest observation (min) is approximately 0. The first The second The third quartile quartile (Q1) is quartile (Q2) is (Q3) is approximately approximately approximately 2.7. 9/4/2011 1. 1.6. Slide 13
  • 14. Example Boxplot (modified) Outliers show up as circles. In this case, it is now the max. This is the largest observation that is NOT an out outlier. 9/4/2011 Slide 14
  • 15. Boxplots • Example – Verbal GMAT scores of 12 students: 10, 22, 24, 27, 31, 33, 39, 40, 42, 43, 44, 45 – The 5-number summary is: 10 25.5 36 42.5 45 – Now the box-plot is constructed as follows: • The line inside the box indicates the median. • The left side of this box indicates the lower quartile (Q1). • The right side of this box indicates the upper quartile (Q3). • A straight line is then drawn from the lowest value of this distribution to the box (at Q1) and another straight line from the box (at Q3) to the highest value of this distribution. 9/4/2011 Slide 15
  • 16. Boxplot 9/4/2011 Slide 16
  • 17. Modified Boxplot • Consider the 5-number summary of NBA salaries: 0.3 1.0 2.6 4.5 17.1 • The modified boxplot shows outliers separately. • On the next slide, note how the outlier 17.1 is plotted separately. • The line only goes to the maximum (or minimum) point that is NOT an outlier. 9/4/2011 Slide 17
  • 19. Using Technology to Make Boxplots • Use StatCrunch • On your calculator: – To create a boxplot: • Enter your list of data. • Now select STAT PLOT by pressing the 2nd key followed by the key labeled Y=. • Press the ENTER key, then select the option of ON, and select the boxplot type that shows outliers (as dots) • The Xlist should indicate the name of your list (L1 or other), and the Freq value should be set to 1. • Now press the ZOOM key and select option 9 (ZoomStat). • Press the ENTER key and the boxplot should be displayed. • You can use the TRACE key along with the arrow keys in order to read the values of the five number summary (Min, Q1, Med, Q3, Max). 9/4/2011 Slide 19
  • 20. The 5-number summary, boxplots and outliers • This concludes the presentation. 9/4/2011 Slide 20