SlideShare una empresa de Scribd logo
1 de 11
Basics of Statistics
1
2
Definition
•Descriptive statistics are used to describe the basic features of the data in a study.
They provide simple summaries about the sample and the measures. Together with
simple graphics analysis, they form the basis of virtually every quantitative
analysis of data.
•Descriptive statistics are brief descriptive coefficients that summarize a given
data set, which can be either a representation of the entire population or a sample
of it. Descriptive statistics are broken down into measures of central tendency and
measures of variability, or spread.
•Measures of variability or spread include the standard deviation (or variance), the
minimum and maximum values of the variables, kurtosis and skewness.
•Descriptive statistics are either quantitative (summary statistics) or visual (simple
graphs)
•Descriptive statistics are limited in so much that they only allow you to make
summations about the people or objects that you have actually measured. You
cannot use the data you have collected to generalize to other people or objects
(i.e., using data from a sample to infer the properties/parameters of a population).
Use in Statistical analysis
Univariate analysis
• It describes the distribution of a single variable.
• It includes central tendency (mean, median and mode), dispersion (range and quantiles) and
spread (variance and standard deviation).
• Distribution is also studied using skewness and kurtosis. They can be graphically represented by
histograms.
Bi- and multivariate
• Bivariate analysis is the simultaneous analysis of two variables (attributes)
• Explores the concept of relationship between two variables, whether there exists an association
and the strength of this association, or whether there are differences between two variables and
the significance of these differences.
3
Maximum and Minimum
• Minimum is the smallest value in the data set. This number is the data value that is less than or
equal to all other values in our set of data
• Maximum is the largest value in the dataset. This number is the data value that is greater than or
equal to all other values in our set of data
• The maximum and minimum provide good examples of the type of descriptive statistic that is easy
to marginalize. Despite these two numbers being extremely easy to determine, they make
appearances in the calculation of other descriptive statistics
Uses:
• Both maximum and minimum is used to calculate the range
4
Mean
• Mean can’t consider when there is a huge
eg:- Mean for the salaried employees across
all position in the organization
• Mean can’t be used in categorical data
• Mean" is the "average" , where we add up
all the numbers and then divide by the
number of numbers
• Mean will say the average value of the
particular variable
Applicable variable type :
Interval and Ratio level data
5
Median
• The Median of the Dataset is dependent on whether the number of
elements in the dataset is odd or even
• If there is even number of of dataset, add the Centre two values and
divide by two
Applicable variable type :
Ordinal and Interval level data
6
Mode
• The “Mode” for a dataset is the element that occurs the most often
• When we have huge difference in datasets this Mode measure is used
, and used for the Categorical data
Applicable variable type :
Nominal, Ordinal and Interval level data
7
Range
• The Range is the difference between the lowest and highest
values
• The range can sometimes be misleading when there are
extremely high or low values
• Range is used to find the Maximum and minimum value in the
Dataset
8
Quartiles
Definition:
Quartiles are measures of central tendency that divide
a group of data into four subgroups or parts. The three
quartiles are denoted as Q1, Q2, and Q3.
Explanation:
• The first quartile, Q1, separates the first, or lowest,
one-fourth of the data from the upper three-fourths
and is equal to the 25th percentile.
• The second quartile, Q2, separates the second
quarter of the data from the third quarter. Q2 is
located at the 50th percentile and equals the median
of the data.
• The third quartile, Q3, divides the first three-
quarters of the data from the last quarter and is
equal to the value of the 75th percentile
Applicable variable type :
Ordinal level data
9
Skewness
• Skewness is a measure of symmetry. If the
skewness of S is zero then the distribution
represented by S is perfectly symmetric. If the
skewness is negative, then the distribution is
skewed to the left, while if the skew is positive
then the distribution is skewed to the right
• Skewness tells us about the direction of variation
of the data set
• Skewness is a measure that studies the degree and
direction of departure from symmetry
Interpretation:
If skewness is equal to zero distribution is normal
If skewness is greater than zero it’s Positive
skewness
If skewness is less than zero it’s Negative skewness
10
Kurtosis
• Kurtosis is a statistical measure that's used to describe the
distribution, or skewness, of observed data around the mean,
sometimes referred to as the volatility of volatility
• Kurtosis is used generally in the statistical field to describes trends
in charts. Kurtosis can be present in a chart with fat tails and a low,
even distribution, as well as be present in a chart with skinny tails
and a distribution concentrated toward the mean
• Kurtosis is one or more symmetrical distributions are
compared, the difference in them are studied with ‘Kurtosis’
11

Más contenido relacionado

La actualidad más candente

Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central TendencySharmin_Abeer
 
Normal Curve in Total Quality Management
Normal Curve in Total Quality ManagementNormal Curve in Total Quality Management
Normal Curve in Total Quality ManagementDr.Raja R
 
Statistics for machine learning shifa noorulain
Statistics for machine learning   shifa noorulainStatistics for machine learning   shifa noorulain
Statistics for machine learning shifa noorulainShifaNoorUlAin1
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)jasondeveau
 
Choosing the best measure of central tendency
Choosing the best measure of central tendencyChoosing the best measure of central tendency
Choosing the best measure of central tendencybujols
 
Measures of central tendency dispersion
Measures of central tendency dispersionMeasures of central tendency dispersion
Measures of central tendency dispersionAbhinav yadav
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendencyAlex Chris
 
Measure of central tendency(0039)
Measure of central tendency(0039)Measure of central tendency(0039)
Measure of central tendency(0039)Irfan Hussain
 
Quantitative data analysis
Quantitative data analysisQuantitative data analysis
Quantitative data analysisatrantham
 
Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central TendencyAurus Network
 
15. descriptive statistics
15. descriptive statistics15. descriptive statistics
15. descriptive statisticsAshok Kulkarni
 
STATISTICAL PROCEDURES (Discriptive Statistics).pptx
STATISTICAL PROCEDURES (Discriptive Statistics).pptxSTATISTICAL PROCEDURES (Discriptive Statistics).pptx
STATISTICAL PROCEDURES (Discriptive Statistics).pptxMuhammadNafees42
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of datajennytuazon01630
 
Central Tendency and types
Central Tendency and typesCentral Tendency and types
Central Tendency and typesArRaja4
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statisticsAnand Thokal
 

La actualidad más candente (19)

Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central Tendency
 
Normal Curve in Total Quality Management
Normal Curve in Total Quality ManagementNormal Curve in Total Quality Management
Normal Curve in Total Quality Management
 
Statistics for machine learning shifa noorulain
Statistics for machine learning   shifa noorulainStatistics for machine learning   shifa noorulain
Statistics for machine learning shifa noorulain
 
Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)Torturing numbers - Descriptive Statistics for Growers (2013)
Torturing numbers - Descriptive Statistics for Growers (2013)
 
Choosing the best measure of central tendency
Choosing the best measure of central tendencyChoosing the best measure of central tendency
Choosing the best measure of central tendency
 
Measures of central tendency dispersion
Measures of central tendency dispersionMeasures of central tendency dispersion
Measures of central tendency dispersion
 
Descriptive statistics -review(2)
Descriptive statistics -review(2)Descriptive statistics -review(2)
Descriptive statistics -review(2)
 
Descriptive statistics i
Descriptive statistics iDescriptive statistics i
Descriptive statistics i
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Measure of central tendency(0039)
Measure of central tendency(0039)Measure of central tendency(0039)
Measure of central tendency(0039)
 
Quantitative data analysis
Quantitative data analysisQuantitative data analysis
Quantitative data analysis
 
Measure of Central Tendency
Measure of Central TendencyMeasure of Central Tendency
Measure of Central Tendency
 
15. descriptive statistics
15. descriptive statistics15. descriptive statistics
15. descriptive statistics
 
STATISTICAL PROCEDURES (Discriptive Statistics).pptx
STATISTICAL PROCEDURES (Discriptive Statistics).pptxSTATISTICAL PROCEDURES (Discriptive Statistics).pptx
STATISTICAL PROCEDURES (Discriptive Statistics).pptx
 
Quants
QuantsQuants
Quants
 
Graphical presentation of data
Graphical presentation of dataGraphical presentation of data
Graphical presentation of data
 
Central Tendency and types
Central Tendency and typesCentral Tendency and types
Central Tendency and types
 
Descriptive statistics
Descriptive statisticsDescriptive statistics
Descriptive statistics
 
Lec 13
Lec 13Lec 13
Lec 13
 

Similar a Basic statisctis -Anandh Shankar

Chapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersChapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersInternational advisers
 
measures of central tendency.pptx
measures of central tendency.pptxmeasures of central tendency.pptx
measures of central tendency.pptxManish Agarwal
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxSailajaReddyGunnam
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendencyMmedsc Hahm
 
Measure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptxMeasure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptxSandeAlotaBoco
 
2. chapter ii(analyz)
2. chapter ii(analyz)2. chapter ii(analyz)
2. chapter ii(analyz)Chhom Karath
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptxIndhuGreen
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...EqraBaig
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docxcullenrjzsme
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptxjeyanthisivakumar
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptDiptoKumerSarker1
 
Medical Statistics.ppt
Medical Statistics.pptMedical Statistics.ppt
Medical Statistics.pptssuserf0d95a
 

Similar a Basic statisctis -Anandh Shankar (20)

Chapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index NumbersChapter 12 Data Analysis Descriptive Methods and Index Numbers
Chapter 12 Data Analysis Descriptive Methods and Index Numbers
 
measures of central tendency.pptx
measures of central tendency.pptxmeasures of central tendency.pptx
measures of central tendency.pptx
 
Descriptive Analysis.pptx
Descriptive Analysis.pptxDescriptive Analysis.pptx
Descriptive Analysis.pptx
 
Biostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptxBiostatistics mean median mode unit 1.pptx
Biostatistics mean median mode unit 1.pptx
 
STATISTICS.pptx
STATISTICS.pptxSTATISTICS.pptx
STATISTICS.pptx
 
Statr sessions 4 to 6
Statr sessions 4 to 6Statr sessions 4 to 6
Statr sessions 4 to 6
 
SUMMARY MEASURES.pdf
SUMMARY MEASURES.pdfSUMMARY MEASURES.pdf
SUMMARY MEASURES.pdf
 
Measures of central tendency
Measures of central tendencyMeasures of central tendency
Measures of central tendency
 
Measure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptxMeasure of central tendency grouped data.pptx
Measure of central tendency grouped data.pptx
 
2. chapter ii(analyz)
2. chapter ii(analyz)2. chapter ii(analyz)
2. chapter ii(analyz)
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Statistics
StatisticsStatistics
Statistics
 
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
Descriptive Statistics: Measures of Central Tendency - Measures of Dispersion...
 
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docxANALYSIS ANDINTERPRETATION  OF DATA Analysis and Interpr.docx
ANALYSIS ANDINTERPRETATION OF DATA Analysis and Interpr.docx
 
3. Statistical Analysis.pptx
3. Statistical Analysis.pptx3. Statistical Analysis.pptx
3. Statistical Analysis.pptx
 
Statistics four
Statistics fourStatistics four
Statistics four
 
1 introduction to psychological statistics
1 introduction to psychological statistics1 introduction to psychological statistics
1 introduction to psychological statistics
 
R training4
R training4R training4
R training4
 
Stats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.pptStats-Review-Maie-St-John-5-20-2009.ppt
Stats-Review-Maie-St-John-5-20-2009.ppt
 
Medical Statistics.ppt
Medical Statistics.pptMedical Statistics.ppt
Medical Statistics.ppt
 

Último

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
detection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxdetection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxAleenaJamil4
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 

Último (20)

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
detection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptxdetection and classification of knee osteoarthritis.pptx
detection and classification of knee osteoarthritis.pptx
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 

Basic statisctis -Anandh Shankar

  • 2. 2 Definition •Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple graphics analysis, they form the basis of virtually every quantitative analysis of data. •Descriptive statistics are brief descriptive coefficients that summarize a given data set, which can be either a representation of the entire population or a sample of it. Descriptive statistics are broken down into measures of central tendency and measures of variability, or spread. •Measures of variability or spread include the standard deviation (or variance), the minimum and maximum values of the variables, kurtosis and skewness. •Descriptive statistics are either quantitative (summary statistics) or visual (simple graphs) •Descriptive statistics are limited in so much that they only allow you to make summations about the people or objects that you have actually measured. You cannot use the data you have collected to generalize to other people or objects (i.e., using data from a sample to infer the properties/parameters of a population).
  • 3. Use in Statistical analysis Univariate analysis • It describes the distribution of a single variable. • It includes central tendency (mean, median and mode), dispersion (range and quantiles) and spread (variance and standard deviation). • Distribution is also studied using skewness and kurtosis. They can be graphically represented by histograms. Bi- and multivariate • Bivariate analysis is the simultaneous analysis of two variables (attributes) • Explores the concept of relationship between two variables, whether there exists an association and the strength of this association, or whether there are differences between two variables and the significance of these differences. 3
  • 4. Maximum and Minimum • Minimum is the smallest value in the data set. This number is the data value that is less than or equal to all other values in our set of data • Maximum is the largest value in the dataset. This number is the data value that is greater than or equal to all other values in our set of data • The maximum and minimum provide good examples of the type of descriptive statistic that is easy to marginalize. Despite these two numbers being extremely easy to determine, they make appearances in the calculation of other descriptive statistics Uses: • Both maximum and minimum is used to calculate the range 4
  • 5. Mean • Mean can’t consider when there is a huge eg:- Mean for the salaried employees across all position in the organization • Mean can’t be used in categorical data • Mean" is the "average" , where we add up all the numbers and then divide by the number of numbers • Mean will say the average value of the particular variable Applicable variable type : Interval and Ratio level data 5
  • 6. Median • The Median of the Dataset is dependent on whether the number of elements in the dataset is odd or even • If there is even number of of dataset, add the Centre two values and divide by two Applicable variable type : Ordinal and Interval level data 6
  • 7. Mode • The “Mode” for a dataset is the element that occurs the most often • When we have huge difference in datasets this Mode measure is used , and used for the Categorical data Applicable variable type : Nominal, Ordinal and Interval level data 7
  • 8. Range • The Range is the difference between the lowest and highest values • The range can sometimes be misleading when there are extremely high or low values • Range is used to find the Maximum and minimum value in the Dataset 8
  • 9. Quartiles Definition: Quartiles are measures of central tendency that divide a group of data into four subgroups or parts. The three quartiles are denoted as Q1, Q2, and Q3. Explanation: • The first quartile, Q1, separates the first, or lowest, one-fourth of the data from the upper three-fourths and is equal to the 25th percentile. • The second quartile, Q2, separates the second quarter of the data from the third quarter. Q2 is located at the 50th percentile and equals the median of the data. • The third quartile, Q3, divides the first three- quarters of the data from the last quarter and is equal to the value of the 75th percentile Applicable variable type : Ordinal level data 9
  • 10. Skewness • Skewness is a measure of symmetry. If the skewness of S is zero then the distribution represented by S is perfectly symmetric. If the skewness is negative, then the distribution is skewed to the left, while if the skew is positive then the distribution is skewed to the right • Skewness tells us about the direction of variation of the data set • Skewness is a measure that studies the degree and direction of departure from symmetry Interpretation: If skewness is equal to zero distribution is normal If skewness is greater than zero it’s Positive skewness If skewness is less than zero it’s Negative skewness 10
  • 11. Kurtosis • Kurtosis is a statistical measure that's used to describe the distribution, or skewness, of observed data around the mean, sometimes referred to as the volatility of volatility • Kurtosis is used generally in the statistical field to describes trends in charts. Kurtosis can be present in a chart with fat tails and a low, even distribution, as well as be present in a chart with skinny tails and a distribution concentrated toward the mean • Kurtosis is one or more symmetrical distributions are compared, the difference in them are studied with ‘Kurtosis’ 11