SlideShare una empresa de Scribd logo
1 de 24
1
Basics of Statistics
A Quick reference to the basics of
statistics
by
Shiva Dasharathi
2
1. Population & Sample
2. Descriptive Statistics
1. Central tendency
2. Spread (or ) Dispersion
3. Normal Distribution
4. Central Limit Theorem
5. Correlation, covariance
6. Probability
7. Statistical tests & Significance level
8. Random Variables
9. Other Distributions
Contents
3
Population & Sample
Population: is the complete set of items of a characteristic of given
subject of analysis
Sample: A sample is a part of the population under study selected
so that inferences can be drawn from it about the population
Primary data: Data collected for actual analysis of the subject
Secondary Data: Already collected data used for analyzing other
factors of the subject
Measurement scales :
Nominal, ordinal, interval, ratio
4
Descriptive statistics
Central Tendency: Mean
Mean: is the average value
Arithmetic Mean
Geometric Mean
Harmonic Mean
N
xi∑=
N ix∏=
∑=
= N
i ix
N
1
1
5
Descriptive statistics
Central Tendency: Median
Median: is the middle term in an ordered set
: if ‘N’ is odd number
=
: if ‘N’ is even number
Ex: S = {1,4,7,7,9,11,23}
median of S = 4th
term of set
= 7
xN
2
1+
{ 2
2
1
2
xx NN ++
6
Descriptive statistics
Central Tendency: Mode
Mode: = High frequency term
Ex: S = {1,4,11,4,3,9,3,5,7,5,3}
mode of S = 3
7
Descriptive statistics
Spread:
Variance of a sample
Standard Deviation =
( )
1
2
2
−
−
=
∑
N
X µ
σ
Varianceσ
8
Descriptive statistics
for the given sample S = {1, 2,5,7,9,11,12,15,}
Min:1
Max:15
Range = Max – Min
= 15 -1 = 14
IQR: Inter Quartile Range
= Q3 - Q1
= 11.5 – 3.5 Q3 = (11+12)/2 = 11.5
= 8 Q1 = (2+5)/2 = 3.5
Q1: middle term in the 1st
half of the ordered set
Q2: Median
Q3: middle term in the 2nd
half of the ordered set
{
9
Normal Distribution
Normal (or) Gaussian Distribution:
- Is a bell curve with Mean at the center and
- 68% data distributed in 1 Standard Deviation range
- 95.4% data distributed in 2 Standard Deviations range
- 99.7% data distributed in 3 Standard Deviations range
),( σµN
10
Normal Distribution
In a perfect Normal Distribution,
1. parameters Mean, Median, Mode are aligned
2. data is symmetrically distributed around these parameters
),( σµN
11
Standardized Normal Distribution
)1,0(N
To standardize the distribution,
Step 1. center the mean (minus the mean from all the data points)
Step 2. divide each value by the standard deviation
Standardized Normal Distribution:
Isa normal distribution with mean=0 and standard
deviation=1
12
Skewness
Skewness: indicates the symmetry of the distribution
13
Central Limit Theorem
),(
n
N
σ
µ
pµµ =
)
n
pσ
σ = 




:
:
:
n
p
p
σ
µ
14
Correlation
Correlation: Cor(X,Y) = [-1,1]
-Correlation indicates the strength of association
between two variable
- it ranges from -1 to +1
cor(x,y ) = 1 // being perfectly positively correlated
cor(x,y) = -1 // being perfectly negatively correlated
cor(x,y) = 0 // being independent
Cor(X,Y) =
15
Correlation cont..
Perfect correlation
Weak correlation
16
Covariance
Covariance: is a measure of how two variables are changing
together.
- So correlation is the covariance normalized by the standard
deviation of the 2 variables
17
Probability
Terminology:
Sample space: set of all possible outcomes of an experiment
Event: is a subset of sample space
Probability: is the measure of the likeliness that an event will
occur among all the possible outcomes.
Probability of ‘event x’ =
Example:
Event: tossing a fair coin
Sample Space = {HEAD,TAIL}
Probability of ‘Head’ = P(HEAD) = ½ = 0.5
Probability of ‘Tail’ = P(TAIL) = ½ = 0.5
salloutcome
likelyness
xp =)(
18
Probability cont..
Axioms of probability
1.
2. ,
3. Sum of all the probabilities of events in a sample space
is equal to 1
4.
]1,0[)( =xp
)(xpc
)(1)( xpxpc −=
)(xp are mutually exclusive if,
1..)()( 21 =++ xpxp
)()()()( BAPBPAPBAP ∩−+=∪
19
Conditional Probability
Conditional Probability: is the probability of
event A given event B occurred
Where,
: Probability of A conditioned on B
: Joint probability of A and B
: Probability of B
for independent events A, B
)(
)(
)/(
BP
BAP
BAP
∩
=





∩
)(
)(
)/(
BP
BAP
BAP
)(*)()( BPAPBAP =∩
)(
)(
)(*)(
)/( AP
BP
BPAP
BAP ==
∴
20
Bayes Rule
Bayes Rule: Posterior * evidence = Prior * likelihood
from conditional probability,
)(*)/()(*)/(
)(*)/()(
)(
)(
)/(
)(*)/()(
)(
)(
)/(
APABPBPBAP
APABPBAP
AP
BAP
ABP
BPBAPBAP
BP
BAP
BAP
=
=∩⇒
∩
=
=∩⇒
∩
=

)(
)/(*)(
)/(
BP
ABPBP
BAP =
21
Naïve Bayesian
Naïve Bayesian: Bayes Rule + Conditional independence
If ‘E’ is the event conditioned on x1,x2, .. xn, then
by applying the chain rule & independence condition we
get the final equation
∑=
= n
i
ii
n
n
xPx
ExPExPExPEP
xxxEP
1
*
21
2,1
)(
)/(..*)/(*)/(*)(
),../(
)/(...*)/(*)/(*)( 21 ExPExPExPEP n),../( 2,1 nxxxEP α
∏=
n
i
i ExPEP
1
)/(*)(α
22
Statistical tests & Significance level
t-test:
z-test:
Chi-square test:
ANOVA:
Factor ANOVA:
H0 – Null Hypothesis
H1 – Alternative Hypothesis
p-value (significance level, for Null hypothesis)
Type -1 error
Type-2 error
23
Random Variable
Random Variables & Probability Distribution:
Discrete:
Probability mass Function
Cumulative mass function
Continuous
Probability Density Function
Cumulative density function
24
Few other Distributions
Bernoulli distribution:
Binomial Distribution:
Poisson Distribution:

Más contenido relacionado

La actualidad más candente

Presentation on "Measure of central tendency"
Presentation on "Measure of central tendency"Presentation on "Measure of central tendency"
Presentation on "Measure of central tendency"muhammad raza
 
Stat 4 the normal distribution & steps of testing hypothesis
Stat 4 the normal distribution & steps of testing hypothesisStat 4 the normal distribution & steps of testing hypothesis
Stat 4 the normal distribution & steps of testing hypothesisForensic Pathology
 
Central tendency and Variation or Dispersion
Central tendency and Variation or DispersionCentral tendency and Variation or Dispersion
Central tendency and Variation or DispersionJohny Kutty Joseph
 
Basic statistics 1
Basic statistics  1Basic statistics  1
Basic statistics 1Kumar P
 
Measures of Central Tendency and Dispersion
Measures of Central Tendency and DispersionMeasures of Central Tendency and Dispersion
Measures of Central Tendency and DispersionPharmacy Universe
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central TendencyNida Nafees
 
Normal distribution
Normal distributionNormal distribution
Normal distributionCamilleJoy3
 
3 descritive statistics measure of central tendency variatio
3 descritive statistics measure of   central   tendency variatio3 descritive statistics measure of   central   tendency variatio
3 descritive statistics measure of central tendency variatioLama K Banna
 
Data Distribution &The Probability Distributions
Data Distribution &The Probability DistributionsData Distribution &The Probability Distributions
Data Distribution &The Probability Distributionsmahaaltememe
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersionGilbert Joseph Abueg
 
Theory of estimation
Theory of estimationTheory of estimation
Theory of estimationTech_MX
 
Point Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsPoint Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsUniversity of Salerno
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbersUlster BOCES
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyPrithwis Mukerjee
 

La actualidad más candente (20)

Presentation on "Measure of central tendency"
Presentation on "Measure of central tendency"Presentation on "Measure of central tendency"
Presentation on "Measure of central tendency"
 
Stat 4 the normal distribution & steps of testing hypothesis
Stat 4 the normal distribution & steps of testing hypothesisStat 4 the normal distribution & steps of testing hypothesis
Stat 4 the normal distribution & steps of testing hypothesis
 
Central tendency and Variation or Dispersion
Central tendency and Variation or DispersionCentral tendency and Variation or Dispersion
Central tendency and Variation or Dispersion
 
Basic statistics 1
Basic statistics  1Basic statistics  1
Basic statistics 1
 
Measures of Central Tendency and Dispersion
Measures of Central Tendency and DispersionMeasures of Central Tendency and Dispersion
Measures of Central Tendency and Dispersion
 
Measures of Central Tendency
Measures of Central TendencyMeasures of Central Tendency
Measures of Central Tendency
 
Point Estimation
Point EstimationPoint Estimation
Point Estimation
 
The Normal Distribution
The Normal DistributionThe Normal Distribution
The Normal Distribution
 
Normal distribution
Normal distributionNormal distribution
Normal distribution
 
Parameter estimation
Parameter estimationParameter estimation
Parameter estimation
 
3 descritive statistics measure of central tendency variatio
3 descritive statistics measure of   central   tendency variatio3 descritive statistics measure of   central   tendency variatio
3 descritive statistics measure of central tendency variatio
 
Statistics - Basics
Statistics - BasicsStatistics - Basics
Statistics - Basics
 
Statistical parameters
Statistical parametersStatistical parameters
Statistical parameters
 
Data Distribution &The Probability Distributions
Data Distribution &The Probability DistributionsData Distribution &The Probability Distributions
Data Distribution &The Probability Distributions
 
CABT Math 8 measures of central tendency and dispersion
CABT Math 8   measures of central tendency and dispersionCABT Math 8   measures of central tendency and dispersion
CABT Math 8 measures of central tendency and dispersion
 
Chapter 5 and Chapter 6
Chapter 5 and Chapter 6 Chapter 5 and Chapter 6
Chapter 5 and Chapter 6
 
Theory of estimation
Theory of estimationTheory of estimation
Theory of estimation
 
Point Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis testsPoint Estimate, Confidence Interval, Hypotesis tests
Point Estimate, Confidence Interval, Hypotesis tests
 
Describing quantitative data with numbers
Describing quantitative data with numbersDescribing quantitative data with numbers
Describing quantitative data with numbers
 
QT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central TendencyQT1 - 03 - Measures of Central Tendency
QT1 - 03 - Measures of Central Tendency
 

Destacado

Lecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.airLecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.airatutor_te
 
Presentation non parametric
Presentation non parametricPresentation non parametric
Presentation non parametricIrfan Hussain
 
presentation of data
presentation of datapresentation of data
presentation of dataChie Pegollo
 
Sampling techniquesmod5
Sampling techniquesmod5Sampling techniquesmod5
Sampling techniquesmod5Chie Pegollo
 
Commonly used Statistics in Medical Research Handout
Commonly used Statistics in Medical Research HandoutCommonly used Statistics in Medical Research Handout
Commonly used Statistics in Medical Research HandoutPat Barlow
 
What is a Kruskal Wallis-Test?
What is a Kruskal Wallis-Test?What is a Kruskal Wallis-Test?
What is a Kruskal Wallis-Test?Ken Plummer
 
Nonparametric tests
Nonparametric testsNonparametric tests
Nonparametric testsArun Kumar
 
Null hypothesis for Kruskal Wallis Test
Null hypothesis for Kruskal Wallis TestNull hypothesis for Kruskal Wallis Test
Null hypothesis for Kruskal Wallis TestKen Plummer
 
Definitions And Scope Of Economics 5
Definitions And Scope Of Economics  5Definitions And Scope Of Economics  5
Definitions And Scope Of Economics 5siraj2762268
 
Parametric and non parametric test
Parametric and non parametric testParametric and non parametric test
Parametric and non parametric testAjay Malpani
 
Basics of statistics
Basics of statisticsBasics of statistics
Basics of statisticsdonthuraj
 
Statistics for Class XI (CBSE)
Statistics for Class XI (CBSE)Statistics for Class XI (CBSE)
Statistics for Class XI (CBSE)mini4deb
 
Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency Jan Nah
 
Introduction to economics
Introduction to economicsIntroduction to economics
Introduction to economicsMichael Noel
 

Destacado (20)

Lecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.airLecture slides stats1.13.l22.air
Lecture slides stats1.13.l22.air
 
Presentation non parametric
Presentation non parametricPresentation non parametric
Presentation non parametric
 
presentation of data
presentation of datapresentation of data
presentation of data
 
Sampling techniquesmod5
Sampling techniquesmod5Sampling techniquesmod5
Sampling techniquesmod5
 
Berd 5-6
Berd 5-6Berd 5-6
Berd 5-6
 
Commonly used Statistics in Medical Research Handout
Commonly used Statistics in Medical Research HandoutCommonly used Statistics in Medical Research Handout
Commonly used Statistics in Medical Research Handout
 
What is a Kruskal Wallis-Test?
What is a Kruskal Wallis-Test?What is a Kruskal Wallis-Test?
What is a Kruskal Wallis-Test?
 
Samplels & Sampling Techniques
Samplels & Sampling TechniquesSamplels & Sampling Techniques
Samplels & Sampling Techniques
 
Nonparametric tests
Nonparametric testsNonparametric tests
Nonparametric tests
 
Null hypothesis for Kruskal Wallis Test
Null hypothesis for Kruskal Wallis TestNull hypothesis for Kruskal Wallis Test
Null hypothesis for Kruskal Wallis Test
 
Definitions And Scope Of Economics 5
Definitions And Scope Of Economics  5Definitions And Scope Of Economics  5
Definitions And Scope Of Economics 5
 
Chapter 3
Chapter 3Chapter 3
Chapter 3
 
Non parametric tests
Non parametric testsNon parametric tests
Non parametric tests
 
Parametric and non parametric test
Parametric and non parametric testParametric and non parametric test
Parametric and non parametric test
 
Non-Parametric Tests
Non-Parametric TestsNon-Parametric Tests
Non-Parametric Tests
 
Presentation of data
Presentation of dataPresentation of data
Presentation of data
 
Basics of statistics
Basics of statisticsBasics of statistics
Basics of statistics
 
Statistics for Class XI (CBSE)
Statistics for Class XI (CBSE)Statistics for Class XI (CBSE)
Statistics for Class XI (CBSE)
 
Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency Mean, Median, Mode: Measures of Central Tendency
Mean, Median, Mode: Measures of Central Tendency
 
Introduction to economics
Introduction to economicsIntroduction to economics
Introduction to economics
 

Similar a Basics of Statistics

Sriram seminar on introduction to statistics
Sriram seminar on introduction to statisticsSriram seminar on introduction to statistics
Sriram seminar on introduction to statisticsSriram Chakravarthy
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsBabasab Patil
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variablesnszakir
 
Lect w2 measures_of_location_and_spread
Lect w2 measures_of_location_and_spreadLect w2 measures_of_location_and_spread
Lect w2 measures_of_location_and_spreadRione Drevale
 
Descriptive Statistics Formula Sheet Sample Populatio.docx
Descriptive Statistics Formula Sheet    Sample Populatio.docxDescriptive Statistics Formula Sheet    Sample Populatio.docx
Descriptive Statistics Formula Sheet Sample Populatio.docxsimonithomas47935
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersionKirti Gupta
 
AP Statistic and Probability 6.1 (1).ppt
AP Statistic and Probability 6.1 (1).pptAP Statistic and Probability 6.1 (1).ppt
AP Statistic and Probability 6.1 (1).pptAlfredNavea1
 
Introduction to Statistics and Probability
Introduction to Statistics and ProbabilityIntroduction to Statistics and Probability
Introduction to Statistics and ProbabilityBhavana Singh
 
C2 st lecture 13 revision for test b handout
C2 st lecture 13   revision for test b handoutC2 st lecture 13   revision for test b handout
C2 st lecture 13 revision for test b handoutfatima d
 
Standard deviation
Standard deviationStandard deviation
Standard deviationMai Ngoc Duc
 
2 Review of Statistics. 2 Review of Statistics.
2 Review of Statistics. 2 Review of Statistics.2 Review of Statistics. 2 Review of Statistics.
2 Review of Statistics. 2 Review of Statistics.WeihanKhor2
 

Similar a Basics of Statistics (20)

Sriram seminar on introduction to statistics
Sriram seminar on introduction to statisticsSriram seminar on introduction to statistics
Sriram seminar on introduction to statistics
 
Descriptive
DescriptiveDescriptive
Descriptive
 
Talk 3
Talk 3Talk 3
Talk 3
 
S t a t i s t i c s
S t a t i s t i c sS t a t i s t i c s
S t a t i s t i c s
 
S t a t i s t i c s
S t a t i s t i c sS t a t i s t i c s
S t a t i s t i c s
 
Numerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec domsNumerical measures stat ppt @ bec doms
Numerical measures stat ppt @ bec doms
 
Basic statistics
Basic statisticsBasic statistics
Basic statistics
 
Chapter 4 part2- Random Variables
Chapter 4 part2- Random VariablesChapter 4 part2- Random Variables
Chapter 4 part2- Random Variables
 
Statistical Methods
Statistical MethodsStatistical Methods
Statistical Methods
 
Lect w2 measures_of_location_and_spread
Lect w2 measures_of_location_and_spreadLect w2 measures_of_location_and_spread
Lect w2 measures_of_location_and_spread
 
Descriptive Statistics Formula Sheet Sample Populatio.docx
Descriptive Statistics Formula Sheet    Sample Populatio.docxDescriptive Statistics Formula Sheet    Sample Populatio.docx
Descriptive Statistics Formula Sheet Sample Populatio.docx
 
Central tendency _dispersion
Central tendency _dispersionCentral tendency _dispersion
Central tendency _dispersion
 
AP Statistic and Probability 6.1 (1).ppt
AP Statistic and Probability 6.1 (1).pptAP Statistic and Probability 6.1 (1).ppt
AP Statistic and Probability 6.1 (1).ppt
 
Normal as Approximation to Binomial
Normal as Approximation to Binomial  Normal as Approximation to Binomial
Normal as Approximation to Binomial
 
Central Tendency.pptx
Central Tendency.pptxCentral Tendency.pptx
Central Tendency.pptx
 
Introduction to Statistics and Probability
Introduction to Statistics and ProbabilityIntroduction to Statistics and Probability
Introduction to Statistics and Probability
 
statistics
statisticsstatistics
statistics
 
C2 st lecture 13 revision for test b handout
C2 st lecture 13   revision for test b handoutC2 st lecture 13   revision for test b handout
C2 st lecture 13 revision for test b handout
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
2 Review of Statistics. 2 Review of Statistics.
2 Review of Statistics. 2 Review of Statistics.2 Review of Statistics. 2 Review of Statistics.
2 Review of Statistics. 2 Review of Statistics.
 

Último

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...Amil Baba Dawood bangali
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in collegessuser7a7cd61
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改yuu sss
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 

Último (20)

9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
NO1 Certified Black Magic Specialist Expert Amil baba in Lahore Islamabad Raw...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
While-For-loop in python used in college
While-For-loop in python used in collegeWhile-For-loop in python used in college
While-For-loop in python used in college
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
专业一比一美国俄亥俄大学毕业证成绩单pdf电子版制作修改
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 

Basics of Statistics

  • 1. 1 Basics of Statistics A Quick reference to the basics of statistics by Shiva Dasharathi
  • 2. 2 1. Population & Sample 2. Descriptive Statistics 1. Central tendency 2. Spread (or ) Dispersion 3. Normal Distribution 4. Central Limit Theorem 5. Correlation, covariance 6. Probability 7. Statistical tests & Significance level 8. Random Variables 9. Other Distributions Contents
  • 3. 3 Population & Sample Population: is the complete set of items of a characteristic of given subject of analysis Sample: A sample is a part of the population under study selected so that inferences can be drawn from it about the population Primary data: Data collected for actual analysis of the subject Secondary Data: Already collected data used for analyzing other factors of the subject Measurement scales : Nominal, ordinal, interval, ratio
  • 4. 4 Descriptive statistics Central Tendency: Mean Mean: is the average value Arithmetic Mean Geometric Mean Harmonic Mean N xi∑= N ix∏= ∑= = N i ix N 1 1
  • 5. 5 Descriptive statistics Central Tendency: Median Median: is the middle term in an ordered set : if ‘N’ is odd number = : if ‘N’ is even number Ex: S = {1,4,7,7,9,11,23} median of S = 4th term of set = 7 xN 2 1+ { 2 2 1 2 xx NN ++
  • 6. 6 Descriptive statistics Central Tendency: Mode Mode: = High frequency term Ex: S = {1,4,11,4,3,9,3,5,7,5,3} mode of S = 3
  • 7. 7 Descriptive statistics Spread: Variance of a sample Standard Deviation = ( ) 1 2 2 − − = ∑ N X µ σ Varianceσ
  • 8. 8 Descriptive statistics for the given sample S = {1, 2,5,7,9,11,12,15,} Min:1 Max:15 Range = Max – Min = 15 -1 = 14 IQR: Inter Quartile Range = Q3 - Q1 = 11.5 – 3.5 Q3 = (11+12)/2 = 11.5 = 8 Q1 = (2+5)/2 = 3.5 Q1: middle term in the 1st half of the ordered set Q2: Median Q3: middle term in the 2nd half of the ordered set {
  • 9. 9 Normal Distribution Normal (or) Gaussian Distribution: - Is a bell curve with Mean at the center and - 68% data distributed in 1 Standard Deviation range - 95.4% data distributed in 2 Standard Deviations range - 99.7% data distributed in 3 Standard Deviations range ),( σµN
  • 10. 10 Normal Distribution In a perfect Normal Distribution, 1. parameters Mean, Median, Mode are aligned 2. data is symmetrically distributed around these parameters ),( σµN
  • 11. 11 Standardized Normal Distribution )1,0(N To standardize the distribution, Step 1. center the mean (minus the mean from all the data points) Step 2. divide each value by the standard deviation Standardized Normal Distribution: Isa normal distribution with mean=0 and standard deviation=1
  • 12. 12 Skewness Skewness: indicates the symmetry of the distribution
  • 13. 13 Central Limit Theorem ),( n N σ µ pµµ = ) n pσ σ =      : : : n p p σ µ
  • 14. 14 Correlation Correlation: Cor(X,Y) = [-1,1] -Correlation indicates the strength of association between two variable - it ranges from -1 to +1 cor(x,y ) = 1 // being perfectly positively correlated cor(x,y) = -1 // being perfectly negatively correlated cor(x,y) = 0 // being independent Cor(X,Y) =
  • 16. 16 Covariance Covariance: is a measure of how two variables are changing together. - So correlation is the covariance normalized by the standard deviation of the 2 variables
  • 17. 17 Probability Terminology: Sample space: set of all possible outcomes of an experiment Event: is a subset of sample space Probability: is the measure of the likeliness that an event will occur among all the possible outcomes. Probability of ‘event x’ = Example: Event: tossing a fair coin Sample Space = {HEAD,TAIL} Probability of ‘Head’ = P(HEAD) = ½ = 0.5 Probability of ‘Tail’ = P(TAIL) = ½ = 0.5 salloutcome likelyness xp =)(
  • 18. 18 Probability cont.. Axioms of probability 1. 2. , 3. Sum of all the probabilities of events in a sample space is equal to 1 4. ]1,0[)( =xp )(xpc )(1)( xpxpc −= )(xp are mutually exclusive if, 1..)()( 21 =++ xpxp )()()()( BAPBPAPBAP ∩−+=∪
  • 19. 19 Conditional Probability Conditional Probability: is the probability of event A given event B occurred Where, : Probability of A conditioned on B : Joint probability of A and B : Probability of B for independent events A, B )( )( )/( BP BAP BAP ∩ =      ∩ )( )( )/( BP BAP BAP )(*)()( BPAPBAP =∩ )( )( )(*)( )/( AP BP BPAP BAP == ∴
  • 20. 20 Bayes Rule Bayes Rule: Posterior * evidence = Prior * likelihood from conditional probability, )(*)/()(*)/( )(*)/()( )( )( )/( )(*)/()( )( )( )/( APABPBPBAP APABPBAP AP BAP ABP BPBAPBAP BP BAP BAP = =∩⇒ ∩ = =∩⇒ ∩ =  )( )/(*)( )/( BP ABPBP BAP =
  • 21. 21 Naïve Bayesian Naïve Bayesian: Bayes Rule + Conditional independence If ‘E’ is the event conditioned on x1,x2, .. xn, then by applying the chain rule & independence condition we get the final equation ∑= = n i ii n n xPx ExPExPExPEP xxxEP 1 * 21 2,1 )( )/(..*)/(*)/(*)( ),../( )/(...*)/(*)/(*)( 21 ExPExPExPEP n),../( 2,1 nxxxEP α ∏= n i i ExPEP 1 )/(*)(α
  • 22. 22 Statistical tests & Significance level t-test: z-test: Chi-square test: ANOVA: Factor ANOVA: H0 – Null Hypothesis H1 – Alternative Hypothesis p-value (significance level, for Null hypothesis) Type -1 error Type-2 error
  • 23. 23 Random Variable Random Variables & Probability Distribution: Discrete: Probability mass Function Cumulative mass function Continuous Probability Density Function Cumulative density function
  • 24. 24 Few other Distributions Bernoulli distribution: Binomial Distribution: Poisson Distribution: