SlideShare a Scribd company logo
1 of 67
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Terminology
Categories in Statistics
Descriptive & Inferential
Statistics
Statistics in R
Descriptive Statistics in R
Inferential Statistics in R
Agenda
Introduction to Statistics
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Introduction to Statistics
Statistics is a branch of mathematics dealing with data collection and organization, analysis,
interpretation and presentation.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Introduction to Statistics
Statistics is a branch of mathematics dealing with data collection and organization, analysis,
interpretation and presentation.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Introduction to Statistics
Statistics is a branch of mathematics dealing with data collection and organization, analysis,
interpretation and presentation.
Analyse Data
Build a Model
Infer Result
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Introduction to Statistics
Statistics is a branch of mathematics dealing with data collection and organization, analysis,
interpretation and presentation.
Statistics
Stock
Market
Life
Sciences
Weather
Retail
Insurance
Education
Terminology
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Basic Terminology
There are a few statistical terms one should be aware of while dealing with statistics.
Population ParameterSample Variable
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Basic Terminology
There are a few statistical terms one should be aware of while dealing with statistics.
Population ParameterSample Variable
Population is the set of sources from which data has to be
collected.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Basic Terminology
There are a few statistical terms one should be aware of while dealing with statistics.
Population ParameterSample Variable
A Sample is a subset of the Population.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Basic Terminology
There are a few statistical terms one should be aware of while dealing with statistics.
Population ParameterSample Variable
A variable is any characteristics, number, or quantity that can
be measured or counted.
A variable may also be called a data item.
Gender Age Region
Height
Weight
Income
Blood Group Ethnicity
Degree
Time
Language
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Basic Terminology
There are a few statistical terms one should be aware of while dealing with statistics.
Population ParameterSample Variable
Also known as a statistical model, A statistical
Parameter or population parameter is a quantity that
indexes a family of probability distributions.
µ
∑
х
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Types of Analysis
An analysis can be done in one of two ways.
Analysis
Quantitative Qualitative
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Types of Analysis
An analysis can be done in one of two ways.
Also known as Statistical Analysis,
it is the science of collecting &
interpreting objects with numbers.
Also known as Non-statistical
Analysis, it mostly deals with
generic data using text, media, etc
Analysis
Quantitative Qualitative
Categories in Statistics
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Inferential statistics makes inferences and predictions about a
population based on a sample of data taken from the population in
question.
Descriptive statistics uses the data to provide descriptions of the
population, either through numerical calculations or graphs or
tables.
Categories in Statistics
There are two major categories in Statistics.
Descriptive
InferentialInferential
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Descriptive Statistics
This method, is mainly focused upon the main characteristics of data. It provides graphical
summary of the data.
Characteristics of Data
Descriptive Statistics
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Descriptive Statistics
Maximum
Minimum
Average
This method, is mainly focused upon the main characteristics of data. It provides graphical
summary of the data.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Inferential Statistics
This method, generalizes a large dataset and applies probability to draw a conclusion. It allows us
to infer data parameters based on a statistical model using a sample data.
Statistical Model
Start
Process Step
Decision
Answer
Choice I
Choice II
Inferential Statistics
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Inferential Statistics
Tall
Short
Average
This method, generalizes a large dataset and applies probability to draw a conclusion. It allows us
to infer data parameters based on a statistical model using a sample data.
Descriptive Statistics – Statistical Measures
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Descriptive Statistics – Use Case
Here is a sample dataset of cars containing
the variables: Cars, Mileage per
Gallon(mpg), Cylinder Type (cyl),
Displacement (disp), Horse Power(hp) &
Real Axle Ratio(drat).
Using descriptive Analysis, you can analyse
each of the variables in the dataset for
mean, standard deviation, minimum and
maximum.
Cars mpg cyl disp hp drat
A 21 6 160 110 3.9
B 21 6 160 110 3.9
C 22.8 4 108 93 3.85
D 21.3 6 108 96 3
E 23 4 150 90 4
F 23 6 108 110 3.9
G 23 4 160 110 3.9
H 23 6 160 110 3.9
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of the Centre
There are a few statistical terms one should be aware of while dealing with statistics.
Mean Median Mode
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Descriptive Statistics – Use Case
If we want to find out the average
horsepower of the cars among the
population of cars, we will check and
calculate the average of all values. In this
case,
Cars mpg cyl disp hp drat
A 21 6 160 110 3.9
B 21 6 160 110 3.9
C 22.8 4 108 93 3.85
D 21.3 6 108 96 3
E 23 4 150 90 4
F 23 6 108 110 3.9
G 23 4 160 110 3.9
H 23 6 160 110 3.9
110 + 110 + 93 + 96 + 90 + 110 + 110 + 110
8
= 103.625
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of the Centre
There are a few statistical terms one should be aware of while dealing with statistics.
Mean Median Mode
Measure of average of all the values in a sample is called Mean.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Descriptive Statistics – Use Case
If we want to find out the centre value of
mpg among the population of cars, we will
arrange the mpg values in ascending order
to choose the middle value. In this case,
21,21,21.3,22.8,23,23,23,23
But in case of even entries, we take
average of the two middle values. In this
case,
22.8+23
2
= 22.9
Cars mpg cyl disp hp drat
A 21 6 160 110 3.9
B 21 6 160 110 3.9
C 22.8 4 108 93 3.85
D 21.3 6 108 96 3
E 23 4 150 90 4
F 23 6 108 110 3.9
G 23 4 160 110 3.9
H 23 6 160 110 3.9
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of the Centre
There are a few statistical terms one should be aware of while dealing with statistics.
Mean Median Mode
Measure of the central value of the sample set is called Median.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Descriptive Statistics – Use Case
If we want to find out the most common
type of cylinder among the population of
cars, we will check the value which is
repeated most number of times.
4 6
4 6
Cars mpg cyl disp hp drat
A 21 6 160 110 3.9
B 21 6 160 110 3.9
C 22.8 4 108 93 3.85
D 21.3 6 108 96 3
E 23 4 150 90 4
F 23 6 108 110 3.9
G 23 4 160 110 3.9
H 23 6 160 110 3.9
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of the Centre
There are a few statistical terms one should be aware of while dealing with statistics.
Mean Median Mode
The value most recurrent in the sample set is known as Mode.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of the Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Range is the given measure of how spread apart the values in a dataset are.
Range = Max(𝑥𝑖) - Min(𝑥𝑖)
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Inter Quartile Range(IQR) is the measure of variability, based on dividing a dataset into
quartiles.
1 2 3 4 5 6 7 8
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Quartile
1 2 3 4 5 6 7 8
Q1 Q2 Q3
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Quartile
1 2 3 4 5 6 7 8
Q1 Q2 Q3
Q1=
2+3
2
=2.5
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Quartile
1 2 3 4 5 6 7 8
Q1 Q2 Q3
Q2=
4+5
2
=4.5
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Quartile
1 2 3 4 5 6 7 8
Q1 Q2 Q3
Q3=
6+7
2
=6.5
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Inter Quartile Range
1 2 3 4 5 6 7 8
Q1 Q3
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Variance describes how much a random variable differs from its expected value.
It entails computing squares of deviations.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
❖ Deviation is the difference between each element from the mean.
Deviation = (𝑥𝑖-µ)
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
❖ Population Variance is the average of squared deviations.
σ² = ෍
𝑖=1
𝑁
= (𝑥𝑖−𝜇)²
1
𝑁
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
❖ Sample Variance is the average of squared differences from the mean.
s² = ෍
𝑖=1
𝑁
= (𝑥𝑖− ҧ𝑥)²
1
(𝑛 − 1)
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Measures of Spread
There are a few statistical terms one should be aware of while dealing with statistics.
Range Standard DeviationInter Quartile Range Variance
Standard Deviation is the measure of the dispersion of a set of data from its mean.
σ = ෍
𝑖=1
𝑁
= (𝑥𝑖−𝜇)²
1
𝑁
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Standard Deviation– Use Case
Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4,
10, 9, 6, 9, 4. Work out the Standard Deviation.
Find out the
mean for your
sample set.
STEP 1 The Mean is:
9+2+5+4+12+7+8+11+9+3+7+4+12+5+4+10+9+6+9+4
20
⸫µ=7
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Standard Deviation– Use Case
Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4,
10, 9, 6, 9, 4. Work out the Standard Deviation.
Then for each
number, subtract
the Mean and
square the result.
STEP 2
(𝑥𝑖−𝜇)²
(9-7)²= 2²=4
(2-7)²= (-5)²=25
(5-7)²= (-2)²=4
And so on…
⸫ We get the following results:
4, 25, 4, 9, 25, 0, 1, 16, 4, 16, 0, 9, 25, 4, 9, 9, 4, 1, 4, 9
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Standard Deviation– Use Case
Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4,
10, 9, 6, 9, 4. Work out the Standard Deviation.
Then work out the
mean of those
squared
differences.
STEP 3 ෍
𝑖=1
𝑁
= (𝑥𝑖−𝜇)²
1
𝑁
4+25+4+9+25+0+1+16+4+16+0+9+25+4+9+9+4+1+4+9
20
⸫ σ² = 8.9
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Standard Deviation– Use Case
Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4,
10, 9, 6, 9, 4. Work out the Standard Deviation.
Take square root
of σ².
STEP 4
⸫ σ = 2.983
෍
𝑖=1
𝑁
= (𝑥𝑖−𝜇)²
1
𝑁
σ =
Statistics in R
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Statistics in R
❖ R is open-source and freely available.
❖ R is cross-platform compatible.
❖ R is a powerful scripting language.
❖ R is highly flexible and evolved.
Reasons for moving to R
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Statistics in R
❖ R is open-source and freely available.
❖ R is cross-platform compatible.
❖ R is a powerful scripting language.
❖ R is highly flexible and evolved.
Reasons for moving to R
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Statistics in R
❖ R is open-source and freely available.
❖ R is cross-platform compatible.
❖ R is a powerful scripting language.
❖ R is highly flexible and evolved.
Reasons for moving to R
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Statistics in R
❖ R is open-source and freely available.
❖ R is cross-platform compatible.
❖ R is a powerful scripting language.
❖ R is highly flexible and evolved.
Reasons for moving to R
Descriptive statistics in R
Inferential Statistics – Hypothesis Testing
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Hypothesis Testing
Statisticians use hypothesis testing to formally check whether the hypothesis is accepted or
rejected.
Hypothesis testing is conducted in the following manner:
❖ State the Hypotheses – This stage involves stating the null and alternative hypotheses.
❖ Formulate an Analysis Plan – This stage involves the construction of an analysis plan.
❖ Analyse Sample Data – This stage involves the calculation and interpretation of the test
statistic as described in the analysis plan.
❖ Interpret Results – This stage involves the application of the decision rule described in the
analysis plan.
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Hypothesis Testing
Nick John Bob Harry
Assume the event is free of bias.
So, what is the probability of John not cheating?
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Hypothesis Testing
Nick John Bob Harry
P(John not picked for a day) =
3
4
P(John not picked for 3 days) =
3
4
×
3
4
×
3
4
= 0.42 (approx)
P(John not picked for 12 days) = (
3
4
) 12
= 0.032 < 𝟎. 𝟎𝟓
Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification
Hypothesis Testing
Nick John Bob Harry
Null Hypothesis (𝑯 𝟎) : Result is no different from assumption.
Alternate Hypothesis (𝑯 𝒂) : Result disproves the assumption.
Probability of Event < 𝟎. 𝟎𝟓 (5%)
Inferential Statistics in R
www.edureka.co/masters-program/business-intelligence-certification

More Related Content

What's hot

Exploratory Data Analysis using Python
Exploratory Data Analysis using PythonExploratory Data Analysis using Python
Exploratory Data Analysis using PythonShirin Mojarad, Ph.D.
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Simplilearn
 
decision tree regression
decision tree regressiondecision tree regression
decision tree regressionAkhilesh Joshi
 
Linear regression
Linear regressionLinear regression
Linear regressionMartinHogg9
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualizationDr. Hamdan Al-Sabri
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Usama Fayyaz
 
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...Simplilearn
 
Machine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series PredictionMachine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series PredictionGianluca Bontempi
 
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...Simplilearn
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysisGramener
 
Linear Regression With R
Linear Regression With RLinear Regression With R
Linear Regression With REdureka!
 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsDerek Kane
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Md. Main Uddin Rony
 
Data science life cycle
Data science life cycleData science life cycle
Data science life cycleManoj Mishra
 
Feature Selection in Machine Learning
Feature Selection in Machine LearningFeature Selection in Machine Learning
Feature Selection in Machine LearningUpekha Vandebona
 
Descriptive Statistics and Data Visualization
Descriptive Statistics and Data VisualizationDescriptive Statistics and Data Visualization
Descriptive Statistics and Data VisualizationDouglas Joubert
 

What's hot (20)

Data Management in R
Data Management in RData Management in R
Data Management in R
 
Statistics and data science
Statistics and data scienceStatistics and data science
Statistics and data science
 
Exploratory Data Analysis using Python
Exploratory Data Analysis using PythonExploratory Data Analysis using Python
Exploratory Data Analysis using Python
 
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
Supervised and Unsupervised Learning In Machine Learning | Machine Learning T...
 
decision tree regression
decision tree regressiondecision tree regression
decision tree regression
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Exploratory data analysis data visualization
Exploratory data analysis data visualizationExploratory data analysis data visualization
Exploratory data analysis data visualization
 
Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning Supervised learning and Unsupervised learning
Supervised learning and Unsupervised learning
 
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
Linear Regression Analysis | Linear Regression in Python | Machine Learning A...
 
Machine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series PredictionMachine Learning Strategies for Time Series Prediction
Machine Learning Strategies for Time Series Prediction
 
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
Machine Learning Tutorial Part - 2 | Machine Learning Tutorial For Beginners ...
 
Data preprocessing
Data preprocessingData preprocessing
Data preprocessing
 
Exploratory data analysis
Exploratory data analysisExploratory data analysis
Exploratory data analysis
 
Linear Regression With R
Linear Regression With RLinear Regression With R
Linear Regression With R
 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
 
Data science life cycle
Data science life cycleData science life cycle
Data science life cycle
 
Feature Selection in Machine Learning
Feature Selection in Machine LearningFeature Selection in Machine Learning
Feature Selection in Machine Learning
 
Descriptive Statistics and Data Visualization
Descriptive Statistics and Data VisualizationDescriptive Statistics and Data Visualization
Descriptive Statistics and Data Visualization
 

Similar to Statistics For Data Science | Statistics Using R Programming Language | Hypothesis Testing | Edureka

Application of time series analysis in financial economics
Application of time series analysis in financial economicsApplication of time series analysis in financial economics
Application of time series analysis in financial economicsStats Statswork
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Edureka!
 
Quantitative techniques for
Quantitative techniques forQuantitative techniques for
Quantitative techniques forsmumbahelp
 
Time Series Analysis
Time Series AnalysisTime Series Analysis
Time Series AnalysisAmanda Reed
 
Driving Data to Cut Healthcare Costs
Driving Data to Cut Healthcare CostsDriving Data to Cut Healthcare Costs
Driving Data to Cut Healthcare CostsDan Wellisch
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industryskewdlogix
 
What we do; predictive and prescriptive analytics
What we do; predictive and prescriptive analyticsWhat we do; predictive and prescriptive analytics
What we do; predictive and prescriptive analyticsWeibull AS
 
N ch01
N ch01N ch01
N ch01kk3bii
 
Business analytics -Abhay Mahalley
Business analytics -Abhay MahalleyBusiness analytics -Abhay Mahalley
Business analytics -Abhay MahalleyAbhay Mahalley
 
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...Stats Statswork
 
Meter Reading Benchmarking & Best Practices
Meter Reading Benchmarking & Best PracticesMeter Reading Benchmarking & Best Practices
Meter Reading Benchmarking & Best PracticesThe Ascent Group,Inc.
 
Statistics Assignments 090427
Statistics Assignments 090427Statistics Assignments 090427
Statistics Assignments 090427amykua
 
Alteryx q1'18 investor deck ir
Alteryx q1'18 investor deck   irAlteryx q1'18 investor deck   ir
Alteryx q1'18 investor deck iralteryxinvestor
 

Similar to Statistics For Data Science | Statistics Using R Programming Language | Hypothesis Testing | Edureka (20)

Application of time series analysis in financial economics
Application of time series analysis in financial economicsApplication of time series analysis in financial economics
Application of time series analysis in financial economics
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
 
Quantitative techniques for
Quantitative techniques forQuantitative techniques for
Quantitative techniques for
 
Time Series Analysis
Time Series AnalysisTime Series Analysis
Time Series Analysis
 
statistics.ppt
statistics.pptstatistics.ppt
statistics.ppt
 
Lecture-1.ppt
Lecture-1.pptLecture-1.ppt
Lecture-1.ppt
 
Lecture 1.ppt
Lecture 1.pptLecture 1.ppt
Lecture 1.ppt
 
Lecture 1.ppt
Lecture 1.pptLecture 1.ppt
Lecture 1.ppt
 
Driving Data to Cut Healthcare Costs
Driving Data to Cut Healthcare CostsDriving Data to Cut Healthcare Costs
Driving Data to Cut Healthcare Costs
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industry
 
What we do; predictive and prescriptive analytics
What we do; predictive and prescriptive analyticsWhat we do; predictive and prescriptive analytics
What we do; predictive and prescriptive analytics
 
N ch01
N ch01N ch01
N ch01
 
Business analytics -Abhay Mahalley
Business analytics -Abhay MahalleyBusiness analytics -Abhay Mahalley
Business analytics -Abhay Mahalley
 
Business analytics !!
Business analytics !!Business analytics !!
Business analytics !!
 
Business analytics
Business analytics Business analytics
Business analytics
 
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
Statistical Data Analysis | Data Analysis | Statistics Services | Data Collec...
 
Module1
Module1Module1
Module1
 
Meter Reading Benchmarking & Best Practices
Meter Reading Benchmarking & Best PracticesMeter Reading Benchmarking & Best Practices
Meter Reading Benchmarking & Best Practices
 
Statistics Assignments 090427
Statistics Assignments 090427Statistics Assignments 090427
Statistics Assignments 090427
 
Alteryx q1'18 investor deck ir
Alteryx q1'18 investor deck   irAlteryx q1'18 investor deck   ir
Alteryx q1'18 investor deck ir
 

More from Edureka!

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaEdureka!
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaEdureka!
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaEdureka!
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaEdureka!
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaEdureka!
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaEdureka!
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaEdureka!
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaEdureka!
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaEdureka!
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaEdureka!
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | EdurekaEdureka!
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEdureka!
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEdureka!
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaEdureka!
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaEdureka!
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaEdureka!
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Edureka!
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaEdureka!
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaEdureka!
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | EdurekaEdureka!
 

More from Edureka! (20)

What to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | EdurekaWhat to learn during the 21 days Lockdown | Edureka
What to learn during the 21 days Lockdown | Edureka
 
Top 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | EdurekaTop 10 Dying Programming Languages in 2020 | Edureka
Top 10 Dying Programming Languages in 2020 | Edureka
 
Top 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | EdurekaTop 5 Trending Business Intelligence Tools | Edureka
Top 5 Trending Business Intelligence Tools | Edureka
 
Tableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | EdurekaTableau Tutorial for Data Science | Edureka
Tableau Tutorial for Data Science | Edureka
 
Python Programming Tutorial | Edureka
Python Programming Tutorial | EdurekaPython Programming Tutorial | Edureka
Python Programming Tutorial | Edureka
 
Top 5 PMP Certifications | Edureka
Top 5 PMP Certifications | EdurekaTop 5 PMP Certifications | Edureka
Top 5 PMP Certifications | Edureka
 
Top Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | EdurekaTop Maven Interview Questions in 2020 | Edureka
Top Maven Interview Questions in 2020 | Edureka
 
Linux Mint Tutorial | Edureka
Linux Mint Tutorial | EdurekaLinux Mint Tutorial | Edureka
Linux Mint Tutorial | Edureka
 
How to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| EdurekaHow to Deploy Java Web App in AWS| Edureka
How to Deploy Java Web App in AWS| Edureka
 
Importance of Digital Marketing | Edureka
Importance of Digital Marketing | EdurekaImportance of Digital Marketing | Edureka
Importance of Digital Marketing | Edureka
 
RPA in 2020 | Edureka
RPA in 2020 | EdurekaRPA in 2020 | Edureka
RPA in 2020 | Edureka
 
Email Notifications in Jenkins | Edureka
Email Notifications in Jenkins | EdurekaEmail Notifications in Jenkins | Edureka
Email Notifications in Jenkins | Edureka
 
EA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | EdurekaEA Algorithm in Machine Learning | Edureka
EA Algorithm in Machine Learning | Edureka
 
Cognitive AI Tutorial | Edureka
Cognitive AI Tutorial | EdurekaCognitive AI Tutorial | Edureka
Cognitive AI Tutorial | Edureka
 
AWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | EdurekaAWS Cloud Practitioner Tutorial | Edureka
AWS Cloud Practitioner Tutorial | Edureka
 
Blue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | EdurekaBlue Prism Top Interview Questions | Edureka
Blue Prism Top Interview Questions | Edureka
 
Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka Big Data on AWS Tutorial | Edureka
Big Data on AWS Tutorial | Edureka
 
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | EdurekaA star algorithm | A* Algorithm in Artificial Intelligence | Edureka
A star algorithm | A* Algorithm in Artificial Intelligence | Edureka
 
Kubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | EdurekaKubernetes Installation on Ubuntu | Edureka
Kubernetes Installation on Ubuntu | Edureka
 
Introduction to DevOps | Edureka
Introduction to DevOps | EdurekaIntroduction to DevOps | Edureka
Introduction to DevOps | Edureka
 

Recently uploaded

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGSujit Pal
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 

Recently uploaded (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Google AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAGGoogle AI Hackathon: LLM based Evaluator for RAG
Google AI Hackathon: LLM based Evaluator for RAG
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 

Statistics For Data Science | Statistics Using R Programming Language | Hypothesis Testing | Edureka

  • 1.
  • 2. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 3. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 4. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 5. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 6. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 7. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 8. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 9. Introduction to Statistics Terminology Categories in Statistics Descriptive & Inferential Statistics Statistics in R Descriptive Statistics in R Inferential Statistics in R Agenda
  • 11. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Introduction to Statistics Statistics is a branch of mathematics dealing with data collection and organization, analysis, interpretation and presentation.
  • 12. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Introduction to Statistics Statistics is a branch of mathematics dealing with data collection and organization, analysis, interpretation and presentation.
  • 13. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Introduction to Statistics Statistics is a branch of mathematics dealing with data collection and organization, analysis, interpretation and presentation. Analyse Data Build a Model Infer Result
  • 14. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Introduction to Statistics Statistics is a branch of mathematics dealing with data collection and organization, analysis, interpretation and presentation. Statistics Stock Market Life Sciences Weather Retail Insurance Education
  • 16. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Basic Terminology There are a few statistical terms one should be aware of while dealing with statistics. Population ParameterSample Variable
  • 17. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Basic Terminology There are a few statistical terms one should be aware of while dealing with statistics. Population ParameterSample Variable Population is the set of sources from which data has to be collected.
  • 18. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Basic Terminology There are a few statistical terms one should be aware of while dealing with statistics. Population ParameterSample Variable A Sample is a subset of the Population.
  • 19. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Basic Terminology There are a few statistical terms one should be aware of while dealing with statistics. Population ParameterSample Variable A variable is any characteristics, number, or quantity that can be measured or counted. A variable may also be called a data item. Gender Age Region Height Weight Income Blood Group Ethnicity Degree Time Language
  • 20. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Basic Terminology There are a few statistical terms one should be aware of while dealing with statistics. Population ParameterSample Variable Also known as a statistical model, A statistical Parameter or population parameter is a quantity that indexes a family of probability distributions. µ ∑ х
  • 21. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Types of Analysis An analysis can be done in one of two ways. Analysis Quantitative Qualitative
  • 22. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Types of Analysis An analysis can be done in one of two ways. Also known as Statistical Analysis, it is the science of collecting & interpreting objects with numbers. Also known as Non-statistical Analysis, it mostly deals with generic data using text, media, etc Analysis Quantitative Qualitative
  • 24. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Inferential statistics makes inferences and predictions about a population based on a sample of data taken from the population in question. Descriptive statistics uses the data to provide descriptions of the population, either through numerical calculations or graphs or tables. Categories in Statistics There are two major categories in Statistics. Descriptive InferentialInferential
  • 25. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Descriptive Statistics This method, is mainly focused upon the main characteristics of data. It provides graphical summary of the data. Characteristics of Data Descriptive Statistics
  • 26. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Descriptive Statistics Maximum Minimum Average This method, is mainly focused upon the main characteristics of data. It provides graphical summary of the data.
  • 27. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Inferential Statistics This method, generalizes a large dataset and applies probability to draw a conclusion. It allows us to infer data parameters based on a statistical model using a sample data. Statistical Model Start Process Step Decision Answer Choice I Choice II Inferential Statistics
  • 28. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Inferential Statistics Tall Short Average This method, generalizes a large dataset and applies probability to draw a conclusion. It allows us to infer data parameters based on a statistical model using a sample data.
  • 29. Descriptive Statistics – Statistical Measures
  • 30. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Descriptive Statistics – Use Case Here is a sample dataset of cars containing the variables: Cars, Mileage per Gallon(mpg), Cylinder Type (cyl), Displacement (disp), Horse Power(hp) & Real Axle Ratio(drat). Using descriptive Analysis, you can analyse each of the variables in the dataset for mean, standard deviation, minimum and maximum. Cars mpg cyl disp hp drat A 21 6 160 110 3.9 B 21 6 160 110 3.9 C 22.8 4 108 93 3.85 D 21.3 6 108 96 3 E 23 4 150 90 4 F 23 6 108 110 3.9 G 23 4 160 110 3.9 H 23 6 160 110 3.9
  • 31. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of the Centre There are a few statistical terms one should be aware of while dealing with statistics. Mean Median Mode
  • 32. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Descriptive Statistics – Use Case If we want to find out the average horsepower of the cars among the population of cars, we will check and calculate the average of all values. In this case, Cars mpg cyl disp hp drat A 21 6 160 110 3.9 B 21 6 160 110 3.9 C 22.8 4 108 93 3.85 D 21.3 6 108 96 3 E 23 4 150 90 4 F 23 6 108 110 3.9 G 23 4 160 110 3.9 H 23 6 160 110 3.9 110 + 110 + 93 + 96 + 90 + 110 + 110 + 110 8 = 103.625
  • 33. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of the Centre There are a few statistical terms one should be aware of while dealing with statistics. Mean Median Mode Measure of average of all the values in a sample is called Mean.
  • 34. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Descriptive Statistics – Use Case If we want to find out the centre value of mpg among the population of cars, we will arrange the mpg values in ascending order to choose the middle value. In this case, 21,21,21.3,22.8,23,23,23,23 But in case of even entries, we take average of the two middle values. In this case, 22.8+23 2 = 22.9 Cars mpg cyl disp hp drat A 21 6 160 110 3.9 B 21 6 160 110 3.9 C 22.8 4 108 93 3.85 D 21.3 6 108 96 3 E 23 4 150 90 4 F 23 6 108 110 3.9 G 23 4 160 110 3.9 H 23 6 160 110 3.9
  • 35. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of the Centre There are a few statistical terms one should be aware of while dealing with statistics. Mean Median Mode Measure of the central value of the sample set is called Median.
  • 36. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Descriptive Statistics – Use Case If we want to find out the most common type of cylinder among the population of cars, we will check the value which is repeated most number of times. 4 6 4 6 Cars mpg cyl disp hp drat A 21 6 160 110 3.9 B 21 6 160 110 3.9 C 22.8 4 108 93 3.85 D 21.3 6 108 96 3 E 23 4 150 90 4 F 23 6 108 110 3.9 G 23 4 160 110 3.9 H 23 6 160 110 3.9
  • 37. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of the Centre There are a few statistical terms one should be aware of while dealing with statistics. Mean Median Mode The value most recurrent in the sample set is known as Mode.
  • 38. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of the Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance
  • 39. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Range is the given measure of how spread apart the values in a dataset are. Range = Max(𝑥𝑖) - Min(𝑥𝑖)
  • 40. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Inter Quartile Range(IQR) is the measure of variability, based on dividing a dataset into quartiles. 1 2 3 4 5 6 7 8
  • 41. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Quartile 1 2 3 4 5 6 7 8 Q1 Q2 Q3
  • 42. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Quartile 1 2 3 4 5 6 7 8 Q1 Q2 Q3 Q1= 2+3 2 =2.5
  • 43. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Quartile 1 2 3 4 5 6 7 8 Q1 Q2 Q3 Q2= 4+5 2 =4.5
  • 44. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Quartile 1 2 3 4 5 6 7 8 Q1 Q2 Q3 Q3= 6+7 2 =6.5
  • 45. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Inter Quartile Range 1 2 3 4 5 6 7 8 Q1 Q3
  • 46. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Variance describes how much a random variable differs from its expected value. It entails computing squares of deviations.
  • 47. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance ❖ Deviation is the difference between each element from the mean. Deviation = (𝑥𝑖-µ)
  • 48. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance ❖ Population Variance is the average of squared deviations. σ² = ෍ 𝑖=1 𝑁 = (𝑥𝑖−𝜇)² 1 𝑁
  • 49. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance ❖ Sample Variance is the average of squared differences from the mean. s² = ෍ 𝑖=1 𝑁 = (𝑥𝑖− ҧ𝑥)² 1 (𝑛 − 1)
  • 50. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Measures of Spread There are a few statistical terms one should be aware of while dealing with statistics. Range Standard DeviationInter Quartile Range Variance Standard Deviation is the measure of the dispersion of a set of data from its mean. σ = ෍ 𝑖=1 𝑁 = (𝑥𝑖−𝜇)² 1 𝑁
  • 51. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Standard Deviation– Use Case Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4. Work out the Standard Deviation. Find out the mean for your sample set. STEP 1 The Mean is: 9+2+5+4+12+7+8+11+9+3+7+4+12+5+4+10+9+6+9+4 20 ⸫µ=7
  • 52. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Standard Deviation– Use Case Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4. Work out the Standard Deviation. Then for each number, subtract the Mean and square the result. STEP 2 (𝑥𝑖−𝜇)² (9-7)²= 2²=4 (2-7)²= (-5)²=25 (5-7)²= (-2)²=4 And so on… ⸫ We get the following results: 4, 25, 4, 9, 25, 0, 1, 16, 4, 16, 0, 9, 25, 4, 9, 9, 4, 1, 4, 9
  • 53. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Standard Deviation– Use Case Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4. Work out the Standard Deviation. Then work out the mean of those squared differences. STEP 3 ෍ 𝑖=1 𝑁 = (𝑥𝑖−𝜇)² 1 𝑁 4+25+4+9+25+0+1+16+4+16+0+9+25+4+9+9+4+1+4+9 20 ⸫ σ² = 8.9
  • 54. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Standard Deviation– Use Case Ross has 20 Dinosaur figures. They have the numbers 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4. Work out the Standard Deviation. Take square root of σ². STEP 4 ⸫ σ = 2.983 ෍ 𝑖=1 𝑁 = (𝑥𝑖−𝜇)² 1 𝑁 σ =
  • 56. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Statistics in R ❖ R is open-source and freely available. ❖ R is cross-platform compatible. ❖ R is a powerful scripting language. ❖ R is highly flexible and evolved. Reasons for moving to R
  • 57. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Statistics in R ❖ R is open-source and freely available. ❖ R is cross-platform compatible. ❖ R is a powerful scripting language. ❖ R is highly flexible and evolved. Reasons for moving to R
  • 58. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Statistics in R ❖ R is open-source and freely available. ❖ R is cross-platform compatible. ❖ R is a powerful scripting language. ❖ R is highly flexible and evolved. Reasons for moving to R
  • 59. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Statistics in R ❖ R is open-source and freely available. ❖ R is cross-platform compatible. ❖ R is a powerful scripting language. ❖ R is highly flexible and evolved. Reasons for moving to R
  • 61. Inferential Statistics – Hypothesis Testing
  • 62. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Hypothesis Testing Statisticians use hypothesis testing to formally check whether the hypothesis is accepted or rejected. Hypothesis testing is conducted in the following manner: ❖ State the Hypotheses – This stage involves stating the null and alternative hypotheses. ❖ Formulate an Analysis Plan – This stage involves the construction of an analysis plan. ❖ Analyse Sample Data – This stage involves the calculation and interpretation of the test statistic as described in the analysis plan. ❖ Interpret Results – This stage involves the application of the decision rule described in the analysis plan.
  • 63. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Hypothesis Testing Nick John Bob Harry Assume the event is free of bias. So, what is the probability of John not cheating?
  • 64. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Hypothesis Testing Nick John Bob Harry P(John not picked for a day) = 3 4 P(John not picked for 3 days) = 3 4 × 3 4 × 3 4 = 0.42 (approx) P(John not picked for 12 days) = ( 3 4 ) 12 = 0.032 < 𝟎. 𝟎𝟓
  • 65. Copyright © 2018, edureka and/or its affiliates. All rights reserved.www.edureka.co/masters-program/business-intelligence-certification Hypothesis Testing Nick John Bob Harry Null Hypothesis (𝑯 𝟎) : Result is no different from assumption. Alternate Hypothesis (𝑯 𝒂) : Result disproves the assumption. Probability of Event < 𝟎. 𝟎𝟓 (5%)