SlideShare una empresa de Scribd logo
1 de 15
Quantitative analysis
A brief introduction
Petri Lankoski, 2018 1
You should be familiar with following
• Mean (medelvärde), for a normal distribution
• Median (median)
• Mode (typvärde)
• Line chart (linjediagram)
• Bar chart (stapeldiagram)
Petri Lankoski, 2018 2
Is the Die Loaded?
11st throw
12st throw
43st throw
14st throw
25st throw
We cannot say for certain, but we can estimate how
likely or unlikely the perceived sequence is
In long run we expect to see equal amount of 1s, 2s,
3s, 4s, 5s and 6s
16st throw
Chance to get 1 is 1/6, but as first throw, this is as
likely as any other result. We do not have enough
information to say anything more about this
six throws is probably still too little to estimate the
die, so we would need to roll more…
Petri Lankoski, 2018 3
Is the Die Loaded?
1
1
4
1
2
1
3
6
1
1
1
5
Testing this sequence against expected sequence
indicate that the die is loaded
• But we have around 1% change to be wrong
We roll following sequence: 2 6 2 6 6 4 6 5 4 1 3 4
4 6 5 3 5 3 2 5
• Amounts of 6s and 1s does not match to
expected amounts
• We would have 70% likelihood of being wrong
if we claim that the die is load
Petri Lankoski, 2018 4
Boxplot
Median
IQR,
50% of data
1.5 * IQR
Petri Lankoski, 2018 5
density and violin plot
Violin plot is a form
of density plot
Petri Lankoski, 2018 6
Density plot and data points
Scatter plot
-2 -1 0 1 2
-3-2-10123
Variable 1
Variable2
Scatter plot shows values of two variables
• For example how a participant answered
to questions
Petri Lankoski, 2018 7
Random sampling Predicting election results
- It is not practically possible to ask all what they will vote
- Picking a sample of people randomly & asking them
However, we know that there is uncertainty here
If random sample again, we might get something else
We get:
A: 37.6%
B: 12.3%
C: 33.1%
D: 5.2%
…
We get:
A: 36.9%
B: 13.0%
C: 32.7%
D: 6.1%
…
We can estimate uncertainty, but we need to make some
assumptions
Petri Lankoski, 2018
8
We get:
A: 38.7%
B: 11.0%
C: 31.7%
D: 6.3%
…
Normal distribution
1𝜎 2𝜎-2𝜎 -1𝜎 0𝜎
68.3%
95.4% of data
9
𝜎 = standard deviation
• describes the width of distribution
Back to polling
1.96𝜎-1.96𝜎 0𝜎
95% of population is in the
area of ∓1.96𝜎; sample
distribution behaves similarly
However, within 95% certainty
what we observed falls in area
between -1.96𝜎 and 1.96𝜎.
We cannot know where in
population distribution what
we observed was (red vertical
lines).
10
We do not know true
population value (black
vertical line).
Support for A
36.1%
38.7%
37.6%
Random sampling
Instead of uncertainty, confidence is usually used.
Confidence interval (CI), usually 95%, is function of sample
size and probability of someone choosing a candidate.
0.376 ∓ 1.96 ∗ √
0.376(1 − 0.376)
𝑁
𝜎95%A
Petri Lankoski, 2018 11
We can backtrack from the sample distribution and estimate
the uncertainty in what we observed when polling
• When we poll next time within 95% certainty what we
observed falls in area between -1.96𝜎 and 1.96𝜎
Are two means different, t-test?
A B∆
We have two sample means A and B
Their difference is ∆=B-A
Mean is calculated based on sampled values
Mean(A) =
∑𝑎
𝑛
(for normally distruted variables)
To extrapolate if the there is difference between
groups A and B in population level (from witch A and
B were sampled) we need to account uncertainty.
Again population mean and sample mean can be
different.
Petri Lankoski, 2018 12
Are two means different, t-test?
A B∆
We have two sample means A and B
Their difference is ∆=B-A
t statistic describes difference so that it takes into
account variance (𝜎2) and sample size
p describes probability that perceived data deviates
from null hypothesis; in case null hypothesis of t-test, is
the means are not different.
p depends on t-value and sample size; high t-value
means lower p.
p = 0.05 means that there is 5% change that observed
data did not deviate from expected, there is no
difference. P<0.05 is a typical statistically significant
result criterion.
Petri Lankoski, 2018 13
Are tree means different, one-way ANOVA
• One-way ANOVA is similar to t-test
• F-statistic describes difference so that it takes into account variance
and sample size
• p describes probability that perceived data deviates from null
hypothesis; in case null hypothesis of ANOVA, is the means are not
different
• A significant result (p<0.05) tells that at least one mean differ from
others
• But not which
• Post hoc comparisons are needed to determine which variable differs from
which
Petri Lankoski, 2018 14
Correlation
Correlation (r) describes the strength of
association between two variables
p describes the likelihood that the observed
correlation deviates from what is expected
under null hypothesis (which is that there is no
relation between the two variables)
Correlation does not tell if v1 causes v2 or vice
versa
• There is a strong correlation between ice
cream sales and drowning
• Either is causing another
• Third variable, temperature, related to both
Petri Lankoski, 2018 15

Más contenido relacionado

La actualidad más candente

Classification via Logistic Regression
Classification via Logistic RegressionClassification via Logistic Regression
Classification via Logistic Regression
Taweh Beysolow II
 

La actualidad más candente (19)

beyond objectivity and subjectivity; a discussion paper
beyond objectivity and subjectivity; a discussion paperbeyond objectivity and subjectivity; a discussion paper
beyond objectivity and subjectivity; a discussion paper
 
P value wars
P value warsP value wars
P value wars
 
Discussion a 4th BFFF Harvard
Discussion a 4th BFFF HarvardDiscussion a 4th BFFF Harvard
Discussion a 4th BFFF Harvard
 
The Seven Habits of Highly Effective Statisticians
The Seven Habits of Highly Effective StatisticiansThe Seven Habits of Highly Effective Statisticians
The Seven Habits of Highly Effective Statisticians
 
Statistics for UX Professionals - Jessica Cameron
Statistics for UX Professionals - Jessica CameronStatistics for UX Professionals - Jessica Cameron
Statistics for UX Professionals - Jessica Cameron
 
Biostatistics Workshop: Missing Data
Biostatistics Workshop: Missing DataBiostatistics Workshop: Missing Data
Biostatistics Workshop: Missing Data
 
Statistics for UX Professionals
Statistics for UX ProfessionalsStatistics for UX Professionals
Statistics for UX Professionals
 
P1 Stroop
P1 StroopP1 Stroop
P1 Stroop
 
Classification via Logistic Regression
Classification via Logistic RegressionClassification via Logistic Regression
Classification via Logistic Regression
 
Chp1 Methods and Stats
Chp1 Methods and StatsChp1 Methods and Stats
Chp1 Methods and Stats
 
Research Sample size by Dr Allah Yar Malik
Research Sample size by Dr Allah Yar MalikResearch Sample size by Dr Allah Yar Malik
Research Sample size by Dr Allah Yar Malik
 
Think Like a Strategist - Confab 2019
Think Like a Strategist - Confab 2019Think Like a Strategist - Confab 2019
Think Like a Strategist - Confab 2019
 
The revenge of RA Fisher
The revenge of RA FisherThe revenge of RA Fisher
The revenge of RA Fisher
 
The revenge of RA Fisher
The revenge of RA Fisher The revenge of RA Fisher
The revenge of RA Fisher
 
Clinical trials: three statistical traps for the unwary
Clinical trials: three statistical traps for the unwaryClinical trials: three statistical traps for the unwary
Clinical trials: three statistical traps for the unwary
 
On p-values
On p-valuesOn p-values
On p-values
 
How to do the maths
How to do the mathsHow to do the maths
How to do the maths
 
Hypothesis
HypothesisHypothesis
Hypothesis
 
Chi-Square Test of Independence
Chi-Square Test of IndependenceChi-Square Test of Independence
Chi-Square Test of Independence
 

Similar a Quantitative analysis: A brief introduction

1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
cartlidge
 
Chapter 15 Marketing Research Malhotra
Chapter 15 Marketing Research MalhotraChapter 15 Marketing Research Malhotra
Chapter 15 Marketing Research Malhotra
AADITYA TANTIA
 
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxTopic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
AASTHA76
 
Hypothesis TestingIn doing research, one of the most common acti
Hypothesis TestingIn doing research, one of the most common actiHypothesis TestingIn doing research, one of the most common acti
Hypothesis TestingIn doing research, one of the most common acti
NarcisaBrandenburg70
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
karlhennesey
 
Descriptive And Inferential Statistics for Nursing Research
Descriptive And Inferential Statistics for Nursing ResearchDescriptive And Inferential Statistics for Nursing Research
Descriptive And Inferential Statistics for Nursing Research
enamprofessor
 

Similar a Quantitative analysis: A brief introduction (20)

T test
T test T test
T test
 
Lec 5 - Normality Testing.pptx
Lec 5 - Normality Testing.pptxLec 5 - Normality Testing.pptx
Lec 5 - Normality Testing.pptx
 
M.Ed Tcs 2 seminar ppt npc to submit
M.Ed Tcs 2 seminar ppt npc   to submitM.Ed Tcs 2 seminar ppt npc   to submit
M.Ed Tcs 2 seminar ppt npc to submit
 
1 statistical analysis notes
1 statistical analysis notes1 statistical analysis notes
1 statistical analysis notes
 
Statistics
StatisticsStatistics
Statistics
 
Statistics
StatisticsStatistics
Statistics
 
Chapter 15 Marketing Research Malhotra
Chapter 15 Marketing Research MalhotraChapter 15 Marketing Research Malhotra
Chapter 15 Marketing Research Malhotra
 
Nonparametric and Distribution- Free Statistics _contd
Nonparametric and Distribution- Free Statistics _contdNonparametric and Distribution- Free Statistics _contd
Nonparametric and Distribution- Free Statistics _contd
 
Seventy years of RCTs
Seventy years of RCTsSeventy years of RCTs
Seventy years of RCTs
 
A Lecture on Sample Size and Statistical Inference for Health Researchers
A Lecture on Sample Size and Statistical Inference for Health ResearchersA Lecture on Sample Size and Statistical Inference for Health Researchers
A Lecture on Sample Size and Statistical Inference for Health Researchers
 
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docxTopic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
Topic Learning TeamNumber of Pages 2 (Double Spaced)Num.docx
 
Freq distribution
Freq distributionFreq distribution
Freq distribution
 
Topic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis TestingTopic 2 - More on Hypothesis Testing
Topic 2 - More on Hypothesis Testing
 
Hypothesis TestingIn doing research, one of the most common acti
Hypothesis TestingIn doing research, one of the most common actiHypothesis TestingIn doing research, one of the most common acti
Hypothesis TestingIn doing research, one of the most common acti
 
Introduction to Statistical Methods
Introduction to Statistical MethodsIntroduction to Statistical Methods
Introduction to Statistical Methods
 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
 
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docxPage 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
Page 266LEARNING OBJECTIVES· Explain how researchers use inf.docx
 
250Lec5INFERENTIAL STATISTICS FOR RESEARC
250Lec5INFERENTIAL STATISTICS FOR RESEARC250Lec5INFERENTIAL STATISTICS FOR RESEARC
250Lec5INFERENTIAL STATISTICS FOR RESEARC
 
Descriptive And Inferential Statistics for Nursing Research
Descriptive And Inferential Statistics for Nursing ResearchDescriptive And Inferential Statistics for Nursing Research
Descriptive And Inferential Statistics for Nursing Research
 
Ds vs Is discuss 3.1
Ds vs Is discuss 3.1Ds vs Is discuss 3.1
Ds vs Is discuss 3.1
 

Más de Petri Lankoski

Formal analysis of gameplay
Formal analysis of gameplayFormal analysis of gameplay
Formal analysis of gameplay
Petri Lankoski
 
Gameplay Design Workshop 1/2 (2011)
Gameplay Design Workshop 1/2 (2011)Gameplay Design Workshop 1/2 (2011)
Gameplay Design Workshop 1/2 (2011)
Petri Lankoski
 
Gameplay Design Workshop 2/2 (2011)
Gameplay Design Workshop 2/2 (2011)Gameplay Design Workshop 2/2 (2011)
Gameplay Design Workshop 2/2 (2011)
Petri Lankoski
 
How can game studies support game design practice?
How can game studies support game design practice?How can game studies support game design practice?
How can game studies support game design practice?
Petri Lankoski
 
Game Project / Assignement
Game Project / AssignementGame Project / Assignement
Game Project / Assignement
Petri Lankoski
 
Game Project / Working with Unity
Game Project / Working with UnityGame Project / Working with Unity
Game Project / Working with Unity
Petri Lankoski
 

Más de Petri Lankoski (20)

Game Analysis at HEVGA PhD Summer School
Game Analysis at HEVGA PhD Summer SchoolGame Analysis at HEVGA PhD Summer School
Game Analysis at HEVGA PhD Summer School
 
Constructive Alignment in Teaching Game Research in Game Development Bachelor...
Constructive Alignment in Teaching Game Research in Game Development Bachelor...Constructive Alignment in Teaching Game Research in Game Development Bachelor...
Constructive Alignment in Teaching Game Research in Game Development Bachelor...
 
Perforce
PerforcePerforce
Perforce
 
Level Design Course Intro and Assingnts
Level Design Course Intro and AssingntsLevel Design Course Intro and Assingnts
Level Design Course Intro and Assingnts
 
Embodiment, Game Characters and Game Design
Embodiment, Game Characters and Game DesignEmbodiment, Game Characters and Game Design
Embodiment, Game Characters and Game Design
 
Game research methods book introduction
Game research methods book introductionGame research methods book introduction
Game research methods book introduction
 
Escape: Level Design Exercise in Unity
Escape: Level Design Exercise in UnityEscape: Level Design Exercise in Unity
Escape: Level Design Exercise in Unity
 
Formal analysis of gameplay
Formal analysis of gameplayFormal analysis of gameplay
Formal analysis of gameplay
 
Level Design
Level Design Level Design
Level Design
 
Game system design
Game system designGame system design
Game system design
 
Simulations: Evaluating game system behavior
Simulations: Evaluating game system behavior Simulations: Evaluating game system behavior
Simulations: Evaluating game system behavior
 
Models for story
Models for storyModels for story
Models for story
 
Designprocesser lecture1
Designprocesser lecture1Designprocesser lecture1
Designprocesser lecture1
 
Unity programming 1
Unity programming 1Unity programming 1
Unity programming 1
 
Gameplay Design Workshop 1/2 (2011)
Gameplay Design Workshop 1/2 (2011)Gameplay Design Workshop 1/2 (2011)
Gameplay Design Workshop 1/2 (2011)
 
Gameplay Design Workshop 2/2 (2011)
Gameplay Design Workshop 2/2 (2011)Gameplay Design Workshop 2/2 (2011)
Gameplay Design Workshop 2/2 (2011)
 
How can game studies support game design practice?
How can game studies support game design practice?How can game studies support game design practice?
How can game studies support game design practice?
 
Game Project / Focus
Game Project / FocusGame Project / Focus
Game Project / Focus
 
Game Project / Assignement
Game Project / AssignementGame Project / Assignement
Game Project / Assignement
 
Game Project / Working with Unity
Game Project / Working with UnityGame Project / Working with Unity
Game Project / Working with Unity
 

Último

Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
ranjankumarbehera14
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
HyderabadDolls
 

Último (20)

Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Hapur [ 7014168258 ] Call Me For Genuine Models We ...
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...Top Call Girls in Balaghat  9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
TrafficWave Generator Will Instantly drive targeted and engaging traffic back...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book nowVadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
Vadodara 💋 Call Girl 7737669865 Call Girls in Vadodara Escort service book now
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 

Quantitative analysis: A brief introduction

  • 1. Quantitative analysis A brief introduction Petri Lankoski, 2018 1
  • 2. You should be familiar with following • Mean (medelvärde), for a normal distribution • Median (median) • Mode (typvärde) • Line chart (linjediagram) • Bar chart (stapeldiagram) Petri Lankoski, 2018 2
  • 3. Is the Die Loaded? 11st throw 12st throw 43st throw 14st throw 25st throw We cannot say for certain, but we can estimate how likely or unlikely the perceived sequence is In long run we expect to see equal amount of 1s, 2s, 3s, 4s, 5s and 6s 16st throw Chance to get 1 is 1/6, but as first throw, this is as likely as any other result. We do not have enough information to say anything more about this six throws is probably still too little to estimate the die, so we would need to roll more… Petri Lankoski, 2018 3
  • 4. Is the Die Loaded? 1 1 4 1 2 1 3 6 1 1 1 5 Testing this sequence against expected sequence indicate that the die is loaded • But we have around 1% change to be wrong We roll following sequence: 2 6 2 6 6 4 6 5 4 1 3 4 4 6 5 3 5 3 2 5 • Amounts of 6s and 1s does not match to expected amounts • We would have 70% likelihood of being wrong if we claim that the die is load Petri Lankoski, 2018 4
  • 5. Boxplot Median IQR, 50% of data 1.5 * IQR Petri Lankoski, 2018 5
  • 6. density and violin plot Violin plot is a form of density plot Petri Lankoski, 2018 6 Density plot and data points
  • 7. Scatter plot -2 -1 0 1 2 -3-2-10123 Variable 1 Variable2 Scatter plot shows values of two variables • For example how a participant answered to questions Petri Lankoski, 2018 7
  • 8. Random sampling Predicting election results - It is not practically possible to ask all what they will vote - Picking a sample of people randomly & asking them However, we know that there is uncertainty here If random sample again, we might get something else We get: A: 37.6% B: 12.3% C: 33.1% D: 5.2% … We get: A: 36.9% B: 13.0% C: 32.7% D: 6.1% … We can estimate uncertainty, but we need to make some assumptions Petri Lankoski, 2018 8 We get: A: 38.7% B: 11.0% C: 31.7% D: 6.3% …
  • 9. Normal distribution 1𝜎 2𝜎-2𝜎 -1𝜎 0𝜎 68.3% 95.4% of data 9 𝜎 = standard deviation • describes the width of distribution
  • 10. Back to polling 1.96𝜎-1.96𝜎 0𝜎 95% of population is in the area of ∓1.96𝜎; sample distribution behaves similarly However, within 95% certainty what we observed falls in area between -1.96𝜎 and 1.96𝜎. We cannot know where in population distribution what we observed was (red vertical lines). 10 We do not know true population value (black vertical line). Support for A 36.1% 38.7% 37.6%
  • 11. Random sampling Instead of uncertainty, confidence is usually used. Confidence interval (CI), usually 95%, is function of sample size and probability of someone choosing a candidate. 0.376 ∓ 1.96 ∗ √ 0.376(1 − 0.376) 𝑁 𝜎95%A Petri Lankoski, 2018 11 We can backtrack from the sample distribution and estimate the uncertainty in what we observed when polling • When we poll next time within 95% certainty what we observed falls in area between -1.96𝜎 and 1.96𝜎
  • 12. Are two means different, t-test? A B∆ We have two sample means A and B Their difference is ∆=B-A Mean is calculated based on sampled values Mean(A) = ∑𝑎 𝑛 (for normally distruted variables) To extrapolate if the there is difference between groups A and B in population level (from witch A and B were sampled) we need to account uncertainty. Again population mean and sample mean can be different. Petri Lankoski, 2018 12
  • 13. Are two means different, t-test? A B∆ We have two sample means A and B Their difference is ∆=B-A t statistic describes difference so that it takes into account variance (𝜎2) and sample size p describes probability that perceived data deviates from null hypothesis; in case null hypothesis of t-test, is the means are not different. p depends on t-value and sample size; high t-value means lower p. p = 0.05 means that there is 5% change that observed data did not deviate from expected, there is no difference. P<0.05 is a typical statistically significant result criterion. Petri Lankoski, 2018 13
  • 14. Are tree means different, one-way ANOVA • One-way ANOVA is similar to t-test • F-statistic describes difference so that it takes into account variance and sample size • p describes probability that perceived data deviates from null hypothesis; in case null hypothesis of ANOVA, is the means are not different • A significant result (p<0.05) tells that at least one mean differ from others • But not which • Post hoc comparisons are needed to determine which variable differs from which Petri Lankoski, 2018 14
  • 15. Correlation Correlation (r) describes the strength of association between two variables p describes the likelihood that the observed correlation deviates from what is expected under null hypothesis (which is that there is no relation between the two variables) Correlation does not tell if v1 causes v2 or vice versa • There is a strong correlation between ice cream sales and drowning • Either is causing another • Third variable, temperature, related to both Petri Lankoski, 2018 15

Notas del editor

  1. https://stats.stackexchange.com/questions/3194/how-can-i-test-the-fairness-of-a-d20/3735#3735 chisq.test(table(c(1,1,4,1,2,1,3,6,1,1,1,5)), p = rep(1/6,6)) Chi-squared test for given probabilities data: table(c(1, 1, 4, 1, 2, 1, 3, 6, 1, 1, 1, 5)) X-squared = 15, df = 5, p-value = 0.01036 Note that we cannot test if the die is not biased. We can only test if behaves enough unexpectly rolls = sample(1:6, 20, replace=TRUE) # 20 times d6 chisq.test(table(rolls), p = rep(1/6,6))
  2. Polling is done via random sampling using telephone catalog. However, people owning a phone and people voting are not the same populations and the poll results are systematically off; however, there are techniques counter the sampling bias, especially in the case of voting when it is possible to compare results to poll results.
  3. 𝜎=standard deviation, describes the width of distribution Black vertical line: population value Red vertical line: sample values
  4. 𝜎=standard deviation, describes the width of distribution Black vertical line: population value Red vertical line: sample values
  5. The standard deviation is the square root of the variance.