3. Null Hypothesis Significance Testing
• Goal
– determine whether mean differences among groups in an experiment are greater than the differences expected simply because of chance (error variation)
• First step
– assume that the groups do not differ (H0)
• = the null hypothesis
• equivalently, assume the independent variable had no effect
4. Null Hypothesis Significance Testing
• Next steps
– use probability theory to estimate the likelihood of the observed outcome, assuming the null hypothesis is true
– “statistically significant”
• the outcome has a small likelihood of occurring under H0
• reject H0
• conclude the IV had an effect on the DV
– the difference between means is larger than what would be expected if error variation alone caused the outcome (see the sketch below)
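As a minimal sketch of this decision rule, the Python example below runs an independent-samples t test on two simulated groups; the group sizes, means, standard deviation, and random seed are all hypothetical values chosen only for illustration.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)  # hypothetical seed, for reproducibility
control = rng.normal(loc=50, scale=10, size=30)    # no-treatment group
treatment = rng.normal(loc=58, scale=10, size=30)  # hypothetical IV effect of +8

# p-value: likelihood of a mean difference at least this large if H0 were true
t_stat, p_value = stats.ttest_ind(treatment, control)
print(f"t = {t_stat:.2f}, p = {p_value:.4f}")

alpha = 0.05
if p_value < alpha:
    print("Reject H0: the difference exceeds what error variation alone would produce.")
else:
    print("Do not reject H0.")
```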
8. Null Hypothesis Significance Testing
• How small does the likelihood have to be to decide the outcome isn’t due to chance?
• scientific consensus: p < .05
• = alpha (α), the level of significance
• What does a statistically significant outcome tell us?
– an outcome at p ≈ .05 has about a 50/50 chance of being repeated (at p < .05) in an exact replication (see the sketch below)
– as the probability of the outcome decreases (e.g., p = .025, p = .01), the likelihood of observing a statistically significant outcome (p < .05) in an exact replication increases
– APA recommends reporting the exact probability of the outcome
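One way to see the 50/50 replication point: if the true effect size were exactly the value that puts an observed result on the p = .05 boundary, the power of an exact replication works out to roughly .5. A sketch using statsmodels, assuming a two-group design with a hypothetical n = 30 per group:

```python
import numpy as np
from scipy import stats
from statsmodels.stats.power import TTestIndPower

n = 30            # hypothetical sample size per group
alpha = 0.05
df = 2 * n - 2

# Cohen's d that lands exactly on the two-sided p = .05 boundary:
# for equal groups, t = d * sqrt(n / 2), so d = t_crit * sqrt(2 / n)
t_crit = stats.t.ppf(1 - alpha / 2, df)
d_boundary = t_crit * np.sqrt(2 / n)

# If that observed d were the true effect, an exact replication's power is about .5
power = TTestIndPower().power(effect_size=d_boundary, nobs1=n, alpha=alpha)
print(f"d at the p = .05 boundary: {d_boundary:.3f}, replication power: {power:.2f}")
```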
10. Null Hypothesis Significance Testing
• What do we conclude when a finding is not statistically significant?
– do not reject the null hypothesis of no difference
– but do not accept the null hypothesis either
• don’t conclude that the IV didn’t produce an effect
– we cannot draw a conclusion about the effect of the IV
• some factor in the experiment may have prevented us from observing an effect of the IV
• most common factor: too few participants (illustrated below)
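To illustrate how too few participants can hide a real effect, the sketch below compares the power of a two-group t test at two sample sizes, assuming a hypothetical medium effect of d = 0.5.

```python
from statsmodels.stats.power import TTestIndPower

analysis = TTestIndPower()
d = 0.5        # hypothetical medium effect (Cohen's d)
alpha = 0.05

for n in (10, 64):   # per-group sample sizes, chosen for contrast
    power = analysis.power(effect_size=d, nobs1=n, alpha=alpha)
    print(f"n = {n:>3} per group -> power = {power:.2f}")
# With n = 10 the test usually misses the real effect (a Type II error);
# around n = 64 per group, power reaches the conventional .80.
```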
11. NHST Criticisms
• A difference between populations can almost always be found, given a large enough sample (illustrated below)
• A statistically significant finding may not be relevant in practice, while a true effect of practical significance may not appear statistically significant if the test lacks power
• Fairness of exclusion
• Publication bias and the file-drawer problem
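A quick sketch of the first criticism, assuming a trivially small hypothetical effect (d = 0.05): with enough participants, even a difference of no practical importance is detected nearly every time.

```python
from statsmodels.stats.power import TTestIndPower

d_tiny = 0.05    # hypothetical effect far below practical relevance
alpha = 0.05

for n in (100, 1_000, 20_000):   # per-group sample sizes
    power = TTestIndPower().power(effect_size=d_tiny, nobs1=n, alpha=alpha)
    print(f"n = {n:>6} per group -> power = {power:.2f}")
# Power climbs toward 1.0 as n grows, so the tiny difference becomes
# statistically significant even though it matters little in practice.
```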
12. Experimental Sensitivity and Power
• Sensitivity
– the likelihood that an experiment will detect the effect of an IV when the IV does, in fact, have an effect
• affected by the experiment’s methods and procedures
• sensitivity increases with good research design and methods
– a high degree of experimental control
– little opportunity for biases
13. Experimental Sensitivity and Power
• Power
– the likelihood that a statistical test will allow researchers to correctly reject H0
• low statistical power increases Type II errors (missing a real effect)
• Power = 1 − β, where β is the probability of a Type II error
• three factors affect the power of statistical tests (varied one at a time in the sketch below)
– level of significance (alpha)
– size of the effect of the IV
– sample size (N)
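A brief sketch of how each factor moves power, holding the other two fixed; the baseline values (d = 0.4, n = 50 per group, α = .05) are hypothetical.

```python
from statsmodels.stats.power import TTestIndPower

analysis = TTestIndPower()
# Hypothetical baseline: d = 0.4, n = 50 per group, alpha = .05
for alpha in (0.01, 0.05, 0.10):      # stricter alpha -> lower power
    p = analysis.power(effect_size=0.4, nobs1=50, alpha=alpha)
    print(f"alpha = {alpha:.2f} -> power = {p:.2f}")
for d in (0.2, 0.4, 0.8):             # larger effect -> higher power
    p = analysis.power(effect_size=d, nobs1=50, alpha=0.05)
    print(f"d     = {d:.1f}  -> power = {p:.2f}")
for n in (25, 50, 100):               # more participants -> higher power
    p = analysis.power(effect_size=0.4, nobs1=n, alpha=0.05)
    print(f"n     = {n:>3}  -> power = {p:.2f}")
```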
14. Experimental Sensitivity and Power
• Prospective Power Analysis
• step 1: estimate the effect size of the IV
– examine previous research involving the IV
• step 2: refer to “power tables”
– identify the sample size needed to observe the effect of the IV
• step 3: use an adequate sample size
– most studies in psychology are “underpowered” because of small sample sizes
• Retrospective Power Analysis
• determine the power of a completed study from its effect size, sample size, and significance level (both analyses are sketched below)
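Power tables have largely been replaced by software; a sketch of both analyses using statsmodels, where the effect size d = 0.5 (standing in for an estimate from previous research), the target power of .80, and the completed study's n = 20 per group are all hypothetical:

```python
from statsmodels.stats.power import TTestIndPower

analysis = TTestIndPower()

# Prospective: sample size needed per group for 80% power, given d from prior work
n_needed = analysis.solve_power(effect_size=0.5, power=0.80, alpha=0.05)
print(f"prospective: need about {n_needed:.0f} participants per group")

# Retrospective: power of a completed study that ran n = 20 per group
achieved = analysis.power(effect_size=0.5, nobs1=20, alpha=0.05)
print(f"retrospective: the study's power was only {achieved:.2f}")
```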