SlideShare una empresa de Scribd logo
1 de 12
Sample Size Considerations CARMA Internet Research Module Jeff Stanton
Key Considerations Sample size versus response rate – planning for the number of usable data points you will actually obtain Attrition – Repeated measures, panel designs, and diary studies all lose participants over time Statistical power – ability to draw inferences from the sample obtained Margin of error – to the extent that the resulting statistics must be projectable to the larger population
May 15-17, 2008 Internet Data Collection Methods (Day 2-3) Response Rate Reminder 70% 65% 60% 55% 50% 45% 40% 1975 1995 Academic Surveys
Hope for the best / Plan for the worst Try to achieve an 80% response rate Hope to achieve a 50% response rate Plan ahead for a 30% response rate Means you need to sample 1000 people to obtain a sample of 300
Bad Data Unproctored, anonymous self report instruments generally have a higher percentage of: Unusual outliers Missing data Carelessly entered data Intentionally sabotaged data Another aspect of dealing with nonresponse is to anticipate, prepare for, and deal with item level data losses
Attrition
The Best Articles on Statistical Power Cohen, J. (1992). "A power primer." Psychological bulletin 112(1): 155-159. Cohen, J. (1992). "Statistical power analysis." Current Directions in Psychological Science: 98-101. Kraemer, H. and S. Thiemann (1987). How many subjects?: Statistical power analysis in research, Sage Publications, Inc.
May 15-17, 2008 Internet Data Collection Methods (Day 2-8) Sample Size “Guestimates”(With apologies to Jacob Cohen)
May 15-17, 2008 Internet Data Collection Methods (Day 2-9) Estimating Effect Size(Also with apologies to Jacob Cohen) Mean differences, calibrated in standard deviations: Large = .8+, Medium = .5, Small = .2 Multiple regression, size of R-squared: Large =.35+, Medium = .15, Small = .02 Chi-square, calibrated in the difference between null and alternate population proportions: Large = . 50, Medium = .30, Small = .10
Margin of Error Generally represents only sampling error: Other sources of error will often make the margin much larger Assumes a large population, with no more than 5% drawn into the sample Margin of error is half the width of a confidence interval Straightforward calculation for a CI around a mean or a mean difference: generally about 1.96 standard errors CI around a proportion/percentage is more complex: Use 1.96 times this SE; works fine for even splits; can be a little funky for extreme proportions
Margin of Error Calculators http://www.raosoft.com/samplesize.html Trades off sample size and margin of error http://www.surveysystem.com/sscalc.htm Explains terminology http://faculty.vassar.edu/lowry/polls/calcs.html Various tools for assessing poll data http://glass.ed.asu.edu/stats/analysis/rci.html Confidence intervals for correlations http://www.stat.tamu.edu/~jhardin/applets/signed/case11.html Java-based applet
An Overall Sampling Plan Estimate the expected effect size for the most important tests you plan to conduct For inferential testing, use power estimation tools to plan sample size For projectability, use margin of error tools to plan sample size Take into account item level data loss due to bad data Take attrition into account for longitudinal designs Take overall response rate into account for all types of designs Determine overall initial sample size based on all of the factors listed above

Más contenido relacionado

La actualidad más candente

Simulating Patient Populations
Simulating Patient PopulationsSimulating Patient Populations
Simulating Patient PopulationsMichael Ghen
 
MLSEV Virtual. Evaluations
MLSEV Virtual. EvaluationsMLSEV Virtual. Evaluations
MLSEV Virtual. EvaluationsBigML, Inc
 
Spreadsheet Errors
Spreadsheet ErrorsSpreadsheet Errors
Spreadsheet Errorsktrout11
 
Spreadsheet Errors John Park
Spreadsheet  Errors  John ParkSpreadsheet  Errors  John Park
Spreadsheet Errors John ParkJohn Park
 
MLSEV Virtual. State of the Art in ML
MLSEV Virtual. State of the Art in MLMLSEV Virtual. State of the Art in ML
MLSEV Virtual. State of the Art in MLBigML, Inc
 
Statistics in the age of data science, issues you can not ignore
Statistics in the age of data science, issues you can not ignoreStatistics in the age of data science, issues you can not ignore
Statistics in the age of data science, issues you can not ignoreTuri, Inc.
 
MLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model SelectionMLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model SelectionBigML, Inc
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive StatisticsMatt Hansen
 
MLSEV Virtual. Searching for Anomalies
MLSEV Virtual. Searching for AnomaliesMLSEV Virtual. Searching for Anomalies
MLSEV Virtual. Searching for AnomaliesBigML, Inc
 
Stayer mat 510 final exam2
Stayer mat 510 final exam2Stayer mat 510 final exam2
Stayer mat 510 final exam2shyaminfo15
 
MLSEV Virtual. Supervised vs Unsupervised
MLSEV Virtual. Supervised vs UnsupervisedMLSEV Virtual. Supervised vs Unsupervised
MLSEV Virtual. Supervised vs UnsupervisedBigML, Inc
 
Stayer mat 510 final exam2
Stayer mat 510 final exam2Stayer mat 510 final exam2
Stayer mat 510 final exam2vikscarter
 
Dfmw Spreadsheet Errors Presentation Jake Carney
Dfmw Spreadsheet Errors Presentation Jake CarneyDfmw Spreadsheet Errors Presentation Jake Carney
Dfmw Spreadsheet Errors Presentation Jake CarneyJake Carney
 
10 things A.I. can do better than you
10 things A.I. can do better than you10 things A.I. can do better than you
10 things A.I. can do better than youJan Wiegelmann
 
A Pocket Guide in Machine Learning for Beginners
A Pocket Guide in Machine Learning for BeginnersA Pocket Guide in Machine Learning for Beginners
A Pocket Guide in Machine Learning for BeginnersRajat Gupta
 

La actualidad más candente (19)

Simulating Patient Populations
Simulating Patient PopulationsSimulating Patient Populations
Simulating Patient Populations
 
MLSEV Virtual. Evaluations
MLSEV Virtual. EvaluationsMLSEV Virtual. Evaluations
MLSEV Virtual. Evaluations
 
Spreadsheet Errors
Spreadsheet ErrorsSpreadsheet Errors
Spreadsheet Errors
 
Spreadsheet Errors John Park
Spreadsheet  Errors  John ParkSpreadsheet  Errors  John Park
Spreadsheet Errors John Park
 
MLSEV Virtual. State of the Art in ML
MLSEV Virtual. State of the Art in MLMLSEV Virtual. State of the Art in ML
MLSEV Virtual. State of the Art in ML
 
Statistics in the age of data science, issues you can not ignore
Statistics in the age of data science, issues you can not ignoreStatistics in the age of data science, issues you can not ignore
Statistics in the age of data science, issues you can not ignore
 
ICAR - IFPRI- Power Calculation
ICAR - IFPRI- Power CalculationICAR - IFPRI- Power Calculation
ICAR - IFPRI- Power Calculation
 
MLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model SelectionMLSEV Virtual. Automating Model Selection
MLSEV Virtual. Automating Model Selection
 
Descriptive Statistics
Descriptive StatisticsDescriptive Statistics
Descriptive Statistics
 
MLSEV Virtual. Searching for Anomalies
MLSEV Virtual. Searching for AnomaliesMLSEV Virtual. Searching for Anomalies
MLSEV Virtual. Searching for Anomalies
 
Stayer mat 510 final exam2
Stayer mat 510 final exam2Stayer mat 510 final exam2
Stayer mat 510 final exam2
 
MLSEV Virtual. Supervised vs Unsupervised
MLSEV Virtual. Supervised vs UnsupervisedMLSEV Virtual. Supervised vs Unsupervised
MLSEV Virtual. Supervised vs Unsupervised
 
Stayer mat 510 final exam2
Stayer mat 510 final exam2Stayer mat 510 final exam2
Stayer mat 510 final exam2
 
Medical data diagnosis
Medical data diagnosisMedical data diagnosis
Medical data diagnosis
 
07 Thompson, James R.
07 Thompson, James R.07 Thompson, James R.
07 Thompson, James R.
 
Dfmw Spreadsheet Errors Presentation Jake Carney
Dfmw Spreadsheet Errors Presentation Jake CarneyDfmw Spreadsheet Errors Presentation Jake Carney
Dfmw Spreadsheet Errors Presentation Jake Carney
 
10 things A.I. can do better than you
10 things A.I. can do better than you10 things A.I. can do better than you
10 things A.I. can do better than you
 
Lecture 5
Lecture 5Lecture 5
Lecture 5
 
A Pocket Guide in Machine Learning for Beginners
A Pocket Guide in Machine Learning for BeginnersA Pocket Guide in Machine Learning for Beginners
A Pocket Guide in Machine Learning for Beginners
 

Destacado

Introduction to Sample size decision
Introduction to Sample size decision Introduction to Sample size decision
Introduction to Sample size decision Hiraj Ramachandran
 
Sample size calculation - a brief overview
Sample size calculation - a brief overviewSample size calculation - a brief overview
Sample size calculation - a brief overviewAzmi Mohd Tamil
 
Phenomenological research
Phenomenological researchPhenomenological research
Phenomenological researchclarombe
 
03 phenomenology
03   phenomenology03   phenomenology
03 phenomenologyPeter Miles
 
Sample size
Sample sizeSample size
Sample sizezubis
 
Collecting Qualitative Data
Collecting Qualitative DataCollecting Qualitative Data
Collecting Qualitative Datahighness85
 
Sampling methods PPT
Sampling methods PPTSampling methods PPT
Sampling methods PPTVijay Mehta
 
Sample and sampling techniques
Sample and sampling techniquesSample and sampling techniques
Sample and sampling techniquesNursing Path
 
Sampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative ResearchSampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative ResearchSam Ladner
 
RESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLINGRESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLINGHafizah Hajimia
 

Destacado (13)

Introduction to Sample size decision
Introduction to Sample size decision Introduction to Sample size decision
Introduction to Sample size decision
 
Determining the Sample Size
Determining the Sample SizeDetermining the Sample Size
Determining the Sample Size
 
Sample size calculation - a brief overview
Sample size calculation - a brief overviewSample size calculation - a brief overview
Sample size calculation - a brief overview
 
Phenomenological research
Phenomenological researchPhenomenological research
Phenomenological research
 
03 phenomenology
03   phenomenology03   phenomenology
03 phenomenology
 
Sample size calculation
Sample size calculationSample size calculation
Sample size calculation
 
Sample size
Sample sizeSample size
Sample size
 
Collecting Qualitative Data
Collecting Qualitative DataCollecting Qualitative Data
Collecting Qualitative Data
 
Sampling methods PPT
Sampling methods PPTSampling methods PPT
Sampling methods PPT
 
Sample and sampling techniques
Sample and sampling techniquesSample and sampling techniques
Sample and sampling techniques
 
Sampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative ResearchSampling Methods in Qualitative and Quantitative Research
Sampling Methods in Qualitative and Quantitative Research
 
sampling ppt
sampling pptsampling ppt
sampling ppt
 
RESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLINGRESEARCH METHOD - SAMPLING
RESEARCH METHOD - SAMPLING
 

Similar a Carma internet research module sample size considerations

Estimation of sample size
Estimation of sample sizeEstimation of sample size
Estimation of sample sizeshyana
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample SizeMatt Hansen
 
Sampling
SamplingSampling
Samplingal amin
 
Pertemuan 3 & 4 - Pengendalian Mutu Statistik.pptx
Pertemuan 3 & 4 - Pengendalian Mutu Statistik.pptxPertemuan 3 & 4 - Pengendalian Mutu Statistik.pptx
Pertemuan 3 & 4 - Pengendalian Mutu Statistik.pptxgigol12808
 
26738157 sampling-design
26738157 sampling-design26738157 sampling-design
26738157 sampling-designMounzer BOUBOU
 
Measurement
MeasurementMeasurement
Measurementwilsone
 
Statistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxStatistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxdarwinming1
 
Research Methods in Marketing
Research Methods in MarketingResearch Methods in Marketing
Research Methods in MarketingVartika Kundu
 
Missing data and non response pdf
Missing data and non response pdfMissing data and non response pdf
Missing data and non response pdfAnuj Bhatia
 
audit sampling notes
audit sampling notesaudit sampling notes
audit sampling notesstudent
 
Sampling and statistical inference
Sampling and statistical inferenceSampling and statistical inference
Sampling and statistical inferenceBhavik A Shah
 
Findings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport WritingFindings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport WritingShainaBoling829
 
T8 audit sampling
T8 audit samplingT8 audit sampling
T8 audit samplingnamninh
 

Similar a Carma internet research module sample size considerations (20)

Sampling Technique
Sampling TechniqueSampling Technique
Sampling Technique
 
Sampling
SamplingSampling
Sampling
 
Estimation of sample size
Estimation of sample sizeEstimation of sample size
Estimation of sample size
 
Calculating a Sample Size
Calculating a Sample SizeCalculating a Sample Size
Calculating a Sample Size
 
Sampling
SamplingSampling
Sampling
 
Sampling
SamplingSampling
Sampling
 
Pertemuan 3 & 4 - Pengendalian Mutu Statistik.pptx
Pertemuan 3 & 4 - Pengendalian Mutu Statistik.pptxPertemuan 3 & 4 - Pengendalian Mutu Statistik.pptx
Pertemuan 3 & 4 - Pengendalian Mutu Statistik.pptx
 
26738157 sampling-design
26738157 sampling-design26738157 sampling-design
26738157 sampling-design
 
Sampling methods in medical research
Sampling methods in medical researchSampling methods in medical research
Sampling methods in medical research
 
CH 3 Sampling (3).pptx.ppt
CH 3 Sampling (3).pptx.pptCH 3 Sampling (3).pptx.ppt
CH 3 Sampling (3).pptx.ppt
 
Measurement
MeasurementMeasurement
Measurement
 
Chapter_05.ppt
Chapter_05.pptChapter_05.ppt
Chapter_05.ppt
 
Statistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docxStatistical ProcessesCan descriptive statistical processes b.docx
Statistical ProcessesCan descriptive statistical processes b.docx
 
Research Methods in Marketing
Research Methods in MarketingResearch Methods in Marketing
Research Methods in Marketing
 
Missing data and non response pdf
Missing data and non response pdfMissing data and non response pdf
Missing data and non response pdf
 
audit sampling notes
audit sampling notesaudit sampling notes
audit sampling notes
 
Sampling and statistical inference
Sampling and statistical inferenceSampling and statistical inference
Sampling and statistical inference
 
Findings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport WritingFindings, Conclusions, & RecommendationsReport Writing
Findings, Conclusions, & RecommendationsReport Writing
 
T8 audit sampling
T8 audit samplingT8 audit sampling
T8 audit sampling
 
Sampling brm chap-4
Sampling brm chap-4Sampling brm chap-4
Sampling brm chap-4
 

Más de Syracuse University

Basic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultyBasic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultySyracuse University
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformSyracuse University
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale developmentSyracuse University
 
Carma internet research module getting started with question pro
Carma internet research module   getting started with question proCarma internet research module   getting started with question pro
Carma internet research module getting started with question proSyracuse University
 
Carma internet research module visual design issues
Carma internet research module   visual design issuesCarma internet research module   visual design issues
Carma internet research module visual design issuesSyracuse University
 
Introduction to Advance Analytics Course
Introduction to Advance Analytics CourseIntroduction to Advance Analytics Course
Introduction to Advance Analytics CourseSyracuse University
 
Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Syracuse University
 

Más de Syracuse University (20)

Discovery informaticsstanton
Discovery informaticsstantonDiscovery informaticsstanton
Discovery informaticsstanton
 
Basic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University FacultyBasic SEVIS Overview for U.S. University Faculty
Basic SEVIS Overview for U.S. University Faculty
 
Why R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics PlatformWhy R? A Brief Introduction to the Open Source Statistics Platform
Why R? A Brief Introduction to the Open Source Statistics Platform
 
Chapter9 r studio2
Chapter9 r studio2Chapter9 r studio2
Chapter9 r studio2
 
Basic Overview of Data Mining
Basic Overview of Data MiningBasic Overview of Data Mining
Basic Overview of Data Mining
 
Strategic planning
Strategic planningStrategic planning
Strategic planning
 
Carma internet research module scale development
Carma internet research module   scale developmentCarma internet research module   scale development
Carma internet research module scale development
 
Carma internet research module getting started with question pro
Carma internet research module   getting started with question proCarma internet research module   getting started with question pro
Carma internet research module getting started with question pro
 
Carma internet research module visual design issues
Carma internet research module   visual design issuesCarma internet research module   visual design issues
Carma internet research module visual design issues
 
Siop impact of social media
Siop impact of social mediaSiop impact of social media
Siop impact of social media
 
Basic Graphics with R
Basic Graphics with RBasic Graphics with R
Basic Graphics with R
 
R-Studio Vs. Rcmdr
R-Studio Vs. RcmdrR-Studio Vs. Rcmdr
R-Studio Vs. Rcmdr
 
Getting Started with R
Getting Started with RGetting Started with R
Getting Started with R
 
Moving Data to and From R
Moving Data to and From RMoving Data to and From R
Moving Data to and From R
 
Introduction to Advance Analytics Course
Introduction to Advance Analytics CourseIntroduction to Advance Analytics Course
Introduction to Advance Analytics Course
 
Installing R and R-Studio
Installing R and R-StudioInstalling R and R-Studio
Installing R and R-Studio
 
Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)Mining tweets for security information (rev 2)
Mining tweets for security information (rev 2)
 
What is Data Science
What is Data ScienceWhat is Data Science
What is Data Science
 
Reducing Response Burden
Reducing Response BurdenReducing Response Burden
Reducing Response Burden
 
PACIS Survey Workshop
PACIS Survey WorkshopPACIS Survey Workshop
PACIS Survey Workshop
 

Carma internet research module sample size considerations

  • 1. Sample Size Considerations CARMA Internet Research Module Jeff Stanton
  • 2. Key Considerations Sample size versus response rate – planning for the number of usable data points you will actually obtain Attrition – Repeated measures, panel designs, and diary studies all lose participants over time Statistical power – ability to draw inferences from the sample obtained Margin of error – to the extent that the resulting statistics must be projectable to the larger population
  • 3. May 15-17, 2008 Internet Data Collection Methods (Day 2-3) Response Rate Reminder 70% 65% 60% 55% 50% 45% 40% 1975 1995 Academic Surveys
  • 4. Hope for the best / Plan for the worst Try to achieve an 80% response rate Hope to achieve a 50% response rate Plan ahead for a 30% response rate Means you need to sample 1000 people to obtain a sample of 300
  • 5. Bad Data Unproctored, anonymous self report instruments generally have a higher percentage of: Unusual outliers Missing data Carelessly entered data Intentionally sabotaged data Another aspect of dealing with nonresponse is to anticipate, prepare for, and deal with item level data losses
  • 7. The Best Articles on Statistical Power Cohen, J. (1992). "A power primer." Psychological bulletin 112(1): 155-159. Cohen, J. (1992). "Statistical power analysis." Current Directions in Psychological Science: 98-101. Kraemer, H. and S. Thiemann (1987). How many subjects?: Statistical power analysis in research, Sage Publications, Inc.
  • 8. May 15-17, 2008 Internet Data Collection Methods (Day 2-8) Sample Size “Guestimates”(With apologies to Jacob Cohen)
  • 9. May 15-17, 2008 Internet Data Collection Methods (Day 2-9) Estimating Effect Size(Also with apologies to Jacob Cohen) Mean differences, calibrated in standard deviations: Large = .8+, Medium = .5, Small = .2 Multiple regression, size of R-squared: Large =.35+, Medium = .15, Small = .02 Chi-square, calibrated in the difference between null and alternate population proportions: Large = . 50, Medium = .30, Small = .10
  • 10. Margin of Error Generally represents only sampling error: Other sources of error will often make the margin much larger Assumes a large population, with no more than 5% drawn into the sample Margin of error is half the width of a confidence interval Straightforward calculation for a CI around a mean or a mean difference: generally about 1.96 standard errors CI around a proportion/percentage is more complex: Use 1.96 times this SE; works fine for even splits; can be a little funky for extreme proportions
  • 11. Margin of Error Calculators http://www.raosoft.com/samplesize.html Trades off sample size and margin of error http://www.surveysystem.com/sscalc.htm Explains terminology http://faculty.vassar.edu/lowry/polls/calcs.html Various tools for assessing poll data http://glass.ed.asu.edu/stats/analysis/rci.html Confidence intervals for correlations http://www.stat.tamu.edu/~jhardin/applets/signed/case11.html Java-based applet
  • 12. An Overall Sampling Plan Estimate the expected effect size for the most important tests you plan to conduct For inferential testing, use power estimation tools to plan sample size For projectability, use margin of error tools to plan sample size Take into account item level data loss due to bad data Take attrition into account for longitudinal designs Take overall response rate into account for all types of designs Determine overall initial sample size based on all of the factors listed above