Psychometrics 101: Know what your assessment data is telling you
Eric Ermie – Director of Client Solutions, ExamSoft
(Formerly) Program Manager for Assessment and Evaluation,
The Ohio State University College of Medicine.
AGENDA
• Overview
• Types of stats
• Interpreting the item analysis report
• Examples
• General statistical guidelines
THE OVERVIEW
Where do I start?
Is this a good or bad question? Can statistics even tell me that?
How can I reconcile what I know about my assessment’s past with what the data is telling me?
Item analysis is not a foolproof answer to these questions.
But… YOU HAVE TO START SOMEWHERE.
TYPES OF STATS
Common Stats:
• Item Difficulty/p Value - the proportion of students who got the item correct, expressed as a decimal. The lower the decimal, the higher the difficulty.
• Upper 27% - the percentage of the top 27% of performers who got the question correct.
• Lower 27% - the percentage of the bottom 27% of performers who got the question correct.
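The three statistics above can be sketched in a few lines of Python. This is a minimal illustration, not how any particular reporting tool computes them: the function names, the rounding of the 27% group size, the handling of tied scores, and the ten-student data set are all assumptions made for the example.

```python
def item_difficulty(item_scores):
    """Return the p value: the fraction of students who got the item correct."""
    return sum(item_scores) / len(item_scores)


def group_percent_correct(item_scores, total_scores, top=True, fraction=0.27):
    """Percent correct on one item within the top (or bottom) 27% by total score."""
    # Assumption: group size is 27% of the class, rounded, with at least one student.
    size = max(1, round(len(total_scores) * fraction))
    # Rank student indices by total exam score; tied students keep their original order.
    ranked = sorted(range(len(total_scores)),
                    key=lambda i: total_scores[i], reverse=top)
    group = ranked[:size]
    return 100.0 * sum(item_scores[i] for i in group) / size


# Made-up data for ten students: total exam scores and 0/1 scores on one item.
total_scores = [95, 90, 88, 80, 75, 70, 65, 60, 50, 40]
item_scores = [1, 1, 1, 1, 0, 1, 0, 1, 0, 0]

p_value = item_difficulty(item_scores)
upper_27 = group_percent_correct(item_scores, total_scores, top=True)
lower_27 = group_percent_correct(item_scores, total_scores, top=False)
print(p_value, upper_27, round(lower_27, 2))
```

With only ten students the 27% groups hold three students each, so the percentages move in large steps; real reports are based on much larger classes.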
Common Stats Cont’d:
• Discrimination index – the difference in performance between the Upper 27% and the Lower 27%.
• Point-Biserial – a discrimination statistic that indicates whether doing well on that specific item correlated with doing well on the exam overall. In other words, was that item a good or bad predictor of overall performance on the exam?
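Both discrimination statistics can be sketched as follows. The point-biserial here uses the standard textbook form (difference between the mean total score of students who got the item right and those who got it wrong, divided by the population standard deviation of total scores, times the square root of p times q). Whether a given report excludes the item from the total score is not stated in these slides, so the uncorrected version is an assumption, as are the function names and the sample data.

```python
from math import sqrt


def discrimination_index(upper_pct, lower_pct):
    """Upper 27% percent correct minus Lower 27% percent correct, as a decimal."""
    return (upper_pct - lower_pct) / 100.0


def point_biserial(item_scores, total_scores):
    """Correlation between a 0/1 item score and the total exam score."""
    n = len(item_scores)
    p = sum(item_scores) / n          # proportion correct (item difficulty)
    q = 1.0 - p
    mean_all = sum(total_scores) / n
    # Population standard deviation of the total scores.
    sd = sqrt(sum((t - mean_all) ** 2 for t in total_scores) / n)
    if p == 0.0 or p == 1.0 or sd == 0.0:
        return None                   # undefined when there is no variation
    right = [t for t, s in zip(total_scores, item_scores) if s == 1]
    wrong = [t for t, s in zip(total_scores, item_scores) if s == 0]
    mean_right = sum(right) / len(right)
    mean_wrong = sum(wrong) / len(wrong)
    return (mean_right - mean_wrong) / sd * sqrt(p * q)


# Made-up data for ten students (assumption, for illustration only).
total_scores = [95, 90, 88, 80, 75, 70, 65, 60, 50, 40]
item_scores = [1, 1, 1, 1, 0, 1, 0, 1, 0, 0]

rpb = point_biserial(item_scores, total_scores)
disc = discrimination_index(82.00, 46.15)   # Upper and Lower values of an example item
print(round(rpb, 2), round(disc, 2))
```

The discrimination index only looks at the two extreme groups, while the point-biserial uses every student's score, which is why the two numbers usually agree in direction but not in size.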
ITEM ANALYSIS REPORT
But with any statistic, it is important to remember that context matters!
ITEM ANALYSIS EXAMPLES
Item # 1 (Correct Answer: E)
Correct Responses: Diff(p) 0.98, Upper 100.00%, Lower 96.15%
Disc. Index: 0.04
Point Biserial: 0.10
Response Frequencies (*Indicates correct answer)
                          A       B       C       D       E
Count                     0       1       0       1    *178
% Selected             0.00    0.55    0.00    0.55   98.34
Point Biserial (rpb)   0.00    0.02    0.00   -0.10    0.10
Disc. Index            0.00    0.00    0.00   -0.02    0.02
Upper 27%              0.00    0.00    0.00    0.00    1.00
Lower 27%              0.00    0.00    0.00    0.02    0.98
ITEM ANALYSIS EXAMPLES
Item # 7 (Correct Answer: D)
Correct Responses: Diff(p) 0.66, Upper 82.00%, Lower 46.15%
Disc. Index: 0.36
Point Biserial: 0.28
Response Frequencies (*Indicates correct answer)
                          A       B       C       D       E
Count                     7      17      28    *120       9
% Selected             3.87    9.39   15.47   66.30    4.97
Point Biserial (rpb)  -0.11   -0.19   -0.12    0.28   -0.07
Disc. Index           -0.04   -0.19   -0.09    0.36   -0.04
Upper 27%              0.00    0.00    0.12    0.82    0.06
Lower 27%              0.04    0.19    0.21    0.46    0.10
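As a quick check on how to read the item 7 report, the Disc. Index for each answer option should equal that option's Upper 27% value minus its Lower 27% value. The short sketch below redoes that subtraction with the figures reported for item 7; the dictionary names are illustrative only.

```python
# Fraction of each group choosing each option on item 7 (D is the correct answer).
upper_27 = {"A": 0.00, "B": 0.00, "C": 0.12, "D": 0.82, "E": 0.06}
lower_27 = {"A": 0.04, "B": 0.19, "C": 0.21, "D": 0.46, "E": 0.10}

# Discrimination index per option: upper group minus lower group.
disc_index = {opt: round(upper_27[opt] - lower_27[opt], 2) for opt in upper_27}

for opt, value in disc_index.items():
    print(opt, value)
```

The correct option (D) comes out positive and every distractor comes out negative, which is the pattern a well-behaved discriminating item is expected to show: stronger students choose the key, weaker students are drawn to the distractors.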
ITEM ANALYSIS EXAMPLES
Item # 22 (Correct Answer: D)
Correct Responses: Diff(p) 0.36, Upper 52.00%, Lower 26.92%
Disc. Index: 0.25
Point Biserial: 0.22
Response Frequencies (*Indicates correct answer)
                          A       B       C       D       E
Count                    35      34      21     *66      25
% Selected            19.34   18.78   11.60   36.46   13.81
Point Biserial (rpb)  -0.09    0.04   -0.20    0.22   -0.06
Disc. Index           -0.15    0.07   -0.15    0.25   -0.02
Upper 27%              0.10    0.24    0.04    0.52    0.10
Lower 27%              0.25    0.17    0.19    0.27    0.12
ITEM ANALYSIS EXAMPLES
Item # 82 (Correct Answer: D)
Correct Responses: Diff(p) 0.55, Upper 25.00%, Lower 82.50%
Disc. Index: -0.57
Point Biserial: -0.43
Response Frequencies (*Indicates correct answer)
                          A       B       C       D       E
Count                     7      17      28    *120       9
% Selected             3.87    9.39   37.54   55.00    7.46
Point Biserial (rpb)  -0.11   -0.19   -0.12   -0.43    0.00
Disc. Index           -0.04   -0.19   -0.09   -0.57    0.00
Upper 27%              0.00    0.00    0.75    0.25    0.00
Lower 27%              0.00    0.00    0.17    0.83    0.00
ITEM ANALYSIS EXAMPLES
Item # 24 (Correct Answer: C)
Correct Responses: Diff(p) 0.52, Upper 64.00%, Lower 42.31%
Disc. Index: 0.22
Point Biserial: 0.18
Response Frequencies (*Indicates correct answer)
                          A       B       C       D       E
Count                    61      21     *94       5       0
% Selected            33.70   11.60   51.93    2.76    0.00
Point Biserial (rpb)  -0.10   -0.19    0.18    0.12    0.00
Disc. Index           -0.12   -0.13    0.22    0.04    0.00
Upper 27%              0.26    0.04    0.64    0.06    0.00
Lower 27%              0.38    0.17    0.42    0.02    0.00
ITEM ANALYSIS EXAMPLES
Item # 34 (Correct Answer: B)
Correct Responses: Diff(p) 0.71, Upper 90.00%, Lower 55.77%
Disc. Index: 0.34
Point Biserial: 0.31
Response Frequencies (*Indicates correct answer)
                          A       B       C       D       E
Count                     0    *129       1      30      21
% Selected             0.00   71.27    0.55   16.57   11.60
Point Biserial (rpb)   0.00    0.31   -0.16   -0.25   -0.11
Disc. Index            0.00    0.34   -0.02   -0.23   -0.09
Upper 27%              0.00    0.90    0.00    0.06    0.04
Lower 27%              0.00    0.56    0.02    0.29    0.13
GENERAL GUIDELINES
Desired statistical ranges - opinions differ, but the most commonly used are:
• Item Difficulty/p Value - Acceptable item difficulty is not a set number; it depends on the intention of the question. If you intended the item to be a mastery item, you want the difficulty as close to 1.00 as possible. If you wanted a discriminating question, significantly lower values are acceptable.
• Upper 27% - If less than 60% of your top performers get a question correct, further analysis is needed to see whether there are issues with the question. There is also an issue if fewer of your upper 27% than your lower 27% get a question correct.
• Lower 27% - Generally you never want it to be higher than the upper 27%. As low as 0% can be acceptable, and as high as 100% can be acceptable if it is a mastery question.
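The two Upper/Lower rules of thumb above are easy to automate as a first-pass screen. The sketch below is one possible way to do it; the function name and flag wording are made up for illustration, and a flag only means "look at this item more closely", not "this item is bad".

```python
def review_flags(upper_pct, lower_pct):
    """Return reasons an item deserves a closer look, based on the 27% groups."""
    flags = []
    if upper_pct < 60.0:
        flags.append("fewer than 60% of top performers answered correctly")
    if upper_pct < lower_pct:
        flags.append("upper 27% did worse than lower 27%")
    return flags


# Upper 27% and Lower 27% percent-correct values from three of the example items.
examples = {7: (82.00, 46.15), 22: (52.00, 26.92), 82: (25.00, 82.50)}
results = {item: review_flags(upper, lower) for item, (upper, lower) in examples.items()}

for item, flags in results.items():
    print(item, flags if flags else "no flags")
```

Item difficulty is deliberately left out of the screen, since the guideline ties the acceptable p value to the intention of the question rather than to a fixed cutoff.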
GENERAL GUIDELINES
Desired statistical ranges - opinions differ, but the most commonly used are:
• Discrimination index – Some set specific numbers for acceptable and unacceptable values. I would argue the more accurate guide is that the lower the p value, the higher the discrimination index needs to be.
Generally, at 0.2 or above the item is considered to have discriminated; less than that is considered no discrimination. 0.3 or greater is considered highly discriminating.
• Point-Biserial – As with the discrimination index, some set specific numbers for acceptable and unacceptable values.
Generally, 0.2 and above is considered to show discrimination and a positive association with overall performance on the assessment. Lower values are acceptable for mastery questions, and 0.3+ would be desired for discriminating questions.
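The 0.2 and 0.3 cutoffs above can be written as a small labelling helper. This is only a sketch of the rule of thumb as stated here; the function name and label text are assumptions, and, as noted, opinions on the exact cutoffs differ.

```python
def discrimination_label(value):
    """Label a discrimination index or point-biserial using the 0.2 / 0.3 rule of thumb."""
    if value >= 0.3:
        return "highly discriminating"
    if value >= 0.2:
        return "discriminating"
    return "no discrimination"


# Overall Disc. Index values from four of the example items.
example_values = {1: 0.04, 7: 0.36, 22: 0.25, 82: -0.57}
labels = {item: discrimination_label(v) for item, v in example_values.items()}

for item, label in labels.items():
    print(item, label)
```

A "no discrimination" label is not automatically a problem: item 1 is answered correctly by almost everyone, which is what a mastery question is supposed to look like, whereas the negative value on item 82 is the kind of result that calls for review.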
GENERAL GUIDELINES
KR-20
Used as an overall measure of reliability for the assessment.
Typically interpreted on a scale from 0.0 to 1.0, with 0.0 being very poor and 1.0 being excellent.
Quick notes:
• Heavily influenced by the number of questions in the assessment
• Heavily influenced by the number of students taking the assessment
• The combination can FREQUENTLY lead to false positive and false negative KR-20 values.
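For reference, KR-20 (Kuder-Richardson Formula 20) is computed from the number of items k, each item's proportion correct p and incorrect q, and the variance of students' total scores: k / (k - 1) × (1 - Σpq / variance). The sketch below is a minimal version under stated assumptions: it uses the population variance (some tools use the sample variance instead), and the function name and four-student data set are invented for illustration.

```python
def kr20(responses):
    """KR-20 reliability for a list of student rows, each a list of 0/1 item scores."""
    n = len(responses)        # number of students
    k = len(responses[0])     # number of items
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    # Assumption: population variance of total scores.
    variance = sum((t - mean_total) ** 2 for t in totals) / n
    if k < 2 or variance == 0:
        return None           # not defined for a single item or identical totals
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n   # proportion correct on item j
        sum_pq += p * (1.0 - p)
    return k / (k - 1) * (1.0 - sum_pq / variance)


# Made-up responses: four students, three items.
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
reliability = kr20(responses)
print(reliability)
```

Because both k and the score variance appear directly in the formula, a short quiz or a small class can push the result sharply up or down, which is the sensitivity the quick notes warn about.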
EXTRANEOUS FACTORS
Stats alone do not tell the whole story:
• Student behavior
– Cheating
– Return on investment
• Conflicting content/faculty
• “six degrees from Sunday”
Ways to increase the accuracy/usefulness of your stats:
• Item review process
– Format
– Level of difficulty
– Alternative correct options
• Historical item analysis
– Across assessments
– Across versions
• Reuse/Recycle
WHERE DO WE FIT IN?
• Simplified and detailed versions of item analysis reports
• Historical item analysis data by version, assessment and in aggregate
• Ability to pull item analysis by discipline/question author/category
EXAMSOFT
FIT
THE DATA YOU NEED
Thank you for attending!
• Check our resource library at resources.examsoft.com to re-watch the webinar, download a PDF of the presentation, or access a certificate of completion.
• Be sure to check out our upcoming webinars:
– Creating a Secure Testing Environment for Distance Education Programs
– Learning about the Learners: Using Analytical Tools to Drive Curricular Decisions
Questions?
For More Information:
Call: 1.866.429.8889
Email: info@examsoft.com
Visit: learn.examsoft.com

Psychometrics 101: Know What Your Assessment Data is Telling You

  • 1. Psychometrics 101: Know what your assessment data is telling you Eric Ermie – Director of Client Solutions, ExamSoft (Formerly) Program Manager for Assessment and Evaluation, The Ohio State University College of Medicine.
  • 2. AGENDA • Overview • Types of stats • Interpreting the item analysis report • Examples • General statistical guidelines
  • 3. How can I reconcile what I know about my assessment’s past with what the data is telling me? Item analysis is not a fool proof answer to these questions. But… THE OVERVIEW YOU HAVE TO START SOMEWHERE. Where do I start? Is this a good or bad question? Can statistics even tell me that?
  • 4. TYPES OF STATS Common Stats: • Item Difficulty/p Value- decimal representation of difficulty using the percentage of students who got the item correct. The lower the decimal the higher the difficulty • Upper 27% - what percentage of the top 27% of performers got the question correct • Lower 27% - what percentage of the bottom 27% of performers got the question correct. Common Stats Cont’d: • Discrimination index – the difference in performance between the Upper 27% and the Lower 27% • Point-Biserial- a discrimination statistic that indicates whether doing well on that specific item correlated with doing well on the exam overall. Thus was that item a good or bad predictor of overall performance on the exam.
  • 6. But with any statistic it is important to remember context matters!
  • 7. ITEM ANALYSIS EXAMPLES
    Item #   Diff(p)   Upper     Lower     Disc. Index   Point Biserial   Correct Answer
    1        0.98      100.00%   96.15%    0.04          0.10             E
    Response Frequencies (*Indicates correct answer)
                                 A       B       C       D       E
    Responses                    0       1       0       1    *178
    % Selected                0.00    0.55    0.00    0.55   98.34
    Point Biserial (rpb)      0.00    0.02    0.00   -0.10    0.10
    Disc. Index               0.00    0.00    0.00   -0.02    0.02
    Upper 27%                 0.00    0.00    0.00    0.00    1.00
    Lower 27%                 0.00    0.00    0.00    0.02    0.98
  • 8. ITEM ANALYSIS EXAMPLES
    Item #   Diff(p)   Upper     Lower     Disc. Index   Point Biserial   Correct Answer
    7        0.66      82.00%    46.15%    0.36          0.28             D
    Response Frequencies (*Indicates correct answer)
                                 A       B       C       D       E
    Responses                    7      17      28    *120       9
    % Selected                3.87    9.39   15.47   66.30    4.97
    Point Biserial (rpb)     -0.11   -0.19   -0.12    0.28   -0.07
    Disc. Index              -0.04   -0.19   -0.09    0.36   -0.04
    Upper 27%                 0.00    0.00    0.12    0.82    0.06
    Lower 27%                 0.04    0.19    0.21    0.46    0.10
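As a quick check on how the option-level rows of the item 7 example relate to each other, the sketch below recomputes "% Selected" from the response frequencies and the per-option discrimination index as Upper 27% minus Lower 27%. The numbers are transcribed from the slide; the variable names are illustrative only.

```python
# Values transcribed from the item 7 example (correct answer: D).
frequencies = {"A": 7, "B": 17, "C": 28, "D": 120, "E": 9}
upper_27 = {"A": 0.00, "B": 0.00, "C": 0.12, "D": 0.82, "E": 0.06}
lower_27 = {"A": 0.04, "B": 0.19, "C": 0.21, "D": 0.46, "E": 0.10}

total = sum(frequencies.values())  # number of students who answered the item

# % Selected: share of all respondents choosing each option.
pct_selected = {opt: round(100 * n / total, 2) for opt, n in frequencies.items()}

# Per-option discrimination index: upper-group share minus lower-group share.
disc_index = {opt: round(upper_27[opt] - lower_27[opt], 2) for opt in frequencies}

for opt in frequencies:
    print(opt, pct_selected[opt], disc_index[opt])
```

A negative index on a distractor means more low performers than high performers chose it, which is the behavior you want from a wrong option.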
  • 9. ITEM ANALYSIS EXAMPLES
    Item #   Diff(p)   Upper     Lower     Disc. Index   Point Biserial   Correct Answer
    22       0.36      52.00%    26.92%    0.25          0.22             D
    Response Frequencies (*Indicates correct answer)
                                 A       B       C       D       E
    Responses                   35      34      21     *66      25
    % Selected               19.34   18.78   11.60   36.46   13.81
    Point Biserial (rpb)     -0.09    0.04   -0.20    0.22   -0.06
    Disc. Index              -0.15    0.07   -0.15    0.25   -0.02
    Upper 27%                 0.10    0.24    0.04    0.52    0.10
    Lower 27%                 0.25    0.17    0.19    0.27    0.12
  • 10. ITEM ANALYSIS EXAMPLES
    Item #   Diff(p)   Upper     Lower     Disc. Index   Point Biserial   Correct Answer
    82       0.55      25.00%    82.50%    -0.57         -0.43            D
    Response Frequencies (*Indicates correct answer)
                                 A       B       C       D       E
    Responses                    7      17      28    *120       9
    % Selected                3.87    9.39   37.54   55.00    7.46
    Point Biserial (rpb)     -0.11   -0.19   -0.12   -0.43    0.00
    Disc. Index              -0.04   -0.19   -0.09   -0.57    0.00
    Upper 27%                 0.00    0.00    0.75    0.25    0.00
    Lower 27%                 0.00    0.00    0.17    0.83    0.00
  • 11. ITEM ANALYSIS EXAMPLES
    Item #   Diff(p)   Upper     Lower     Disc. Index   Point Biserial   Correct Answer
    24       0.52      64.00%    42.31%    0.22          0.18             C
    Response Frequencies (*Indicates correct answer)
                                 A       B       C       D       E
    Responses                   61      21     *94       5       0
    % Selected               33.70   11.60   51.93    2.76    0.00
    Point Biserial (rpb)     -0.10   -0.19    0.18    0.12    0.00
    Disc. Index              -0.12   -0.13    0.22    0.04    0.00
    Upper 27%                 0.26    0.04    0.64    0.06    0.00
    Lower 27%                 0.38    0.17    0.42    0.02    0.00
  • 12. ITEM ANALYSIS EXAMPLES
    Item #   Diff(p)   Upper     Lower     Disc. Index   Point Biserial   Correct Answer
    34       0.71      90.00%    55.77%    0.34          0.31             B
    Response Frequencies (*Indicates correct answer)
                                 A       B       C       D       E
    Responses                    0    *129       1      30      21
    % Selected                0.00   71.27    0.55   16.57   11.60
    Point Biserial (rpb)      0.00    0.31   -0.16   -0.25   -0.11
    Disc. Index               0.00    0.34   -0.02   -0.23   -0.09
    Upper 27%                 0.00    0.90    0.00    0.06    0.04
    Lower 27%                 0.00    0.56    0.02    0.29    0.13
  • 13. GENERAL GUIDELINES Desired statistical ranges - opinions differ, but the most commonly used are: • Item Difficulty/p Value - Acceptable item difficulty is not a set number; it depends on the intention of the question. If you intended the item to be a mastery item, you want the difficulty as close to 1.00 as possible. If you wanted a discriminating question, significantly lower values are acceptable. • Upper 27% - if less than 60% of your top performers get a question correct, further analysis is needed to see whether there are issues with the question. There is also an issue if fewer of your upper 27% than your lower 27% get a question correct. • Lower 27% - generally you never want it to be higher than the upper 27%. As low as 0% can be acceptable, and as high as 100% can be acceptable if it is a mastery question.
  • 14. GENERAL GUIDELINES Desired statistical ranges - opinions differ, but the most commonly used are: • Discrimination index - some set specific numbers for acceptable and unacceptable values; I would argue the more accurate guide is that the lower the p value, the higher the discrimination index needs to be. Generally, at 0.2 or above the item is considered to have discriminated; less than that is considered no discrimination. 0.3 or greater is considered highly discriminating. • Point-Biserial - as with the discrimination index, some set specific numbers for acceptable and unacceptable values. Generally, 0.2 and above is considered to discriminate and to have a positive association with overall performance on the assessment; lower levels are acceptable for mastery questions, and 0.3+ would be desired for discriminating questions.
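The rules of thumb from the two guideline slides can be turned into a rough screening function. This is only a sketch of one way to encode them: the name `review_item`, the `mastery` flag, and the flag wording are hypothetical, and the cutoffs (60%, 0.2, 0.3) are the commonly used values quoted above, not fixed standards.

```python
def review_item(upper, lower, disc_index, point_biserial, mastery=False):
    """Screen one item against the guideline cutoffs (proportions, not percents).

    Hypothetical helper; returns a discrimination label and a list of flags.
    """
    flags = []

    # Fewer than 60% of top performers correct: look at the question again.
    if upper < 0.60:
        flags.append("upper 27% below 60%: review the question")

    # The lower group should never outperform the upper group.
    if upper < lower:
        flags.append("lower 27% outperformed upper 27%")

    # Discrimination index bands.
    if disc_index >= 0.30:
        level = "highly discriminating"
    elif disc_index >= 0.20:
        level = "discriminating"
    else:
        level = "no discrimination"

    # Lower point-biserial values are acceptable for mastery items.
    if point_biserial < 0.20 and not mastery:
        flags.append("point-biserial below 0.20")

    return level, flags


# Items 7, 82 and 1 from the examples, with percents written as proportions.
print(review_item(0.82, 0.4615, 0.36, 0.28))
print(review_item(0.25, 0.825, -0.57, -0.43))
print(review_item(1.00, 0.9615, 0.04, 0.10, mastery=True))
```

The output is a prompt for human review, not a verdict: as the slides stress, context such as the intent of the question still decides whether a flagged item is actually a problem.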
  • 15. GENERAL GUIDELINES KR-20 - Used as an overall measure of reliability for the assessment. Generally reported on a scale from 0.0 to 1.0, with 0.0 being very poor and 1.0 being excellent. Quick notes: • Heavily influenced by the number of questions in the assessment. • Heavily influenced by the number of students taking the assessment. • The combination can FREQUENTLY lead to false positive and false negative KR-20 values.
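For reference, KR-20 is usually written as (k / (k - 1)) × (1 - Σ p·q / σ²), where k is the number of items, p is each item's difficulty, q = 1 - p, and σ² is the variance of students' total scores. The sketch below is a minimal version under stated assumptions: items are scored 0/1, the population variance (dividing by n) is used, and the function name `kr20` is illustrative. Some tools divide by n - 1 instead, which changes the result slightly.

```python
def kr20(responses):
    """KR-20 reliability for a matrix of 0/1 item scores (one row per student)."""
    n = len(responses)       # number of students
    k = len(responses[0])    # number of items

    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    # Population variance of total scores (assumption: divide by n, not n - 1).
    var_total = sum((t - mean_total) ** 2 for t in totals) / n

    if k < 2 or var_total == 0:
        raise ValueError("KR-20 needs at least 2 items and some score variance")

    # Sum of p * q over items, where p is the proportion answering correctly.
    pq_sum = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n
        pq_sum += p * (1 - p)

    return (k / (k - 1)) * (1 - pq_sum / var_total)
```

Both k and the score variance appear directly in the formula, which is one way to see why short exams and small cohorts make the value unstable.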
  • 16. EXTRANEOUS FACTORS Stats alone do not tell the whole story: • Student behavior – Cheating – Return on investment • Conflicting content/faculty • “six degrees from Sunday” Ways to increase the accuracy/usefulness of your stats: • Item review process – Format – Level of difficulty – Alternative correct options • Historical item analysis – Across assessments – Across versions • Reuse/Recycle
  • 18. • Simplified and detailed versions of item analysis reports • Historical item analysis data by version, assessment and in aggregate • Ability to pull item analysis by discipline/question author/category EXAMSOFT FIT THE DATA YOU NEED
  • 19. Thank you for attending! • Check our resource library: resources.examsoft.com to re-watch the webinar, download a PDF of the presentation or access a certificate of completion. • Be sure to check out our upcoming webinars: • Creating a Secure Testing Environment for Distance Education Programs • Learning about the Learners: Using Analytical Tools to Drive Curricular Decisions
  • 21. For More Information: Call: 1.866.429.8889 Email: info@examsoft.com Visit: learn.examsoft.com