Exam and item development

Examination’s Purpose
The goal of the exam development process is to
accurately measure the candidate’s ability in
the field of practice.

An examination is created to measure the ability
of the candidate based upon the knowledge
and skills represented in each test question.

An examination is NOT created to measure the
ability of the candidate to take an exam.

Joosten || 2007

Considerations in Criterion-referenced Testing
Actions necessary for effective and efficient performance

Exam Validity & Reliability
Content Areas
Task/Skill Areas
Item Taxonomy

Item Taxonomy

RECALL
INTERPRETATION
PROBLEM SOLVING

Sample Recall Item
Which of the following describes the active growth
phase of the cycle of normal human hair
growth?

A. Anagen.
B. Betagen.
C. Catagen.
D. Telogen.

Sample Interpretation Item
A 23-year-old woman who is acutely febrile has had
an untreated, painful lower left third molar for 3
weeks. The patient can open her mouth only
8 mm, has some pain on swallowing, and has
moderate swelling just beneath the angle of the
mandible on the left side. The diagnosis most
likely is an abscess in which of the following
spaces?

A. lateral pharyngeal.
B. retropharyngeal.
C. submandibular.
D. masticatory.

Sample Problem Solving Item
A periapical roentgenogram reveals an impacted
lower third molar in an edentulous mandible. The
patient is experiencing recurrent acute and
chronic infection of the overlying soft tissue
denture base. For definitive treatment, the
surgeon should:

A. reline and relieve the denture base.
B. remove the tooth using appropriate antibiotic
control.
C. trim the swollen tissue and prescribe antibiotics.
D. advise the patient to remove the denture when
eating.

Developing Multiple Choice Items

Issues and Methods

Multiple Choice Items
GOAL: Maintain a pool of exam items which are
appropriate to measure the knowledge and
skills necessary for safe and effective
performance in the field of practice.

Item construction affects the performance of
your exam.

A multiple choice item is a specific form of item
that is composed of a stem and options.

Parts of an item:
Stem
Distractors
Correct answer

Stem
The stem of a multiple choice item may:

ask a question
Which of the following microscopic
subtypes of ameloblastoma is most
common?

give an incomplete statement
The most common microscopic
subtype of ameloblastoma is:

describe a situation (along with a question or
incomplete statement)
A 25-year-old man is brought to the
emergency room. He was found
lying unconscious on the sidewalk.
After ascertaining that the airway is
open, the next step in management
should be:

Item Response Options
Options are all the possible answers for a stem.
One correct (best) answer
Three distractors

The best answer is agreed upon by experts.

The distractors are logical misconceptions of
the best answer.
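
The anatomy described above (a stem, one best answer, and three distractors) can be sketched as a small data structure. This is a minimal illustration, not part of the original slides: the class name MultipleChoiceItem and its fields are hypothetical, and the key "A" for the sample recall item is an assumption based on anagen being the growth phase of the hair cycle, since the slide does not state the key.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MultipleChoiceItem:
    """One item: a stem plus labelled options, exactly one of which is keyed."""
    stem: str
    options: dict[str, str]  # label -> option text, e.g. {"A": "Anagen."}
    key: str                 # label of the one best answer

    def __post_init__(self) -> None:
        # One best answer plus three distractors gives four options.
        if len(self.options) != 4:
            raise ValueError("expected one best answer and three distractors")
        if self.key not in self.options:
            raise ValueError(f"key {self.key!r} is not one of the options")

    @property
    def distractors(self) -> dict[str, str]:
        """All options other than the best answer."""
        return {label: text for label, text in self.options.items()
                if label != self.key}


# The sample recall item, with the key assumed to be A (see note above).
recall_item = MultipleChoiceItem(
    stem=("Which of the following describes the active growth phase "
          "of the cycle of normal human hair growth?"),
    options={"A": "Anagen.", "B": "Betagen.", "C": "Catagen.", "D": "Telogen."},
    key="A",
)
print(sorted(recall_item.distractors))
```

Validating the option count and the key when the item is created keeps malformed items out of the pool before any statistics are collected.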

Developing Items
Items should have one best answer. Avoid
items based on opinion or for which there is
not an accepted answer.
Each item must focus on a single issue, fact, or
problem.
Items should test important and pertinent
material while avoiding trivial facts.
Items should be developed utilizing good
grammar, punctuation, and spelling.
Attempt to write interpretation and problem
solving items.
Use a standard number of responses.
Options should avoid “all of the above” and
“none of the above.”

Stem Construction
Stems should:

Avoid overly specific knowledge, excess
information, and teaching in the stem.
Include the central idea and most verbiage
in the stem.
Be stated positively and avoid negative
phrasing.
Avoid personal pronouns (e.g., you).
Use terminology common to practice and
avoid textbook verbatim phrasing.
Avoid absolute terms such as “always” and
“never.”

Response Construction
Responses should be:

Organized in a logical order
Independent and not overlapping
Fairly consistent in length
Homogeneous
Plausible

Any Questions?

Item Evaluation
P-value: proportion of candidates who selected a given response.

Point Biserial Correlation: correlation between candidates’
total test scores and whether they selected a given
response.
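
Both statistics can be computed directly from candidate responses. The sketch below is an illustration rather than the presenter's method: the function names and the ten-candidate data set are hypothetical, and the point biserial is computed as the Pearson correlation between a 0/1 "selected this option" indicator and the total test score.

```python
from math import sqrt


def option_p_values(responses, options="ABCD"):
    """Proportion of candidates who selected each option."""
    n = len(responses)
    return {opt: sum(r == opt for r in responses) / n for opt in options}


def point_biserial(selected, scores):
    """Pearson correlation between a 0/1 indicator and total test scores."""
    n = len(scores)
    mean_x = sum(selected) / n
    mean_y = sum(scores) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(selected, scores))
    var_x = sum((x - mean_x) ** 2 for x in selected)
    var_y = sum((y - mean_y) ** 2 for y in scores)
    if var_x == 0 or var_y == 0:
        # Undefined when everyone (or no one) chose the option.
        return float("nan")
    return cov / sqrt(var_x * var_y)


# Hypothetical data: one item answered by ten candidates, A is the key.
responses = ["A", "A", "A", "A", "A", "A", "A", "B", "B", "C"]
scores = [95, 90, 88, 85, 80, 78, 60, 70, 55, 50]  # total test scores

p = option_p_values(responses)
for opt in "ABCD":
    indicator = [1 if r == opt else 0 for r in responses]
    print(opt, p[opt], point_biserial(indicator, scores))
```

In this made-up data the keyed answer A is chosen mostly by high scorers, so its correlation is positive, while distractor B is chosen by lower scorers and its correlation is negative.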

Good Item

[Plot: candidate groups 1ST through 5TH on the vertical axis; horizontal axis from 0 to 100]

A IS THE CORRECT ANSWER
          A      B      C      D
P-VALUE   0.70   0.15   0.05   0.01

Good P-value:
Poor Discrimination

[Plot: candidate groups 1ST through 5TH on the vertical axis; horizontal axis from 0 to 100]

C IS THE CORRECT ANSWER
          A      B      C      D
P-VALUE   0.05   0.07   0.73   0.15
RPBI      0.11  -0.10   0.02  -0.02

Low P-value:
Low Discrimination

[Plot: candidate groups 1ST through 5TH on the vertical axis; horizontal axis from 0 to 100]

A IS THE CORRECT ANSWER
          A      B      C      D
P-VALUE   0.47   0.33   0.15   0.05
RPBI      0.08  -0.13   0.01   0.09
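
The two problem items above can be screened automatically. The following is only a sketch: the slides give no numeric cut-offs, so MIN_P, MAX_P, and MIN_RPBI are assumptions chosen so that the output agrees with the slide titles, and review_flags is a hypothetical helper. The statistics are transcribed from the two slides.

```python
# Illustrative cut-offs only; they are assumptions, not values from the slides.
MIN_P, MAX_P = 0.50, 0.90
MIN_RPBI = 0.20


def review_flags(key, p_values, rpbi):
    """List the reasons an item might need review, given its option statistics."""
    flags = []
    if not MIN_P <= p_values[key] <= MAX_P:
        flags.append("p-value of keyed answer outside range")
    if rpbi[key] < MIN_RPBI:
        flags.append("low discrimination on keyed answer")
    for option, r in rpbi.items():
        # A distractor should not track total score better than the key does.
        if option != key and r > rpbi[key]:
            flags.append(f"distractor {option} correlates higher than key")
    return flags


# "Good P-value: Poor Discrimination" slide (C is the correct answer).
item_2 = review_flags(
    key="C",
    p_values={"A": 0.05, "B": 0.07, "C": 0.73, "D": 0.15},
    rpbi={"A": 0.11, "B": -0.10, "C": 0.02, "D": -0.02},
)

# "Low P-value: Low Discrimination" slide (A is the correct answer).
item_3 = review_flags(
    key="A",
    p_values={"A": 0.47, "B": 0.33, "C": 0.15, "D": 0.05},
    rpbi={"A": 0.08, "B": -0.13, "C": 0.01, "D": 0.09},
)

print(item_2)
print(item_3)
```

A flag is a prompt for expert review of the item's wording and key, not an automatic reason to discard it.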

Evaluating Item Stems
1. Focus on a single issue, fact, or problem in each item.

2. Avoid overly specific knowledge.

3. Avoid textbook verbatim phrasing for items.

4. Avoid items based on opinion.

5. Avoid items for which there is not an accepted answer.

6. Test important material, while avoiding trivial facts.

7. State the item positively and avoid negative phrasing.

8. Include the central idea and most verbiage in the
stem.

9. Use one best answer format.

10. Use good grammar, punctuation, and spelling.

11. Avoid excess information in the stem as well as
teaching in the stem.

12. Avoid personal pronouns (e.g., you).

13. Attempt to write stems that require interpretation and
problem solving from the candidate (rather than recall).

Anatomy of Item Responses
Item responses should consist of:

1.) the best answer (agreed upon by experts).

2.) distractors (logical misconceptions of the best
answer).

Evaluating Item Responses
1. Use a standard number of responses.

2. Place options in a logical order.

3. Keep options independent and not overlapping.

4. Keep options homogeneous in content.

5. Keep the length of the options fairly consistent.

6. Be sure all distractors are plausible.

7. Be sure all distractors are logical
misconceptions.

8. Avoid “all of the above” and “none of the above.”

9. Phrase options positively, not negatively.

10. Avoid use of slang.

11. Avoid absurd or “fantastic” options.

12. Avoid giving clues through faulty grammar.

13. Make sure there is only one best answer.

14. Avoid absolute terms such as “always” and “never.”

15. Evenly distribute position of the correct answer.
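
Several of the checks in this list are mechanical enough to automate as a first pass before expert review. The sketch below covers only the option count, catch-all responses, "always" and "never", length consistency, and key position; the function names, the expected count of four, and the 2.0 length ratio are all assumptions made for illustration. Plausibility and homogeneity still need a human reviewer.

```python
import re
from collections import Counter

BANNED_OPTIONS = ("all of the above", "none of the above")
ABSOLUTES = re.compile(r"\b(always|never)\b", re.IGNORECASE)


def check_responses(options, expected_count=4, max_length_ratio=2.0):
    """Return warnings for one item's options (dict of label -> text)."""
    warnings = []
    texts = list(options.values())
    if len(texts) != expected_count:
        warnings.append(f"expected {expected_count} options, found {len(texts)}")
    for label, text in options.items():
        if text.strip().rstrip(".").lower() in BANNED_OPTIONS:
            warnings.append(f"option {label} is a catch-all response")
        if ABSOLUTES.search(text):
            warnings.append(f"option {label} uses 'always' or 'never'")
    # Crude length check: longest option vs. shortest, in characters.
    lengths = [len(t) for t in texts if t]
    if lengths and max(lengths) > max_length_ratio * min(lengths):
        warnings.append("option lengths are not consistent")
    return warnings


def key_positions(keys):
    """Count how often each position holds the correct answer."""
    return Counter(keys)


# Options from the sample interpretation item pass these checks.
clean = check_responses({
    "A": "lateral pharyngeal.",
    "B": "retropharyngeal.",
    "C": "submandibular.",
    "D": "masticatory.",
})

# A deliberately flawed, made-up set of options.
flawed = check_responses({
    "A": "Always reline the denture.",
    "B": "None of the above.",
    "C": "Trim.",
})

print(clean)
print(flawed)
print(key_positions(["A", "C", "B", "D", "A", "B", "C", "D"]))
```

Counting key positions across a whole exam form shows at a glance whether one position holds the correct answer far more often than the others.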

General Considerations
•Does the item deal with trivial content?
•Is the answer discrimination too fine?
•Does the item stem include unrelated information?
•Is there more than one correct answer?
•Is the item highly ambiguous?
•Is the question so obvious that the best answer appears to be
the only plausible choice?
•Are some distractors ‘tip-offs’ because of the choice of words or
phrasing in the responses or stems?
•Are all of the distractors parallel?
•Are the responses of comparable plausibility?

In Summary
The goal of item writing or editing is to create items that
will measure the skills and abilities of the candidates.
To do that, the items must be clear, concise, and accurate,
with sound structure and pertinent content.

Review Item Statistics
P-value – proportion of candidates who selected a given response
Point Biserial Correlation – correlation between candidates’
total test scores and whether they selected a given
response:
positive – expected for the correct answer
negative – expected for distractors
