2. Sampling-Writing items-Moderating items
• Choices have to be made: Avoid easy to test elements and
focus on the elements that are particular important
• Write to be understood (Imagine you are taking the test) and
rewrite if necessary and offer a key example. (Do not strive for
perfection)
• Colleagues can help identify weakness and remedy them.
3. Informal trailling of items on native speakers
• Moderated items should be tested by native speakers of
the same conditions and characteristics as the ones
taking the test
• Items that seem too difficult for them should be revised
or replaced
4. Trailling of the test on a group of non-native speakers
similar to those for whom the test is intended
• After the native speaker trial administer the test to non-native
speakers similar to those who are taking the test.
• Of course this is difficult to do, so…
5. Analysis of results of the trial
• Two kinds:
• Statistical: Reliability of the test as a whole and individual
elements
• Qualitative: Detect misinterpretation, unanticipated but
possibly correct responses, and other faulty items.
6. Calibration of scales
• Every rating used for oral or writing scale should be calibrated. This
means collecting samples of performances which cover the full range
of the scales.
• A team of experts then analyze the samples and assign appoint on the
relevant scale.
• The assigned samples provide reference points for all future uses of
the scales.
• Final version of the test can be validated
7. Writing handbooks for test takers, test users
and staff
• They expect to have:
• The rationale for the test
• An account of how the test was developed and validated
• A description of the test (Which may include a version of the specifications)
• Sample items (Or a complete sample test)
• Advice on preparing for taking the test
• An explanation of how scores are to be interpreted
• Training materials (For interviewers, raters, etc.)
• Details of testing administration.
• Everyone who uses the handbook should be trained to use it