ICT role in 21st century education and it's challenges.
Creating an in-house computerized adaptive testing (CAT) program with Concerto
1. Creating an in-house
computerized adaptive testing
(CAT) program with Concerto
Atsushi, MIZUMOTO
(Kansai University)
2013/09/20
JLTA at Waseda University
11. Advantages of CAT
•Tailored for individual test-takers
•Shorter test time
•More precision (= SE smaller)
•No need for random sampling
www.geocities.jp/kosugitti/labo/irtnote.pdf
13. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
14. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
23. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
24. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
25. Constructing an Item Bank
(Pretest)
•Vocabulary Test (Mizumoto, 2006)
http://www.mizumot.com/files/VocSizeMeasure.pdf
•Based on SVL 12,000
(Up to 8,000 level; 30 items for each level)
•716 university EFL learners
27. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
28. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
29. Calibrating the Item Bank
•240 items analyzed (Rasch model)
•150 items left for the item bank
•Calibrated with two parameter
logistic model (item difficulty & discrimination)
•Update the csv file to Concerto
30. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
31. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
32. Specifications of CAT
•Starting point
(parameters, initial ability, randmized/fixed)
•Ability estimation method
(empirical Bayes and others)
•Stopping rule
(Number of items/Standard error)
•Final ability estimation
40. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
41. Creating a CAT Program
•Choosing the CAT System
•Constructing an Item Bank (Pretest)
•Calibrating the Item Bank
•Determine Specifications & Feedback
•Administering the CAT
42. 268 test takers
(university first year)
(1) CAT
(2) Paper-pencil version
(68 items) common person linking
(3) Questionnaire
“What did you think of
the CAT result?”
45. -1 0 1 2 3
01234
Pape
n = 268
CAT
(30Qs)
M = 1.71
SD = 1.13
P-P
(68Qs)
M = 1.72
SD = 0.95
46. -1 0 1 2 3
01234
Pape
n = 268
CAT
(30Qs)
M = 1.71
SD = 1.13
P-P
(68Qs)
M = 1.72
SD = 0.95
Mean diff. = -0.02
95% CI [-0.07, 0.04]
d = 0.01
Power = .06
47. -1 0 1 2 3
01234
Pape
n = 268
CAT SE
(30Qs)
M = 0.39
SD = 0.11
P-P SE
(68Qs)
M = 1.71
SD = 1.13
48. -1 0 1 2 3
01234
Pape
n = 268
CAT SE
(30Qs)
M = 0.39
SD = 0.11
P-P SE
(68Qs)
M = 1.71
SD = 1.13
Mean diff. of SE
= -1.32
95% CI [-1.44, -1.19]
d = 1.65
Power = 0.99
49. Evaluation
CAT vs. Paper-pencil
Means: CAT = Paper-pencil
SEs: CAT < Paper-pencil
CAT measures the same ability
with much more precision
(with fewer items).
51. Result of the Questionnaire
Frequency
Response
150 100 50 0 50 100 150
Very inaccurate Inaccurate Rather Inaccurate Rather accurate Accurate Very accurate