
Table 6 Mean inter-judge agreement rates for each of the six quality metrics and the overall cognitive domain judgment item, by modality

From: Can automated item generation be used to develop high quality MCQs that assess application of knowledge?

| Item | AIG, mean (SD) | Traditional, mean (SD) |
|---|---|---|
| 1^a | 0.78 (0.17) | 0.68 (0.19) |
| 2^b | 0.86 (0.10) | 0.73 (0.09) |
| 3^c | 0.72 (0.18) | 0.71 (0.07) |
| 4^d | 0.79 (0.05) | 0.71 (0.08) |
| 5^e | 0.42 (0.18) | 0.49 (0.11) |
| 6^f | 0.27 (0.16) | 0.27 (0.10) |
| Overall cognitive domain judgment | 0.92 (0.07) | 0.88 (0.04) |

The overall cognitive domain judgment is whether the item tests factual knowledge only or tests application of knowledge.
a. The central idea is in the stem (i.e., the stem is required to answer the item)
b. The directions in the stem are very clear
c. There are no obvious cues or item flaws (grammatical cues, conspicuous right answer, etc.)
d. The length of the choices is about equal
e. All distractors are plausible
f. This is a high-quality item