First, we confirmed that there were significant individual differences between observers. This was achieved by comparing a pair of nested models using the generalized likelihood ratio test. The model with the fewest parameters had 55 free parameters: 54 of these set the early noise
σE for each combination of six observers, three display durations, and three memory conditions, and there was a further free parameter that set the same effective set size
M for each observer and condition. Against this model, we compared the fit of a 60-parameter model, which again had 54
σE parameters but now had six
M parameters, one for each observer. Because they are nested, the model with more parameters will always fit at least as well as the more-restricted model. To determine whether the fit is significantly better, we calculate the statistic,
D, given by
where
L2 is the likelihood of the best-fitting model with more parameters, and
L1 is the likelihood of the more restricted model. If the model with more parameters is no better (i.e., the null hypothesis is true), then
D is distributed approximately as the
χ2 distribution with degrees of freedom given by the difference between the numbers of parameters in the two models (Mood, Graybill, & Boes,
1974, pp. 440–441). Therefore, a
χ2 test indicates whether the less-restricted model is significantly better. For the comparison between the two models described previously, we have
,
p = 0.0001, indicating that the six observers were not all equally efficient. Graphically, this can be appreciated by the scatter of thin lines in
Figure 2b.