The ideal correlation method does not give a final and definitive test of the linear observer model as a model of human performance. First of all, it only tests whether human performance is consistent with the classification image over the narrow range of stimuli encountered in the classification image experiment (i.e., two signals at a fixed contrast, perturbed by Gaussian noise). Outside this range, the linear model will often fail (e.g., the linear model predicts that psychometric functions are linear when plotted as
d′ versus signal contrast [see
Equation 10 in the
1], but it is well known that psychometric functions are often nonlinear [Legge, Kersten, & Burgess,
1987; Pelli,
1985]). Second, other tests may reveal failings of the linear model that are not detected by the ideal correlation method, even within the same narrow range of stimuli (e.g., early pointwise nonlinearities are not detected by the ideal correlation method, as discussed earlier, but they can be detected by histogram contrast analysis) (Chubb, Econopouly, & Landy,
1994; Murray et al.,
2001). Our goal in introducing the ideal correlation method is not to see whether there is a need for more complex models than the linear model as a general model of human vision, for there is ample evidence that more complex models are needed. Rather, our goal is to provide one test of whether classification images provide a reasonable phenomenological description of observers’ strategies, even over a limited range of stimuli.
In this regard, our simulations of observers with spatial uncertainty are instructive. The model observers were substantially different from the linear model, in ways that had large effects on their performance. Nevertheless, in some cases we were able to predict observers’ performance from their classification images. For instance, the model observers with zero and two pixels of spatial uncertainty had absolute efficiencies of 0.50 and 0.20, respectively — thus two pixels of spatial uncertainty had a large effect on performance — and yet even for the observer with two pixels of uncertainty, the predictions were accurate. Obviously, then, just because these predictions succeed, we cannot conclude that an observer is linear. Rather, if the predictions succeed, this only suggests that over the narrow range of stimuli encountered in the experiment, the observer’s strategy is described reasonably well by the linear model. For our model observers, spatial uncertainty is reflected in the classification image as a smearing of the underlying template. In general, smearing of a linear template is an inadequate model of spatial uncertainty, but our simulations suggest that over a small stimulus range, such smearing gives an adequate description of how different parts of the stimulus affect an observer’s responses. Again, this does not rule out the possibility that other tests could detect failings of the linear model, even over the same limited range of stimuli.
An analogy is the linear Taylor series approximation to an arbitrary function:
f(
x) =
f(
x0) +
f′(
x0)(
x −
x0). If this approximation is accurate over an interval surrounding
x0, we do not conclude that
f is a linear function. We conclude that although the full definition of
f may involve nonlinear terms, over the interval of interest, the behavior of
f is adequately characterized by a linear approximation, and we will not be greatly misled about the values of
f over that interval by examining the linear approximation. Just so, visual processes can often be approximated as being linear over a limited range (e.g., Ahumada,
1987), and this is the approximation that underlies use of classification images to characterize observers’ strategies.