In contrast, the IC theory assumes that the noise in the estimation process is negligible. In other words, perceived depth is approximately the same across repeated viewings of the same stimulus under the same viewing conditions. However, noise in the response distributions of a task (task noise, which is often considered negligible by MLE models of cue combination) may arise due to factors such as response execution and memory requirements. Importantly, this noise is independent of distal stimulus properties such as texture quality or viewing distance. This leads to a different interpretation of the JND: Given a particular cue strength, the JND is the change in distal stimulus magnitude needed to produce a perceptual difference that is large enough to overcome the effects of task noise which we represent as σ
N. As shown in the hypothetical experiment of
Figure 6b, the JND is larger at the far viewing distance because the cue strength becomes weaker (consistent with the fact that binocular disparities and their gradients decrease with viewing distance). We see that the JND is inversely proportional to the cue strength (
\(J_{i}\,=\,\frac{\sigma_{N}}{k_{i}}\)). Recall that the vector sum model posits that adding cues to a stimulus increases the combined-cue strength according to the magnitude of the vector of cue signals. Because the JND is inversely proportional to cue strength, the vector sum model therefore predicts that the JND shrinks with additional cues, similar to the MLE model. Specifically, the texture-only, disparity-only, and combined-cue JNDs are given by
\({J_t} = \frac{{{\sigma _N}}}{{{k_t}}}\) ,
\({J_d} = \frac{{{\sigma _N}}}{{{k_d}}}\), and
\({J_c} = \frac{{{\sigma _N}}}{{{k_c}}} = \frac{{{\sigma _N}}}{{\sqrt {{k_t}^2 + {k_d}^2} }}\), respectively.
Appendix 3 shows how, from these equations, we can predict the combined-cue JND directly from the single-cue JNDs as follows:
\({J_c} = \frac{1}{{\sqrt {\frac{1}{{J_t^2}} + \frac{1}{{J_d^2}}} }}\). Notice that this equation is formally identical to
Equation 2 of the MLE model, where JNDs are assumed to measure the estimation noise (i.e.,
Ji = σ
i). However, the vector sum model predicts that this relationship will hold at the same
perceived depth, where task-related task noise is expected to be equivalent as the decision process operates on perceived depth, whereas the MLE model predicts it will hold at the same
simulated depth, where estimation noise is expected to be equivalent. Thus, the prediction of the two models for a given dataset may slightly differ, as we will show.