The weights (
wi) are proportional to the normalized inverse variances (
) of the cue distributions (
), so greater weight is assigned to less variable (i.e., more reliable) cues (Backus & Banks,
1999; Ernst & Banks,
2002; Ghahramani, Wolpert, & Jordan,
1997; Jacobs,
1999; Oruç, Maloney, & Landy,
2003). The variance of the combined estimate is lower than the variance of any single-cue estimate, so by combining information from several depth cues, the visual system can in principle estimate slant (or any other 3-D property) with greater precision than it can by relying on one cue alone. There are now many empirical studies showing that cue reliability is taken into account when combining sensory signals (e.g., Backus & Banks,
1999; Buckley & Frisby,
1993; Jacobs,
1999; Körding & Wolpert,
2004; van Beers, Sittig, & Denier van der Gon,
1998; van Beers, Wolpert, & Haggard,
2002). Furthermore, several studies have tested the quantitative predictions of this model by measuring the reliability of the underlying estimators when only one cue is informative and using these to predict performance when multiple cues are available (Alais & Burr,
2004; Ernst & Banks,
2002; Gepshtein & Banks,
2003; Hillis, Watt, Landy, & Banks,
2004; Knill & Saunders,
2003; Landy & Kojima,
2001). These studies show that performance is often close to that predicted by the statistically optimal model (in the sense of being the minimum variance unbiased estimate; Ghahramani et al.,
1997).