The process by which retinal images become the visual percepts we “see” is still a mysterious one. It is nevertheless clear that theory can be usefully applied to predict perceptual experience (or appearance or qualia) when a person opens her eyes and looks at a particular stimulus. Recent work describes appearance as resulting from the construction of representations of the environment by processes that are near-optimal in their ability to use measured visual signals (e.g., Backus & Banks,
1999; Brainard & Freeman,
1997; Feldman,
2001; Feldman & Tremoulet,
2006; Geisler & Kersten,
2002; Geisler, Perry, Super, & Gallogly,
2001; Hillis, Watt, Landy, & Banks,
2004; Hogervorst & Eagle,
1998; Kersten, Mamassian, & Yuille,
2004; Knill & Richards,
1996; Weiss, Simoncelli, & Adelson,
2002). This approach has its origin in older ideas about the probabilistic nature of visual information (Brunswik,
1956; Fechner,
1860; Helmholtz,
1910/1925; Hochberg & Krantz,
2004). In these theories, estimated scene properties are manifested consciously within visual percepts as perceptual
attributes such as perceived velocities, surface slants, surface colors, contour groupings, object identities, and so on. The fusion of sensory data to estimate scene parameters has received particular attention, because theory makes testable predictions about how various cues (each of which is a statistic computed on the stimulus) are combined with each other and with prior belief to construct a perceptual attribute that represents a scene property.
Cue recruitment experiments have recently demonstrated that the visual system is capable of learning to utilize new cues during the construction of appearance (Haijiang, Saunders, Stone, & Backus,
2006). It is not clear, however, how to measure the strength of these effects. A theory to quantify the learning would make it possible to use bistable stimuli in quantitative tests of hypotheses about cue recruitment and cue combination. The purpose of this paper is to describe such a theory, called the Mixture of Bernoulli Experts (MBE). This theory justifies the use of probit analysis to quantify cue effectiveness during dichotomous perceptual decisions.