To estimate disparity, the visual system must determine which parts of the two retinal images correspond. Doing this by cross-correlating the two eyes' images has been used successfully in computer vision (Clerc & Mallat,
2002; Kanade & Okutomi,
1994), in modeling human vision (Banks, Gepshtein, & Landy,
2004; Cormack, Stevenson, & Schor,
1991; Fleet, Wagner, & Heeger,
1996; Harris, McKee, & Smallman,
1997), and in modeling binocular interaction in visual cortex. The prevailing model of binocular integration in visual cortex is the disparity-energy model (Cumming & DeAngelis,
2001; Ohzawa,
1998; Ohzawa, DeAngelis, & Freeman,
1990). In this model, the output of binocular complex cells can be expressed as
where
SL,R are the responses of simple cells in the left and right eyes and even and odd refer to the symmetry of the simple-cell receptive fields (Prince & Eagle,
2000). In the last two terms, the left eye's response is multiplied by the right eye's response. A bank of such cells, each tuned for a different disparity, making this computation performs the equivalent of windowed or local cross-correlation.