There has been some debate over the exact nature of the primitives of binocular matching. The outputs of linear filters, such as luminance-defined edges or “zero crossings” (Marr & Poggio,
1979), or local luminance extrema (e.g., Mayhew & Frisbee,
1981) are often the primary candidates for matching in models of stereoscopic vision. However, a variety of other features, such as monocular edges defined by motion (Lee,
1970), flicker (Prazdny,
1984), color (Ramachandran, Rao, & Vidyasagar,
1973), or texture (Ramachandran et al.,
1973), or, more recently, second-order features defined by contrast (e.g., Wilcox,
1999; Wilcox & Hess,
1996; Ziegler & Hess,
1999) have been shown to produce a percept of depth, although this percept is often only directional rather than quantitative in nature (Ziegler & Hess,
1999). There has even been an informal report of a percept of motion in depth from a change in disparity and associated IOVD in such images (Prazdny,
1984), although data have never been formally presented. Depth and motion in depth percepts from these second-order stereo stimuli still involve the matching of visible features in the two monocular images, despite the fact that these may not be defined by luminance variations. We refer to depth perception involving the matching of any such primitives (whether they are first or second order) as conventional stereopsis.