The motion aperture problem refers to the observation that the direction of a translating contour within an aperture is ambiguous; the motion is consistent with an infinite number of motion directions (Wallach,
1935; Marr & Ullman,
1981; Adelson & Movshon,
1982). This is an intriguing problem for investigation because motion detection units in early visual cortical areas are often characterized as having relatively small receptive fields corresponding to a limited region of visual space: an aperture on the world (Hubel & Wiesel,
1962). Because physiology indicates that the aperture problem is a fundamental one to solve, the question of how the brain resolves locally ambiguous motion information is of some importance. This ambiguous motion can, in theory, be correctly interpreted by integrating motion estimates from several local detectors. Many models of visual motion detection, therefore, consist of two stages: one where local information is analyzed and a second stage where local detectors are combined (Adelson & Movshon,
1982; Welch,
1989; Weiss, Simoncelli, & Adelson, 1998).
In this two-stage framework, the “barber pole” display of Wallach (
1935; translated by Wuerger, Shapley, & Rubin,
1996) provides evidence that the shape of the aperture could influence the perceived direction of motion. Specifically, elongated apertures bias motion perception along the direction of the longest side. These results have been explained as the propagation of motion signals generated by the grating line terminators along the aperture border. These border terminators provide unambiguous motion information and disambiguate the local motion signals of the inner regions of the stimulus after integration across local apertures. Shimojo, Silverman, and Nakayama (
1989) described two possible ways of classifying aperture borders in a real world situation, as either being intrinsic (belonging to the grating) or extrinsic (resulting from occlusion). By adding stereoscopic disparity information to the borders, Shimojo et al. were able to bias the perception of motion in the barber pole stimulus. When disparity was added to the border of the aperture so that the grating appeared in front of the aperture and thus all borders were intrinsic, the barber pole effect followed the aperture shape observations of Wallach (
1935). When disparity was added to the border of the aperture so that the grating appeared behind the aperture and thus all the borders were extrinsic, the influence of the aperture shape was eliminated. In this configuration, the grating can be interpreted as being completed at a depth behind the occluder, following a process similar to amodal completion (Kanizsa,
1979). These results imply that real-world occlusion conditions can influence the perception of motion direction.
Duncan, Albright, and Stoner (
2000) carried out an additional test of the intrinsic and extrinsic border classification, pitting intrinsic borders against extrinsic borders in a stimulus configuration that they termed the “barber-diamond” stimulus. In the barber-diamond stimulus, two borders are oriented at 45° and two at −45° to the grating orientation. Disparity was used to designate two alternate borders as intrinsic (behind the grating) or extrinsic (in front). The bordering panels were composed of random dot textures. With this stimulus it was found that motion perception was consistently seen in the direction parallel to the intrinsic border, following the intrinsic-extrinsic predictions. Electrophysiological recordings in monkey area MT conformed with the human psychophysical data, finding cells that responded maximally to depth-motion conditions consistent with the perceived direction of surface motion under an occluder.
Several studies have indicated that the results of Shimojo et al. (
1989) and Duncan et al. (
2000) might be explained as a result of half-occlusions, or unpaired monocular regions in the stimulus, introduced by the local displacement of the interocular positions of contour terminators (Anderson,
1999; Castet & Wuerger,
1997; Castet, Charton, & Dufour,
1999). Monocular occlusion cues in stimuli that have no disparity have also been shown to influence the perception in an aperture stimulus (Liden & Mingolla,
1998).
The present study seeks to further investigate the effect of intrinsic and extrinsic border classification by using a different method to provide depth information to the observer. We introduced monocular structure-from-motion information to designate the aperture borders as being intrinsic or extrinsic prior to grating motion. The use of prior structure-from-motion information is interesting for a couple of reasons: it eliminates both monocular and binocular cues to depth ordering during the test phase of the stimulus, and it allows testing of the contribution of prior information to a relatively simple motion stimulus. In a second experiment, we tested the strength of this prior information using the barber pole stimulus. In a third experiment, we explored the time course of the influence of the prior information.