One of the primary problems faced by the visual system is the underdetermination of world properties by sensory data (Knill & Richards,
1996; Shams & Beierholm,
2010). A key example of this problem is in motion perception. Ambiguity in motion perception arises for a number of reasons; however, one of the main causes is what has become known as the “aperture problem” (Adelson & Movshon,
1982; Fennema & Thompson,
1979; Marr & Ullman,
1981). The aperture problem refers to the fact that a one-dimensional (1D) image contour viewed through a finite aperture provides no time-varying information parallel to the orientation of the contour. This means that the information available through the aperture is completely ambiguous as to the underlying two-dimensional (2D) motion. In fact, it is equally consistent with an infinite family of 2D motions. Because neurons in early cortical areas, such as V1, have small receptive field sizes compared to image contours, they are subject to the aperture problem and, as a consequence, can predominantly signal only 1D motion orthogonal to image contours spanning their receptive fields (for a comprehensive review, see Pack & Born,
2008).
One way in which this local ambiguity could be overcome is by pooling the response of these cells over orientation and space. This is because with two or more 1D motion signals of different orientations, the underlying 2D motion can be uniquely determined (Adelson & Movshon,
1982). Research has therefore focused on identifying motion-sensitive neurons that could integrate the information available earlier in the motion pathway in order to recover 2D global motion. One area that has been the focus of much research is extrastriate area MT. Cells in MT have large receptive fields that receive feed-forward input from early cortical areas such as V1 (Movshon & Newsome,
1996) and are known to play a significant role in pooling local motion information (Born & Bradley,
2005).
MT has therefore been seen as a candidate site for helping resolve the ambiguity characterizing early motion signals (Movshon, Adelson, Gizzi, & Newsome,
1985; Pack & Born,
2001). In terms of output, MT projects to areas such as MST, which contains cells with large receptive fields that preferentially respond to higher order properties of motion such as patterns of optic flow (Duffy & Wurtz,
1991a,
1991b; Tanaka & Saito,
1989). Overall, this suggests a hierarchical pooling of information through the motion pathway, which works toward resolving the local ambiguity characterizing the response of early cortical areas (Born & Bradley,
2005; Pack & Born,
2008). Note that although one might expect that motion pooling would primarily involve areas V1, MT, and MST, our psychophysical experiments address a more general
processing architecture in which early ambiguous motion signals are disambiguated through subsequent stages of processing.
While the integration of motion signals is clearly important for the estimation of 2D object motion, the visual system faces the competing need to retain access to spatially precise local information in order to represent motion boundaries and features (Braddick,
1993). This type of information is key to processes such as object segregation and identification (Stoner & Albright,
1996) and is also, in a complementary way, exactly the type of information the visual system needs in order to decide which signals to integrate (Curran, Hibbard, & Johnston,
2007). This points to a tight coupling between integration and segregation in visual motion processing; however, currently there is no clear consensus as to how these competing needs are balanced or the neural architecture that might allow these processes to be instantiated in the brain (Hedges, Stocker, & Simoncelli,
2011; Pack & Born,
2008).
In the present paper, we used global motion Gabor arrays (Amano, Edwards, Badcock, & Nishida,
2009) and the motion aftereffect (Mather, Pavan, Campana, & Casco,
2008; Mather, Verstraten, & Anstis,
1998; Wade,
1994) to help elucidate the interplay between local and global motion processing. Global motion Gabor arrays provide an ideal stimulus because locally each individual Gabor is consistent with an infinite family of possible motions, but when the Gabors in an array are assigned orientations and drift speeds consistent with a single 2D motion solution, the whole array perceptually coheres into a single rigidly moving object/surface (see Amano et al.,
2009 for further details). We show that after adaptation to an array such as this, the aftereffect produced by ambiguous local Gabors in the array is shifted toward the global motion direction during adaptation.
This suggests that the visual system is able to use the statistical regularities characterizing 2D global motion to help infer the cause of the ambiguous local signals of which the global motion is composed. This highlights the complex interplay that exists between local and global motion processing and points to a neural architecture in which local directionally coded cells project to, and receive feedback from, cells with large receptive fields that integrate local signals in order to respond to global 2D motion (Sillito, Cudeiro, & Jones,
2006). This type of architecture offers a way in which the visual system might manage the competing requirements to integrate information to reduce ambiguity, but at the same time maintain access to spatially precise local information for tasks such as object segregation.