To segment a figure from its background, the boundaries need to be detected and then the surface needs to be segregated (Appelbaum, Wade, Vildavski, Pettet, & Norcia,
2006; Scholte, Jolij, Fahrenfort, & Lamme,
2008; Machilsen & Wagemans,
2011; Layton, Mingolla, & Yazdanbakhsh,
2014). The integration of these two steps is based on the formation of border ownership (Komatsu & Ideura,
1993; von der Heydt,
2015) that determines which surface is integrated with the contour to form a figure. Physiological studies found that V1 and V2 are responsible for boundary detection (Rossi, Desimone, & Ungerleider,
2001; Marcus & Van Essen,
2002), whereas neurons in V2 and V4 areas selectively respond to border ownership (H. Zhou, Friedman, & von der Heydt,
2000), and such selectivity arises via feedback projection from higher visual cortex (Jehee, Lamme, & Roelfsema,
2007; Layton et al.,
2014). On the other hand, an electroencephalogram (Scholte et al.,
2008) and a transcranial magnetic stimulation study (Heinen, Jolij, & Lamme,
2005) found that neural correlates of surface segregation first appear in temporal areas and back propagate to occipital areas. These studies suggested that local and global information of figure-ground segregation proceed along the visual ventral pathway. Boundary detection can be generated from different cues, such as color contrast, texture orientation, and, in this case, binocular disparity, which can also be detected in the V1 and V2 areas (Prince, Cumming, & Parker,
2002; Prince, Pointon, Cumming, & Parker,
2002; Parker,
2007). In the current study, it may be argued that to identify an object, one needs to detect the local boundary that is defined by the binocular disparity. In other words, the global structure is apparent only after the local feature has been extracted. This seems to contradict the proposal that global information proceeds from local information in visual perception. However, we argue that there is a distinction between psychological perception and the underlying physiological process. Thus, even though boundary detection is believed to be the first stage in the physiological process, it may not be sufficient to generate the psychological perception of the object, or even the boundary itself. To identify an object, the complete process of the figure-ground segregation is required, including border ownership assignment and surface segregation. With regard to the random-dot stereograms that were used in the current study, the visual system is required to pair all the dots in one eye with the corresponding dots in another eye, a process known as the stereo-correspondence problem (Julesz,
1964). Thus, even when the local boundary is detected within areas V1 and V2, the stereo-correspondence problem remains unsolved (Verhoef, Vogels, & Janssen,
2016) and can be solved only in the temporal visual cortex (Janssen, Vogels, Liu, & Orban,
2003), so that the successful perception of the 3D structure can be formed only after all the corresponding dots are matched.