The human visual system receives input from two forward-facing eyes that deliver significant overlap. A possible reason for this arrangement is to allow more of our cluttered world to be visible (Changizi & Shimojo,
2008). While this might have increased survival chances for our ancestors, this is still relevant today. When picking berries, there is a distinct benefit (less time spent near mosquitos) in detecting ripe berries quickly. Most of the time the berries are not in plain view but are partially “hidden” behind leaves and parts of the shrubs. As we will describe below, more of the berries are potentially visible if information from both eyes is used.
Under some circumstances, we know that the visual system is able to fill in information about a region that is not accessible, by interpolating information about the surface around that region (Durgin, Tripathy, & Levi,
1995). This is used to make an educated guess about what the inaccessible region is most likely to contain. Even though we have no visual input as to what these occluded regions contain, we perceive a coherent, complete object. This process is called amodal completion (Michotte, Thinès, Costall, & Butterworth,
1991; for a review see Sekuler & Murray,
2001).
Amodal completion of objects can slow performance during visual search tasks (e.g., He & Nakayama,
1992) and stereoacuity discrimination (e.g., Hou, Lu, Zhou, & Liu,
2006). It can improve texture segmentation (e.g., He & Nakayama,
1994) and pattern discrimination (e.g., Gold, Murray, Bennett, & Sekuler,
2000), and amodally completed objects can cause adaption (e.g., Fang & He,
2005). But, more importantly for the question of how information from monocular regions might be perceived, amodal completion highlights that there can be a marked dissociation between what is present in the retinal images and the percept we form based on this visual input.
Despite a wealth of literature on the effects of occlusion, almost all have considered it to be a 2D problem. In natural occlusion situations, however, occluded and occluding objects are often at different depths with respect to the observer. This results in differential occlusion between right and left eye (also referred to as da Vinci stereopsis, see e.g., Nakayama & Shimojo,
1990; or Harris & Wilcox,
2009 for a review). When one object occludes another, the foreground object lies closer than the background such that different parts of the background are occluded to the left and right eyes, respectively. This can be demonstrated by holding up a hand in front of the eyes and closing one eye. The viewing eye sees predominantly the hand, a region of the background is partly hidden, or occluded, by the hand. Through the other eye, one can see the hand, but also a different region of the background. This geometric configuration is illustrated in
Figure 1a, which shows a pair of eyes, a foreground occluding object, and a background (viewed from above). The blue dotted line on the figure shows what regions of the background are visible to only the right eye: note the monocular zone to the right of the occluder that is only visible to this eye. The red line shows what parts of the background are only visible to the left eye. Most studies of amodal completion have used stimuli where the two eyes viewed identical scenes. As
Figure 1a shows, this is not common under natural binocular viewing conditions. A very few studies have explored conditions like these. When the two eyes views are slightly different, consistent with a natural scene, amodal completion appears to be faster than when the two eyes views are identical (Bruno, Bertamini, & Domini,
1997). Further, it has been suggested that the perceived depth of background regions may be driven by amodal completion of monocular regions (Grove, Sachtler, & Gillam,
2006).
The existence of specifically monocularly occluded regions, at object boundaries, has been known about since Leonardo da Vinci (see Howard,
2012). More recently, it has been suggested that monocular regions themselves might provide a source of information about the depth between objects. This was coined “da Vinci stereopsis” by
Nakayama & Shimojo (1990), who studied depth perception from monocular regions in detail (see also for example Gillam & Borsting,
1988; von Szily, 1921 [trans. by Ehrenstein & Gillam,
1998]; and for a recent review Harris & Wilcox,
2009). How the visual information contained within each monocular zone is represented has been much less explored.
Our aim here was to explicitly test how the visual system represents information in regions only visible to one eye. To do so we chose two experiments studying two types of occlusion geometries—one in which both monocular and binocular information was visible in a stimulus and the more extreme case in which only monocular information was visible.
Such an extreme occlusion situation occurs when a regular series of foreground objects (like a picket fence) causes all the visible parts of a background object to occur in monocular regions (
Figure 1b). In this case the two eyes' views of the background scene are completely different. Notice how alternate regions of background are visible to just the left eye (red lines), or to just the right eye (blue lines). This viewing situation is akin to the more general phenomenon of binocular rivalry (e.g., Alais, O'Shea, Mesana-Alais, & Wilson,
2000; Blake, Lee, & Heeger,
2009; Blake & Logothetis,
2002). Rivalry occurs if the two eyes view totally different items, the resulting rivalrous percept switches between two different percepts. Yet for monocular regions like those in
Figures 1a and
1b, rivalry typically does not occur (Forte, Peirce, & Lennie,
2002; Marlow,
2012). One study has demonstrated this by measuring contrast sensitivity when the two eyes view image patches containing different visual information. If the regions are surrounded by a binocularly visible contour, contrast thresholds within the patch are similar for each eye, suggesting that there is no suppression of one eye's view (Su, He, & Ooi,
2009). But, in that study, suppression did occur when the visible contour was removed. The question of how visual information is combined, or completed, across such regions has not been previously addressed.
We asked observers to decide whether a scene contained more elements than a comparison scene, and will refer to this as density/numerosity discrimination. There is a debate over whether this kind of task depends critically on the representation of density or of number (e.g., Burr & Ross,
2008; Durgin,
1995; Durgin,
2008; Sophian & Chu,
2008) or whether they are different at all (Dakin et al.,
2011). We are agnostic over this issue; here, we simply chose the task for two reasons. First, because it requires integration of the available visual information across the whole scene. The second reason for choosing this task was that it has been suggested that density discrimination is represented at a level of processing where information has already been integrated between the two eyes (e.g., Durgin,
2001) while neuroimaging studies suggest that amodally completed objects may be represented very early (as early as V1) during visual processing (e.g., Rauschenberger, Liu, Slotnick, & Yantis,
2006).
In
Experiment 1 we asked whether information from spatially distinct monocular regions was integrated across the two eyes, when there was no binocular information available. For this, we studied two extreme but natural occlusion situations, one containing monocular regions (caused by vertical occluders, as in
Figure 1b), the other with binocular occlusions of parts of the background (caused by horizontal occluders).
We compared density/numerosity discrimination for three types of scene geometries: (a) a fully binocular scene where observers viewed a pattern of dots on a neutral gray background (
Figure 2a); (b) a vertically occluded scene (as used by Forte et al.,
2002;
Figure 1b) in which each slice of background was only visible to one eye, but if the two eyes' views were combined all of the background was visible behind a foreground vertical slatted fence (
Figure 2b); and (c) a horizontally occluded scene, where observers viewed a pattern of dots behind a foreground horizontally slatted fence (
Figure 2c). Here, the fence and dotted background were both fully binocular, but half the dots were not visible to either eye as they were occluded by the fence in both eye's views.
In
Experiment 2 we asked whether information in monocular regions is integrated with information in adjacent binocular regions to form a global density/numerosity percept. If the occluding fence in
Figure 1b is moved closer to the background plane, then both monocular and binocular regions of the background become visible (
Figure 1c). We studied density/numerosity discrimination for such scenes, varying the proportion of dots that was visible in monocular and binocular regions.