A general challenge of the visual system is that it receives information about the three-dimensional (3D) world around us, which is projected in two dimensions (2D) onto our retinae. Put differently, the visual system must identify 3D objects (distal stimulus) on the basis of 2D-light distribution on the retinae (proximal stimulus). The difficulty is that different distal stimuli can create the same proximal stimulus. The sensory stimulation alone is thus insufficient to establish the percept. Selection of a single percept from a variety of possible perceptual interpretations therefore requires the use of additional cues and constraints. This process of disambiguation can be best studied using bistable images. The different versions of bistable phenomena share the need for the stimulus to be disambiguated because two or more interpretations cannot be seen simultaneously. To help with this endeavor, further cues in the environment are recruited to determine which of two rivaling stimuli is more likely (e.g.,
Kim & Kim, 2019;
Parise & Ernst, 2017). These cues range from low-level effects, such as lateral interactions between neighboring stimuli, to higher-level top-down effects that increase the salience of one of the rivaling interpretations (
Tong, Meng, & Blake, 2006). These cues can even come from a different modality. Thus olfactory (
Zhou, Jiang, He, & Chen, 2010), auditory (
Einhäuser, Methfessel, & Bendixen, 2017), and tactile (
Lunghi & Alais, 2013) cues influence the visual predominance patterns toward a congruent interpretation. However, the majority of research in this area examines within-modality influences of context, specifically the effect of visual context on visual perception. Such visual context effects can occur at different levels of visual processing. Visual-spatial cues are particularly relevant cues for the observer because they show the observer how different elements in a visual scene are related to one another regarding position, distance, and orientation. They provide important information about the structure and organization of a given scene, which can influence our percepts and actions.
In the case of two rivaling targets, two opposing spatial context effects have been reported: a matching boost effect (the target that is more similar to the context is favored and will become dominant, e.g.,
Grossmann & Dobbins, 2003) and a nonmatching boost effect (the target that is more different from the context is selected, e.g.,
Carter, Campbell, Liu, & Wallis, 2004). In the following paragraphs, an overview of the findings will be presented. Specific research articles will be highlighted to underline differences in resulting predominance biases because of presentation style, ambiguity of context, and contrast; for a more detailed overview, please refer to
Table 1. Most notably, both matching and nonmatching boost effects could be found, each typically produced by a different type of paradigm.
When using side-by-side context configurations (e.g.,
Grossmann & Dobbins, 2003), matching boost effects dominate, whereas center-surround configurations (e.g.,
Carter et al., 2004) tend to favor nonmatching boost effects, although less consistently (e.g.,
Graf & Adams, 2008). The matching boost effect in studies using side-by-side configurations is quite robust, and it appears regardless of variations in the stimulus configuration and response mode (i.e., whether participants have to report coupling or the perceptual dominance of either target) (e.g.,
Alais & Blake, 1999;
Grossmann & Dobbins, 2003).
In center-surround configurations, spatial context is usually provided as an annulus that surrounds the central ambiguous target (
Paffen, Tadin, te Pas, Blake, & Verstraten, 2006). However, it can also consist of a surrounding grating that fills the entire screen (
Carter et al., 2004). In the case of high-contrast static targets and surrounds, a bias for the targets not matched to the surround can be found (e.g.,
Carter et al., 2004;
Fukuda & Blake, 1992;
Ichihara & Goryo, 1978;
Paffen, Alais, & Verstraten, 2005). However, there also seem to be some exceptions to this bias. For example, in the case of motion stimuli, the bias can be in the opposite direction (i.e., the motion that is most similar to the motion of the annulus is perceptually dominant) (
Baker & Graf, 2010). The preference for the nonmatching target in the case of center-surround configurations in high-contrast conditions has been suggested to be due to figure-ground segregation. This is suspected to be due to lateral inhibition (of the matching grating) (
Cavanaugh, Bair, & Anthony Movshon, 2002;
Polat & Sagi, 1993), which suggests that at some level of visual processing, neurons that are strongly stimulated by a high-intensity background of the rivalry target suppress the less strongly stimulated neurons responding to the inner grating, the rivalry target. Alternatively, the figure-ground segregation could also be explained by results from single-unit neurophysiological experiments (
Jones, Grieve, Wang, & Sillito, 2001; see
Alais, 2012) showing that the response of an optimally driven cell is reduced when it is surrounded by a matched grating (
Jones et al., 2001) whereas a surrounding of an orthogonal response can cause a supra-optimal response, leading to disinhibition or even facilitation (here: of the nonmatching grating) (
Sillito, Grieve, Jones, Cudeiro, & Davls, 1995). These explanations are not mutually exclusive and could work in unison.
But, as mentioned above, the nonmatching boost effect does not always prevail in center-surround configurations. Evidence suggests that it is modified by contrast. The effect is especially strong when the contrast of center and surround is high. However, when center and surround are presented at a low-contrast, the effect seems to be reduced or even reversed. Paffen and colleagues investigated center-surround interactions in orientation, motion, and color in a BR paradigm (
Paffen et al., 2006). The central rivalry targets and the surround were both either of high or low-contrast. When participants viewed the surround at high-contrast, target gratings matched to the surround were less dominant than a nonmatching rivalry target. However, when presenting the target and surround at low-contrast, this effect changed. How much it changed depended on the specific stimulus property. In the case of color, the nonmatching boost effect disappeared when low-contrast was used. In the case of orientation and motion, the effect was reversed: Reducing the contrast for these properties produced a matching boost effect (
Paffen et al., 2006). These results suggest that in situations with lowered discriminability of the rivalry target or the surround, the surround effect can be weakened and even turned around into a facilitative effect. This could be due to the fact that low discriminability requires evidence to be collected over a wider retinal region to determine the percept (
Sceniak, Ringach, Hawken, & Shapley, 1999). This facilitative effect could also occur when low discriminability is operationalized differently, such as through ambiguity (i.e., an ambiguous surround). In such cases, the unclear perceptual situation may necessitate the integration of information from both the center and the surround. Similar to low-contrast situations, a matching boost effect might thus prevail in case of an ambiguous surround. As seen by the empty rows in
Table 1, the effect of surround ambiguity has not been investigated for center-surround configurations, with one inconclusive exception (
Fukuda & Blake, 1992). Although their study used an ambiguous surround, the surround either completed none of the central stimuli or completed both central stimuli within the same condition. Thus it is impossible to draw conclusions concerning a nonmatching or matching boost effect from their ambiguous condition.
As discussed, spatial context can lead to either a matching boost bias or a nonmatching boost bias. Both of these effects can be due to one grating increasing in the average dominance duration, the other grating decreasing in dominance duration, or a combination of both. According to the revision of Levelt's second law (
Brascamp et al., 2015;
Levelt, 1965), this change results from an increase in the average dominance duration of the stronger stimulus (in our case, the more-facilitated or less-suppressed stimulus). In the case of a nonmatching boost bias, previous literature shows effects driven by an increase in the dominance duration of the nonmatching target or a combination of both an increase of the nonmatching and a decrease of the matching target (e.g.,
Carter et al., 2004;
Ooi & He, 2006;
Paffen, te Pas, Kanai, van der Smagt, & Verstraten, 2004;
Paffen, van der Smagt, Pas, & Verstraten, 2005).
Although contextual influence can be based on different processes, it can also occur at different points of the visual processing stream. One way to investigate whether processing at lower or higher levels affected the predominance of a target is to observe differences between intraocular (relevant surround and center are in the same eye) or interocular (different eye) conditions. A stronger effect in the intraocular condition would implicate lower-level effects, whereas the presence of an interocular effect can be taken as an indicator of involvement of higher-level processes. Evidence for these effects comes from a binocular rivalry experiment, in which Paffen and colleagues found an increase in suppression depth (
Paffen, Alais, et al., 2005): The suppression depth was the weakest in the interocular condition. It then grew stronger during the intraocular condition and was the strongest during the binocular (surround in both eyes) condition (
Paffen, Alais, et al., 2005). Generally, the literature reporting ocularity effects agrees on a stronger intraocular and binocular center-surround interaction compared to interocular interactions (
Carter et al., 2004;
Dieter, Melnick, & Tadin, 2015;
Grossmann & Dobbins, 2003;
Sobel & Blake, 2002).