Although our results are consistent with the coplanar ratio principle to the extent that it emphasizes the importance of coplanar relations, the grouping by illumination hypothesis provides a better theoretical framework for understanding the majority of our findings. For example, the effect of surroundedness (
Experiments 1 and
4), as well as the modest effect of coplanarity in the absence of adjacency (
Experiment 3), contradicts the adjacency requirement of the coplanar ratio principle.
In addition, while the coplanar ratio principle proposed by Gilchrist (
1980) treated planarity as an all-or-none variable, the grouping by illumination hypothesis suggests a graded view of planarity, which is more consistent with the results of a number of previous studies (Gogel & Mershon,
1969; Wishart, Frisby, & Buckley,
1997) as well as our own findings. In the coplanar only condition of
Experiment 3, the strength of coplanar ratios in determining target lightness is reduced, but not eliminated, when the target is not coplanar with its retinal background, but appears to float in front of it in a different plane. The graded interpretation of coplanarity is also consistent with the hypothesis proposed by Gilchrist and Radonjić (
2006) according to which surfaces that are parallel and facing the same direction are (perhaps weakly) grouped for lightness even if they are not immediately adjacent or coplanar. Indeed, the grouping by illumination hypothesis can be understood as a more general principle that integrates this
relaxed version of the coplanar ratio principle.
A group of surfaces treated by the visual system as sharing a common illumination level is roughly equivalent to the notion of a framework in the anchoring theory (Gilchrist et al.,
1999).
Across experiments, we obtained strong effects of perceived depth on lightness with either minimal (
Experiment 1) or no change in the retinal image (
Experiments 2 and
3). Thus, any lightness account that is strictly based on the processing of the retinal image, such as classic local contrast accounts that favor lateral inhibition (Cornsweet,
1970; Jameson & Hurvich,
1964) or recent, more sophisticated spatial filtering models, such as the ODOG model of Blakeslee and McCourt (
1999,
2003) or the model of Robinson, Hammon, and de Sa (
2007) fail to account for our data.
Our results are, in general, consistent, with a number of more recent studies showing the perceived three-dimensional arrangement plays a significant role in lightness perception (Bloj et al.,
2004; Bloj, Kersten, & Hurlbert,
1999; Boyaci, Doerschner, & Maloney,
2006; Boyaci, Maloney, & Hersh,
2003; Ikeda, Shinoda, & Mizokami,
1998; Knill & Kersten,
1991; Logvinenko & Menshikova,
1994; Pessoa, Mingolla, & Arend,
1996; Ripamonti et al.,
2004; Spehar, Gilchrist, & Arend,
1995; Taya, Ehrenstein, & Cavonius,
1995).
According to Boyaci, Doerschner, and Maloney (
2004; Boyaci et al.,
2003) and Bloj et al. (
2004; see also Ripamonti et al.,
2004), in order to judge surface lightness the visual system first creates an illumination model in a given scene in terms of intensity, chromaticity, and spatial variation, based on cues to the illumination available in the image (Boyaci et al.,
2006; Maloney,
2002) and then removes the illumination component from luminance, to derive surface lightness. This “discounting the illuminant” hypothesis, similar to the classic Helmholtz's (
1868/1924) idea of inferring the illumination, differs from the “grouping by illumination” hypothesis in its requirement that parameters of the light source, such as intensity and direction, must be explicitly estimated by the visual system, a requirement we regard as computationally expensive.
Ikeda et al. (
1998; see also Cunthasaksiri, Shinoda, & Ikeda,
2004) propose a similar account, according to which, based on information available in the retinal image, the visual system constructs three-dimensional regions of illumination (Recognized Visual Space of Illumination, RVSI) and then uses the properties of this space (intensity, chromaticity) to judge the lightness and color of surfaces within that space. Ikeda et al. thus propose an interesting idea that the visual system tends to compare surfaces within a volume of three-dimensional space, not within a plane. However, it seems to us that the RVSI hypothesis would have predicted a stronger effect of surroundedness in both coplanar and surrounded conditions of
Experiment 4, because one would assume that the target is a member of the same RVSI as the surrounding border.
We believe that, overall, the pattern of results we report can be best understood by the assumption that the visual system computes lightness by grouping together surfaces in the image that are equally illuminated, comparing the luminance values of surfaces within the group, and mapping these relative luminance values onto the gray scale using anchoring rules. However, in any case, our study contributes to an understanding of the conditions under which target lightness changes with a change in spatial relations and provides data that need to be accounted for by any comprehensive theoretical account of lightness computation in complex three-dimensional scenes.