Over the last few decades, a large number of psychophysical studies have been published on the topic of achromatic color perception, including brightness, lightness, shading, transparency, specularity, and related subtopics. Despite the major advances that have resulted from this work, the neurophysiological mechanisms underlying achromatic color perception remain largely unknown. Various competing neural accounts coexist in the current literature, including theories based on multiscale spatial filters (Blakeslee & McCourt,
1997,
1999,
2001,
2003,
2004), edge integration (Land,
1977,
1983,
1986a,
1986b; Land & McCann,
1971; Reid & Shapley,
1988; Rudd,
2001,
2003a,
2003b; Rudd & Arrington,
2001; Rudd & Zemach,
2002a,
2002b,
2003,
2004; Shapley & Reid,
1985; Zemach & Rudd,
2002,
2003), achromatic color filling-in (Arrington,
1996; Cohen & Grossberg,
1984; Grossberg & Mingolla,
1985; Grossberg & Todorovic,
1988; Grossberg, Mingolla, & Tedorovic,
1989; Neumann, Pessoa, & Hansen,
2001; Ross & Pessoa,
2000; Rudd,
2001; Rudd & Arrington,
2001), and midlevel cues to lightness, such as junctions (Adelson,
1993; Anderson,
1997; Logvinenko,
2002; Todorovic,
1997; Zaidi, Spehar, & Shy,
1997).
One early and influential neurally inspired theory of achromatic color computation that continues to guide research in the field is the Retinex theory of Land and McCann (
1971; Land,
1977,
1983,
1986a,
1986b). Retinex theory was devised to compute the colors—or lightness in the achromatic domain—of regions within Mondrian patterns: images made up of patches having homogeneous luminance and wavelength composition separated by sharp borders.
According to the Retinex theory, lightness computation occurs in a series of stages involving the following: (1) edge extraction, to compute local luminance ratios at luminance borders; (2) edge integration, to combine luminance ratios across space to establish a scale of relative lightness values for the regions lying between borders; and (3) anchoring, to relate the relative lightness scale computed in Stage 2 to the scale of real-world reflectances.
The results of a number of recent psychophysical experiments (Schirillo & Shevell,
1996; Bruno, Bernardis, & Schirillo,
1997; Li & Gilchrist,
1999) have been interpreted as supporting a particular lightness anchoring rule known as the highest luminance rule. The highest luminance rule states that the highest luminance in the scene appears white and the achromatic colors of all of the other regions within the scene are determined relative to the white point.
The highest luminance rule originated with the work of Wallach (
1948,
1963,
1976) and was later adopted in the lightness computation algorithms of Land, McCann, and Horn (Land & McCann,
1971; McCann,
1987,
1994; Horn,
1977). In the current literature, the highest luminance rule is probably most closely identified with Gilchrist's influential Anchoring theory of lightness perception (Gilchrist et al.,
1999; Li & Gilchrist,
1999).
Anchoring theory, like Retinex, assumes the existence of separate processes that function to establish the scale of relative lightness and to map the scale of relative lightness onto the scale of perceived lightness. Gilchrist refers to these two processes as scaling and anchoring. In Retinex theory, the edge integration serves to establish the ratio scale of relative lightness. Anchoring theory does not commit to a specific mechanism for computing the ratio scale. The main focus of Anchoring theory is on the anchoring rules that map relative lightness onto absolute lightness values.
Anchoring theory extends the classical theories of lightness anchoring proposed by Wallach, Land and McCann, and others, by positing the existence of multiple anchoring frameworks. The multiple frameworks hypothesis asserts that the lightness standard applied to any given region in the image is determined by a compromise between local and global applications of the highest luminance rule. This principle was originally introduced by Kardos (
1934), who referred to it as “co-determination” (see also Gilchrist et al.,
1999).
Anchoring theory also postulates a new anchoring principle,
relative area, which states that larger areas appear lighter and that the lightness of a target region within an image will be increased by increases in the areas, but not the luminances, of regions that are lower in luminance than the target region (Li & Gilchrist,
1999). Thus, according to Anchoring theory, anchoring is achieved by a combination of local and global applications of the highest luminance rule and the relative area rule.
The anchoring problem is central to much recent cognitive and computational theorizing about lightness perception. It is a problem that any viable neural, or mechanistic, model of achromatic color vision must address. In what follows, we will first present a model of lightness scaling based on a novel edge integration algorithm, which modifies the assumptions of Retinex. The new edge integration algorithm leads to a scaling model in which the lightness of a region depends not only on its luminance, but also on the luminances of surrounding regions. Although the rules of scaling only are changed in our edge integration model, the model generates predictions that are fundamentally at odds with the highest luminance anchoring principle. In particular, our edge integration model predicts that the lightness of the regions of highest luminance in the image will be influenced by regions of lower luminance.
In what follows, we will provide experimental demonstrations of simultaneous contrast effects obtained with incremental targets. We will show that these contrast effects violate the highest luminance anchoring principle but verify the predictions of our edge integration theory. In the
Discussion section, we propose an alternative anchoring principle: one that is consistent with our edge integration model. The new anchoring principle asserts that the lightness anchor does not reliably correspond to any particular luminance value, such as the highest luminance, but instead corresponds to a particular neural activity level: the highest neural activity in the output of the edge integration process.
The experiments reported here are based on a classic achromatic color matching paradigm in which two disks, each surrounded by a ring, are presented side by side on a display. The subject's task is to adjust the luminance of one of the disks as a function of the luminance of the ring surrounding the other disk to achieve an appearance match between the two disks. A number of past studies have investigated the conditions leading to an achromatic color match using such stimuli (Jacobsen & Gilchrist,
1988; Reid & Shapley,
1988; Rudd,
2001,
2003a,
2003b; Rudd & Arrington,
2001; Rudd & Zemach,
2002a,
2002b,
2003,
2004; Shapley & Reid,
1985; Wallach,
1948,
1963,
1976; Zemach & Rudd,
2002,
2003).
The observers in most of these studies were instructed simply to match the disks in appearance. However, Arend and Spehar (
1993a,
1993b) showed that observers are able to make separate judgments of the
lightness (perceived reflectance) and
brightness (perceived physical intensity) of the target when viewing similar displays consisting of square-shaped targets surrounded by frames, if specifically instructed to do so. In the present paper, we will investigate the properties of both
naïve matches (i.e., matches in which the observer is given to no special instructions to judge either brightness or lightness) and lightness matches, specifically. (Thanks to Prof. Davida Teller for suggesting the term “naïve match.”) Brightness matches will be studied, and their properties compared to those of lightness and naïve matches, in an upcoming paper (Rudd, Zemach, & Heredia,
2005). Ultimately, the properties of all of these types of matches—as well as the relationships between them—need to be understood if the extant literature is to be fully addressed by a theoretical model.
In what follows, we will use the term achromatic color to refer to the attribute that is matched in naïve appearance matching experiments. This term avoids the ambiguity of the common alternative term “brightness,” which is sometimes used to refer to perceived luminance (as in the present study) and at other times used to refer to the attribute that is matched in naïve matching experiments. We will also use the term achromatic color when we wish to refer to the general category of neutral color percepts that includes lightness and brightness as specific dimensions or attributes. The meaning should be clear from the context. When we wish to refer to lightness or brightness, in particular, we will be careful to use those labels.
The results of several naïve appearance matching experiments performed with disk-and-ring stimuli have been successfully modeled with algorithms based on the assumption that the disk color is computed from a weighted sum of either the Michelson contrasts or the logarithms of the luminance ratios at the inner and outer ring borders (Reid & Shapley,
1988; Rudd,
2001,
2003a,
2003b; Rudd & Arrington,
2001; Rudd & Zemach,
2002a,
2002b,
2003,
2004; Shapley & Reid,
1985; Zemach & Rudd,
2002,
2003). Reid and Shapley (
1988) proved mathematically that these two contrast measures are approximately equal in the low contrast limit. In a recent paper (Rudd & Zemach,
2004), we presented evidence favoring the weighted log luminance ratio edge integration model over a model based on Michelson contrast for high contrast stimuli.
The visual stimulus used in the main experiment of our previous study consisted of two disk-and-ring patterns of equal size presented side-by-side on a flat panel monitor. The luminance of each surround ring was higher than that of the disk that it surrounded and the stimuli were presented against a dark background. Our stimulus was similar to the one used in the classic “lightness constancy” experiments of Wallach (
1948,
1963,
1976). The appearance of one of the disks—the test disk—was varied by manipulating the luminance
of the ring surrounding that disk. The subject's task was to adjust the luminance
of the other disk—the matching disk—to match the two disks in their achromatic color. The luminance
of the test disk and the luminance
of the ring surrounding the matching disk were fixed, as was the background luminance
B.
The data from our previous study involving decremental targets was modeled with the following achromatic color matching equation, which states the condition that should yield a match according to the edge integration model:
In
Equation 1,
and
are linear weighting coefficients that determine the strengths of the induction signals originating from the inner and outer ring borders, respectively.
Solving
Equation 1 for
yields the following expression for the log luminance of the observer's matching disk settings:
In this paper, we will be mainly interested in how the test ring luminance
influences the matching disk luminance
.
Equation 2 predicts that a log–log plot of matching disk luminance as a function of test ring luminance will be a straight line. According to the theory, the slope of the plot equals the ratio of the weight given to the outer ring edge to the weight given to the inner ring edge minus one.
A least-squares fit of
Equation 2 to the achromatic color matching data from our previous study using decrements produced
statistics in the range 0.90–0.95 for each of four psychophysical observers, with the ratio
varying as the only free parameter of the model (Rudd & Zemach,
2004,
Experiment 1). Estimates of the induction strength ratio
were less than 1.0 for all four observers and decreased with increasing ring width (Rudd & Zemach,
2004,
Experiment 2). Thus, the plots corresponding to
Equation 2 in that study always exhibited negative slopes. This finding is consistent with the assumption that the edge weights
wi decay with distance (Reid & Shapley,
1988; Rudd,
2001,
2003a,
2003b; Rudd & Arrington,
2001; Rudd & Zemach,
2002a,
2002b,
2003,
2004; Shapley & Reid,
1985; Zemach and Rudd,
2002,
2003). We will refer to an edge integration model that makes this ancillary assumption as the distance-dependent edge integration model. For additional ideas about the rules governing edge integration, the reader is referred to papers by Gilchrist (
1988) and Ross and Pessoa (
2000).
Note that our distance-dependent edge integration model predicts that the slope of the
versus
plot will lie in the range −1≤ slope ≤ 0. The slope −1 corresponds to the limiting case in which the weight
given to the outer edge is zero. The slope 0 corresponds to the limiting case in which
; the weights given to the two edges are equal.
We next show that the distance-dependent edge integration model corresponding to
Equations 1 and
2 predicts violations of the highest luminance anchoring principle, and that the predicted violations are in fact observed experimentally.
In the current study, the test and matching disks were always increments. That is, they were surrounded by rings of lower luminance. For such stimuli, according to
Equation 2, it should be possible to set the luminances
and
of the test and matching disks to be equal, and furthermore to have both luminances be the highest luminance in the scene, and yet fail to achieve an achromatic color match, provided that
. This prediction of the distance-dependent edge integration model directly contradicts the principle that the highest luminance is always perceived as white because in this case
= the highest luminance, yet
and
are predicted to differ in achromatic color. Not only does the distance-dependent edge integration model predict violations of the highest luminance rule; it also predicts violations of a more general principle: that any two image regions that both have the highest luminance will have the same color appearance regardless of whether that color is perceived as white or not.
It may be useful to point out that any luminance-based anchoring rule serves two different functions in a theory of lightness perception. The first function of the anchoring rule is to specify one particular luminance—the anchoring luminance—that will not vary in appearance when either the spatial structure of the image or the illuminant is varied. According to the highest luminance rule, that luminance is the highest luminance in the image. Because the appearance of the anchoring luminance is invariant with respect to changes in either the spatial image structure or the illuminant, it follows that the highest luminance regions of any two images will always match in appearance. More importantly for present purposes, the highest luminance rule also predicts that any two regions within the same image that have the highest luminance will also match.
The second function performed by any luminance-based anchoring principle is to provide a perceptual label for the invariant appearance of the anchoring luminance. According to the highest luminance rule, the anchoring luminance is seen as “white.”
Many discussions of lightness are anchoring focus on the phenomenological aspects of the problem: What is the reported appearance of a given region within the display? Thus, they focus on the second role of anchoring. For example, in an elegant study, Li and Gilchrist (
1999) demonstrated that in a visual environment consisting of only two surfaces—paints having two reflectances applied to the inside of a dome inside which the observer's head was placed—the higher reflectance always appeared white, regardless of the actual physical reflectance of the paint corresponding that had the higher reflectance. The authors interpreted their findings as supporting the highest luminance rule. But to break the highest luminance rule, it suffices to show that there exist conditions in which two patches, each having the highest luminance, do not match in appearance. In other words, it suffices to show that the appearance of the highest luminance region is not invariant with respect to changes in its spatial context. This can be accomplished using the technique of asymmetric color matching, without the need to directly address the question of phenomenology. The phenomenal appearance of the disks is important, of course, and we will revisit that topic in the
Discussion.
Returning to the quantitative predictions of the distance-dependent edge integration, let us now suppose that we set
in the disk-and-ring display. According to
Equation 2, it should then be possible to choose a pair of luminance values
and
such that, at the match point,
is the highest luminance in the scene, yet the achromatic color of the matching disk is determined entirely by the three luminances
, and
, all of which are lower than the luminance
. If the test ring luminance
, in particular, is manipulated, the distance-dependent edge integration model predicts that the test disk appearance will be subject to a contrast effect. The slope of the
versus
plot should be nonzero and negative. The highest luminance rule, on the other hand, predicts that the appearance of the highest luminance regions should be invariant with respect to changes in the test ring luminance. Thus there should be no contrast effect and the slope of the
versus
plot should be zero. If we can show that the appearance matching with increments yields a contrast effect characterized by a linear
versus
plot with a slope lying between 0 and −1, we will not only have provided evidence that is quantitatively consistent with the edge integration model, we will also be able to firmly reject the highest luminance anchoring rule as a general principle of achromatic color anchoring. This we do in
Experiment 1.
Before presenting the details of that experiment, it is worth considering whether Gilchrist's idea of multiple anchoring frameworks can save the highest luminance rule in the event that a contrast effect is observed. To this point, we have been assuming that the highest luminance anchoring is applied globally to the display as a whole. In other words, a single luminance serves to anchor both the test and matching disk-and-ring configurations. Another plausible hypothesis is that separate local highest luminance anchoring rules might be applied independently to the two sides of the display. Thus, a disk which is the highest luminance within its own “framework” would always appear white. We will refer to this hypothesis as local highest luminance anchoring to distinguish it from the alternative global highest luminance anchoring hypothesis.
For present purposes, we will assume that a framework comprises a single disk-and-ring configuration. The idea that achromatic color computations are made independently within the two separate disk-and-ring configurations is implicit in Wallach's (
1948,
1963,
1976) interpretation of his own achromatic color matching results. As is well known, Wallach studied achromatic color matching with disk-and-ring stimuli in which the disks were always luminance decrements with respect to their surrounds and obtained the famous result that the two disks matched in achromatic color if and only if the disk/ring luminance ratios on the two sides of the display were approximately equal (but see Rudd & Zemach,
2002a,
2002b,
2004, for evidence of systematic deviations from the ratio rule in experiments using decremental test stimuli).
Wallach explained the approximate ratio matching behavior that he observed in terms of lightness constancy. He proposed that when the disk/ring luminance ratios are equal on the two sides of the display, the observer implicitly assumes that they are two identical disk-and-ring “objects” illuminated by light sources of different intensities. The rings on the two sides of the display are seen as having equal “white” reflectances and their luminance differences are inferred by the visual system to result from unequal illumination levels on the two sides of the display. Wallach's lightness constancy interpretation of ratio matches continues to exert a considerable influence in the contemporary lightness literature.
Applying this logic to the current experiment leads to the prediction that the two disks should both appear white regardless of whether they have the same luminance, as long as they are both increments, because they each have the highest luminance on their side of the display. Thus, if the independent frameworks hypothesis is correct and furthermore the highest luminance anchoring rule is applied strictly locally, then any incremental matching disk luminance should provide an acceptable and, in fact, equally good appearance match to the test disk. If such were the case, the subject could not reliably perform the matching task with incremental stimuli.
Whether anchoring is applied globally to the entire image, or locally within separate frameworks, the highest luminance anchoring rule predicts that the test and matching disks in the current study should, at the very least, match in achromatic color when their luminances are equal. In contrast, the distance-dependent edge integration model (
Equation 2) predicts that lightness matches will be obtained if and only if the test disk luminance and the matching disk luminance are unequal, provided that the luminances of their respective surround rings are also unequal. The predictions of the distance-dependent edge integration model thus contradict those of the highest luminance anchoring theory regardless of whether the highest luminance rule is assumed to apply globally or locally within separate frameworks.
Gilchrist's Anchoring Theory hypothesizes that individual regions of a display, such as our test and matching disks, may group to varying degrees with either local or global stimulus configurations in a way that depends on the Gestalt principle of belongingness. But as the preceding arguments demonstrate, the highest luminance principle is fundamentally at odds with the predictions of the distance-dependent edge integration theory, regardless of the degree of local or global grouping assumed by any particular instantiation of the belongingness principle. The distance-dependent edge integration model thus also contradicts Anchoring Theory. It follows that a test of the two different hypotheses—highest luminance anchoring versus distance-dependent edge integration—is possible and that the test can be made independent1y of assumptions about the degree to which anchoring is local versus global.