In
Experiment 1a, material identification functions (based on object-correct proportions) display a systematic asymmetry across conditions (
Figure 8A). This is seen within individual subplots by comparing the six test objects on the lower reflectance side of their standard to the corresponding six test objects on the higher reflectance side of the same standard. Identification functions have decreased proportions of object-correct responses with lower reflectance test sets under full illumination and with higher reflectance test sets under lower illumination. For these conditions, material identification curves require a greater reflectance difference to reach threshold.
If observers were to perfectly account for illuminant changes when identifying materials across the different environments, their ability to identify the correct object would be limited solely by their ability to discriminate within each compartment. Threshold levels for discrimination and material identification functions would therefore be at the same reflectance. We tested this hypothesis by analyzing the distribution of incorrect responses for each set of six test objects on both reflectance sides of each standard. Under this hypothesis, the incorrect object identification responses should be randomly distributed among the three standard objects. In other words, if identification errors are made only when observers cannot discriminate between the objects, observers will choose at random, and the incorrect responses will be evenly distributed among the three wrong answers. The number of [incorrect-side] responses should then be twice that of [incorrect-object given correct-side] responses. If this hypothesis is incorrect, a larger portion of responses will result in [correct-side but incorrect-object] responses (for a fuller derivation, see Robilotto & Zaidi,
2004a). Maximum likelihood estimates were calculated based on this hypothesis and then compared to the observed data with a chi-square test. With the given parameters and 6 degrees of freedom (one for each test reflectance level), the critical value of
χ2 at the 0.01 significance level is 16.81. Each
χ2 value from
Experiment 1 is shown in
Figure 8A under the data set from which it was obtained. Values greater than the critical value are in bold and labeled with asterisks, signifying conditions where the hypothesis that identification is limited only by discrimination can be rejected. The pattern of
χ2 values from
Experiment 1a is systematic and similar for the two observers. In the variable mean reflectance conditions, when the test objects under full illumination have lower mean reflectances than the standards, and when test objects under reduced illumination have higher mean reflectances than the standards, the above hypothesis is always rejected (16/16 times combined for both observers). The hypothesis generally fails to be rejected in conditions when test objects under full illumination have higher mean reflectances than the standards, and when test objects under reduced illumination have lower mean reflectances than the standards (rejected 2/16 times combined for both observers). These asymmetries are similar to those found for plain gray papers (Robilotto & Zaidi,
2004a).
One problem with this statistical method is that if there is a bias in choosing the side containing the test, the distribution of errors could be affected. For example, if the difference between standard and test is less than or close to the mean reflectance threshold, observers could systematically choose the darker side when they cannot discriminate brightness differences. This seems to be happening in
Experiment 1a and can be seen in the side-correct functions when comparing conditions in which the test objects were on the side of full illumination versus reduced illumination (i.e., upper versus lower subplot). Conditions of reduced illumination generally have higher relative proportions of correct-side responses, especially when the difference between test and standard is low. This results in steeper functions and lower reflectance thresholds for the reduced illumination conditions.
As an example of the illumination-based side-correct bias, look at the pair of plots for BV for the standards with a fixed contrast of 0.0 and fixed mean of 0.34. Here, the side-correct response rates under full illumination start off well below chance, whereas side-correct response rates for the reduced illumination condition begin near threshold levels even for the lowest test differences. Due to this bias, in this pair of subplots, the chi-square tests are rejected in all four conditions.
The overall trend across both observers in
Experiment 1a can be seen in
Figure 9A. Here, mean reflectance levels that give side-correct thresholds are plotted for conditions of full illumination versus reduced illumination. In the plots, data points from BV are represented by squares and JC by triangles. Test objects under reduced illumination resulted in lower thresholds for 12 of 16 conditions. These thresholds are in reflectance units and would be expected to be independent of illumination level by Weber's law. When reflectance differences between standard and test are low, observers are more likely to report object materials as different when they are under the lower illumination.
The discrimination bias does not lead to a simple asymmetry in the identification thresholds (
Figure 9B). Instead, results are grouped in two separate distributions on either side of the unit diagonal with half of the points for each observer in each group. To ascertain that the comparisons between the shapes of discrimination and identification curves are not misleading, we present another way to quantify the asymmetries in the material identification curves.
Another way to examine the asymmetries of material identification curves across illuminants is to compare mean reflectance thresholds for +Δ tests (higher mean than the standard) versus −Δ tests (lower mean than the standard). This is done for conditions in which the tests are under full illumination (
Figure 10A) and conditions in which the tests are under reduced illumination (
Figure 10B). In the plots, data points from BV are represented by squares and JC by triangles. Mean reflectance thresholds when test objects are under full illumination are higher for −Δ tests than for +Δ tests, whereas mean reflectance thresholds when test objects are under reduced illumination are higher for +Δ tests than for −Δ tests. Therefore, mean reflectance thresholds for object-correct functions agree with the material identification asymmetries shown by the chi-square tests.
From the asymmetries in the results of
Experiment 1a, the question arises whether a single perceptual strategy can account for both identification failures and successes. When looking at the material identification data, it is helpful to think about possible strategies observers could use. The difference in illumination between the two compartments (a factor of 4) is much greater than the difference in mean reflectance between standards and test (a few percent). Whenever the test object is in the compartment with full illumination, the object on that side with the highest mean reflectance will be most different in mean luminance, whether or not it is the odd object. Conversely, whenever the test object is in the compartment with reduced illumination, the object on that side with the lowest mean reflectance will be most different in mean luminance. Therefore, if observers made judgments based strictly on the mean luminances of the objects alone, they would respond correctly half the time; that is, when higher mean reflectance tests are under full illumination and when lower mean reflectance tests are under reduced illumination. Under the other half of stimuli conditions, they would consistently choose the incorrect objects and the proportion of correct responses would fall below chance towards zero.
On the other hand, there is sufficient information in the displays to perform the task accurately using reverse optics. If an observer could take precise mean luminance ratios between the backgrounds, this would provide an estimate of the relative intensities of the two illuminants. This estimate could be used to discount the mean luminance differences between the objects. In this case, identification across compartments would be as good as discrimination within a compartment.
The actual data show that observer responses are between these two extremes. One possibility is that observers are basing their responses on perceived brightness. We propose a model where the brightness of a stimulus is equal to its mean luminance multiplied by a scalar gain whose value is a monotonically decreasing function of mean luminance within a compartment (Hayhoe, Benimoff, & Hood,
1987; Zaidi, Shapiro, & Hood,
1992). Because the backgrounds of the compartments have equal mean reflectances, their luminance values can be related to the amount of illumination they receive. According to this model, the gain (
g) is equal to 1.0 when the compartment's mean luminance (
L) is equal to 0.0 and declines monotonically as illumination increases. The rate of decline is governed by the free parameter (
κ), that is,
g =
κ / (
κ +
L).
If observers were to make decisions of lightness identification based on perceived brightness dissimilarities, their response results would vary depending on the amount of adaptation.
Figure 11 shows examples of four such pairs of response curves in which perceived brightness is being adapted to four different degrees based on the value of
κ. The top row simulates test objects under full illumination, whereas the bottom row simulates test objects under reduced illumination. The leftmost pair (
κ =
∞) represents an observer with no brightness adaptation, in which lightness dissimilarities will be based strictly on mean luminance. Here there is complete asymmetry between the brightness discrimination and lightness identification curves. The direction of the asymmetry depends on whether the tests are under full or reduced illumination. The rightmost pair (
κ = 0) represents complete brightness adaptation, in which the standards will appear equal across illuminants. Here, lightness identification curves will be symmetrical and lightness identification will be limited only by brightness discrimination within illuminants. The asymmetry seen in the actual data of
Experiment 1 is somewhere in between these two extreme examples. The two middle pairs of
Figure 11 show that when a certain amount of adaptation occurs, the standard in the test's compartment will appear closer to the standards on the other side when the difference between standard and test is high. With more adaptation (lower
κ), less difference between standard and test is needed for correct identification. This model approximates the asymmetries seen in
Experiment 1a and also explains why object-correct proportions can be below chance level. This suggests that perceived brightness is the factor observers use to identify reflectance across illuminants. This is tested directly in
Experiment 2 by having observers explicitly identify brightness differences.
A question of particular interest with using real patterned objects is how well do the patterns aid in identifying lightness. This can be examined by comparing results for standard stimuli in
Experiment 1a that were the same in mean reflectance and differed in reflectance contrast.
Figure 12 compares lightness identification for the 0.0 reflectance contrast (plain gray) stimuli to the 0.46 reflectance contrast (patterned) stimuli. Squares represent the mean reflectance thresholds for observer BV, whereas triangles represent observer JC's thresholds. Patterned conditions have lower thresholds in 10 of the 16 conditions. This is suggestive but not significant by Fisher's randomization test. Under the experimental conditions in this study, pattern contrast does not appear to improve lightness identification significantly. It also seems that perceived brightness is not affected by perceived contrast in the material identification task.