Object block response counts summed across observers (
Figures 6a,
6b) show that the correct option had the most responses in all cases except for the large change toward yellow for the flat object. Nearly all of the incorrect responses were in the response options adjacent to the correct option. There was some tendency to report less of a change in the yellow direction than what is shown in the stimulus, both in general and in comparison to changes toward blue. On average, discriminability of color changes in the stimuli (
Figure 6c) did not show a statistically significant difference in paired
t-tests between the two object blocks or between yellower and bluer changes within blocks.
Figures 7a–c shows the response counts from the scene blocks for each of the 3 × 3 = 9 possible stimulus changes. The correct option again had the most responses in most cases, and incorrect responses were mainly for response options adjacent to the correct option. There was, however, also interaction between the responses to the two types of stimulus change. Most notably, observers were most accurate when there was either no change or a simultaneous change in both illumination and reflectance toward the same color. In contrast, observers performed worst when both illumination and reflectance changed toward opposite colors (and the reflected color thus stayed the same), often reporting no change in reflectance in this case. For comparisons of average discriminability in scene blocks (
Figure 7d), observers were better at discriminating illumination changes than reflectance changes (flat:
t(8) = 2.42,
p = 0.046; 3D matte:
t(8) = 4.41,
p = 0.007; 3D glossy:
t(8) = 2.81,
p = 0.046). Discriminability of reflectance changes was better for the flat scene compared to the 3D matte scene (
t(8) = 4.23,
p = 0.009). Discriminability of illumination changes did not show a significant difference between scene blocks.
To illustrate how correct identification of reflectance change depends on the correct identification of illumination change, cross-tabulated responses of scene blocks are presented in
Figures 8a–c. Chi-square tests clearly rejected the null hypothesis of independence for the 3D matte (χ
2(3,240) = 14.75,
p < 0.001) and 3D glossy (χ
2(3,240) = 35.06,
p < 0.001) scene blocks and, less decidedly, for the flat scene block (χ
2(3,240) = 4.41,
p = 0.036). The independence of correct identifications varied between different color change comparisons: Across all scene blocks, independence was not rejected for comparisons where illumination change was accompanied with either a change toward blue or no change in reflectance, but it was rejected for all other comparisons (all
p < 0.05). Overall, the chi-square tests rejected independence more often than not. However, on average, when the illumination change was not correctly identified, the reflectance change was still correctly identified half of the time—compared to the chance level of one third. Additionally, average conditional
d′ are presented in
Figure 9. In all scenes, discriminability was better with correct identification of illumination change, but only if there was no illumination change in the stimulus (flat:
z = 3.33,
p < 0.001; 3D matte:
z = 3.60,
p < 0.001; 3D glossy:
z = 3.85,
p < 0.001). Otherwise, discriminability did not show a statistically significant difference, and performance was always clearly above chance level.
Next, we binned the trials based on the reference reflectance and illumination into three categories: the yellow, neutral, and blue categories. We then computed discriminabilities
d′ and response criteria
c for reflectance and illumination changes. When trials were split by reference illumination, the decision criterion (
Figure 10a) was lower for illumination changes away from neutral than toward neutral (yellow reference:
z = −12.52,
p < 0.001; blue reference
z = 15.80,
p < 0.001). Note that as all combinations of reference value and stimulus change were analyzed separately, the response criteria for blue and yellow changes are expected to change in opposite directions: For a yellow reference value, a yellow change was away from neutral and a blue change toward neutral, and vice versa for a blue reference. A response criterion of 1 corresponds an optimal or unbiased criterion, and a criterion below 1 indicates a bias to respond with the respective color change. The observed criterion shifts thus mean that observers were more likely to respond that the illumination change was away from the neutral color than toward it. In conjunction, discriminability for illumination change (
Figure 10c) was higher for changes away from neutral than toward neutral (yellow reference:
z = −5.35,
p < 0.001; blue reference
z = −3.54,
p < 0.001). In other words, observers were not only more likely to respond that the illumination change was away from neutral but also more sensitive to these changes. The response criteria for reflectance (
Figure 10b) followed a similar pattern as a function of reference illumination (yellow reference:
z = −3.40,
p = 0.001; blue reference
z = 3.96,
p < 0.001), but there was no significant difference in sensitivity for reflectance changes (
Figure 10d).
When trials were binned by reference reflectance, the criteria for reflectance (
Figure 10f) were higher for changes toward neutral than away from neutral (yellow reference:
z = −13.96,
p < 0.001; blue reference
z = 11.44,
p < 0.001). For illumination changes, the pattern was different (
Figure 10e): The criterion was higher for illumination changes toward yellow in the yellow reference group (
z = 9.11,
p < 0.001). The observers were thus more likely to report blue illumination changes when the reference reflectance was yellow. There were no significant differences in sensitivity for either reflectance or illumination changes (
Figures 10g,
10h).
Finally, we quantified the amount of cross-talk in the processing of reflectance and illumination cues by fitting a two-dimensional signal-detection model to the data. The model has two hypothetical mechanisms, a “reflectance mechanism” and an “illumination mechanism,” whose directions in stimulus space determine how the two cues affect the responses. The raw data for the model fits were the response counts from the scene blocks, shown in
Figures 7a–c. Note that this model does not take into account the asymmetries demonstrated in
Figure 10, nor does it predict the specific pattern of conditional
d′-values in
Figure 9. The vectors depicting the mechanisms in stimulus space are shown in
Figure 11a for the three scene conditions. The axes represent the two possible stimulus changes, a reflectance change and an illumination change, with a blue change as positive and a yellow change as negative. If there was no cross-talk in the identification of the two types of change, the vectors would lie on the corresponding axes.
Instead, in all three conditions, neither mechanism vector is aligned with the axis. This means that illumination cues caused a response in the reflectance mechanism and vice versa. The illumination mechanism was, however, much more closely aligned with the illumination change axis in the stimulus space. In other words, the reflectance mechanism was more excited by illumination changes than the illumination mechanism was by reflectance changes. Thus, the illumination responses are much more independent of reflectance color changes than the other way around. In the 3D matte scene, the reflectance mechanism is close to having a 45° angle with the axes, which would indicate that both cues were equally likely to result in a perceived reflectance change. We tested the separation of the reflectance and illumination mechanisms with permutation tests. Compared to the 3D matte scene, the angle between the two mechanisms was larger in the presence of local contrast (flat scene,
p < 0.01) or specular highlights (3D glossy scene,
p < 0.05), leading to better separation of the two dimensions. The angles between the mechanism vectors and the reflectance axis (x-axis) are plotted in
Figure 11b. The average
d′-values are plotted in
Figure 11c. Although grouped differently, these correspond to the average
d′-values in
Figure 7d and show a similar pattern, with poorest color constancy for the 3D matte stimulus.
Figures 11d–f shows the response distributions for each stimulus change in decision space. The decision criteria are shown as vertical and horizontal lines. These distributions further illustrate how reflectance responses are much more dependent on illumination changes than illumination responses are on reflectance changes.