Bertamini et al. (
2008) and Burge et al. (
2005) reported that a configural cue can have an impact on the way observers respond to a forced choice paradigm where they are required to choose which of two patterns has the most depth. Such paradigms are incapable of discriminating between a difference that arises from a response bias and those that arise from a genuine distortion of perceived depth. In our experiment, observers were required to match the perceived depth interval of the displays used in Burge et al.'s (
2005) studies. In this paradigm, we observed no difference in the settings of metric depth depending on which region appears as figure. The precision of our measurements was at least as high as those of Burge et al. (
2005), so this failure cannot be attributed to a difference in statistical power in our studies. Our results indicate that the Burge et al. (
2005) configural effect is likely to have arisen from response bias.
One difference between our experiment and Burge et al.'s (
2005) was the viewing distance of the observers. They used a viewing distance of 3.25 meters. With a disparity between the near and far surface of 7.5 arcmins, this is consistent with a depth of 40 cm between the two surfaces. We used a range of disparities between the two surfaces (5.4 to 12 arcmins) but our viewing distance was only 85 cm and the maximum predicted depth difference was thus only 4.2 cm. A reviewer raised the possibility that disparity may receive less weight at greater distances because disparity scales with distance (decreasing as the square of the distance for a given depth). However our disparities were not greater than those of Burge et al. (
2005) so there is no reason to attribute our results to the nearer distance we used. Threshold depths expressed as disparities do not vary with distance within the relevant range (Bradshaw & Glennerster,
2006). Also given the very large perceived depth in the Burge et al. (
2005) stimuli, we might expect that if a constant amount of extra depth were assumed to be provided by configural cues, it should be more difficult to detect (by Weber's Law) in their conditions than ours. The effects reported by Bertamini et al. (
2008) at an intermediate distance with larger sizes and smaller disparities than Burge et al. (
2005) indicate that the configural effect is not specific to a narrow set of parameters and Burge et al. (
2005) do not state that it is. We think it likely that the Bertamini et al. results are also determined at the response selection level, which was also suggested by these authors.
Although our data do not support a view that configural face cues in a figure-ground arrangement influence metric depth, we do not claim that there are no contexts in which ordinal factors influence metric depth. On the contrary, we believe that there are cases where they do. These seem to fall into two categories. On the one hand, there is some evidence that interposition, an ordinal cue, can interfere with the perception of depth when in conflict with stereopsis (Schriever,
1925). Such conflicts are probably rare however when viewing natural scenes. In the other and more common category, ordinal cues serve essentially a veto function. They can provide information used by the visual system to indicate whether metric information is applicable. For example, Gillam and Cook (
2001) showed that whether a cyclopean trapezoid emerging from a random dot stereogram was in front or behind the surround had a strong influence on whether its trapezoidal shape influenced the perceived metric stereoscopic slant to a disparity gradient across its surface. In the behind case, the trapezoid shape is attributed to the aperture through which it is viewed, rather than to the surface itself. In this arrangement, the perspective cue generated by the trapezoidal shape had little effect on the perceived slant of the stereoscopically defined surface within its boundaries, but it had a much more significant influence on perceived slant when the same surface was placed in front of the surround. In this latter configuration, the trapezoidal shape is intrinsic to the surface it bounds, and hence provides information relevant to the surface's slant, which is not true when the ordinal depth is reversed. Likewise, the introduction of a surface in the correct position for a possible partial occlusion can change the stereo response to a disparate rectangle from strong slant to little slant (Häkkinen & Nyman,
1997). In these cases, an ordinal cue is informative about the contours to which it is appropriate to apply metric information provided by disparity.
There have also been a variety of other experiments that have shown that non-metric information can have a dramatic effect on the use of metric information. Meng and Sedgwick (
2001) showed that the perception of the metric properties of surface layouts could be strongly affected by whether surfaces were perceived to be in contact. A similar effect of the role of contact relationships affecting perceived metric depth was reported by Kersten, Mamassian, and Knill (
1997), who showed that the perceived motion trajectory and depth of a moving object could be dramatically altered by the presence of a shadow that either caused an object to appear to travel along a surface, or along a path off of the surface.
These associations between ordinal cues, or qualitative contact relationships, and their influence on metric depth are undoubtedly built in by evolution or learned. Such processes would presumably function to reinforce correlations between ecologically relevant cues to depth, and actual depth intervals. Burge et al. (
2005) try and reconcile their finding of an ordinal effect on metric depth with Landy et al.'s (
1995) requirement that depth cues be in the same units for combination. They speculate that an ordinal cue can acquire metric status by suggesting that an occluding surface could lead to an internalization of “the statistical likelihood of a metric depth value given that geometrically ordinal depth cue” (Burge et al.,
2005, p. 541). In other words, they suggest that an ordinal cue could become associated with the most likely depth interval associated with that particular cue. Even if this were true, it seems unclear why faces should be assigned a different value than an arbitrary occluding surface, which would be needed to explain their results within the cue combination approach that they offer. We believe the more parsimonious account of their results, given the results reported herein, is that they reflect a form of response bias.
In conclusion, our results indicated that the effect of a figure-ground cue on perceived metric depth as demonstrated by Burge et al. (
2005) did not occur when perceived depth in these displays was directly measured. The configural effect on metric depth appears to be a consequence of a forced-choice method that cannot distinguish between differences that arise from genuine transformations in perceived depth, and those that arise from response bias.