In any study of cue combination, one considers a number of cues (usually two) and makes manipulations that reveal how they might be combined. For such a study, the stimulus must be very carefully constructed. Here, for example, we used only monocular shading. The left- and right-eye views contained an identical shading pattern, so that the shading itself delivered zero binocular disparity across the whole image. Thus, only shading specified depth and shape. This is clearly an unnatural situation. Because the two eyes view the world from different locations, the shading patterns at the two eyes are different; hence, the stimulus contains shading-based disparity first suggested by Mach (
1866) and studied in detail by Arndt, Mallot et al. (
1995) and Bülthoff and Mallot (
1988). However, shading-based disparity cannot be manipulated independently of monocular shading, because the former relies on the latter. Hence, we separated the two cues by only using monocular shading as our shading cue. We felt justified in making these manipulations because shading-based disparity cannot be used to obtain local variations in depth that determine shape (Hill & Bruce,
1993; Mallot,
1997; Mallot, Arndt et al.,
1996).
We also took care to think about how the dots defining disparity might impinge upon the shading cue. Recent work (Sun & Schofield,
2011) suggests that shape from shading can be disrupted by low-frequency texture information but is less affected if the texture is of a much higher spatial frequency than the underlying shading pattern. Norman, Todd et al. (
2004), found worse performance for shape discrimination when texture and shading were used to judge shape than when shading was presented alone. Disparity was applied to our stimulus via small densely packed dots that covered approximately 30% of the image. Dot luminance was set to be equivalent to variations in surface reflectance. Dots did not carry any consistent shape from texture information. In pilot work, we found that if the disparity-carrying dots were too few, the percept tended to break into a pair of transparent surfaces, one carried by disparity and the other by shading.
Our stimuli were designed to avoid the impact of other cues to depth and shape. For example, our smooth bumps (
Figure 3) were designed so that there was no useful object contour, which has been shown to be a strong cue to depth and shape (e.g., Koenderink,
1984; Koenderink & van Doorn,
1982; Malik,
1987; Singh & Hoffman,
1999). Contour disparity, caused by each eye seeing a slightly different contour due to its different viewpoint, is also a useful cue to depth (Nefs,
2008) and was absent from our stimuli. We removed texture cues: The shaded object did not have texture itself, and texture information provided by the superimposed disparity dot pattern did not warp with the shape: Instead, it specified a flat surface.
Finally, we used a carefully considered choice of lighting and shape so that the ambiguous shape cue was as stable as possible. Shape from shading is an inherently ambiguous cue: The shading pattern changes not only with object shape but also changes with variations in the object viewpoint and changes in lighting direction (Horn,
1975,
1990; Pentland,
1989). We also know that changes in light direction can result in changes in perceived shape (Christou & Koenderink,
1997; Koenderink, van Doorn et al.,
1996; Nefs et al.,
2005). Presumably to combat some of this ambiguity, the visual system appears to operate several strong “priors” that bias observer judgments of shape from shading. In particular, there is a bias toward perceiving objects as convex (Hill & Bruce,
1993; Langer & Bülthoff,
2001), and the visual system assumes that light comes from above (Ramachandran,
1988; Todd & Mingolla,
1983). Indeed, one study has suggested that, for shape judgments of concave objects, observers can behave as if they use only priors, and there is no influence of the visual input at all (Liu & Todd,
2004). There is also evidence that the convexity prior can be modified (Champion & Adams,
2007). Here, we were careful to set directional lighting to be from above, from a consistent direction on all trials and to only use convex objects. No observers reported any kind of instability in perceived object shape and our highly reliable results suggest that, for our stimulus arrangement, shading was only around 30% less reliable than binocular disparity.