Abstract
We investigated the ability to use linear perspective to perceive depth from monocular images. Specifically, we focused on the information provided by convergence of parallel lines in an image due to perspective projection. Our stimuli were trapezoid-shaped projected contours, which appear as rectangles slanted in depth. If converging edges of a contour are assumed to be parallel edges of a 3D object, then it is possible in principle to recover its 3D orientation and relative dimensions. This interpretation depends on projected size, so if an image contour were scaled, accurate use of perspective predicts changes in perceived slant and shape. We tested this prediction, and measured the accuracy and precision with which observers can judge depth from perspective alone. Observers viewed monocular images of slanted rectangles, and judged whether the rectangles appeared longer or wider than a square. The projected contours had varying widths (7°, 14°, 21°), and side angles (7° or 25°), and heights were varied by a staircase procedure to compute a PSE and JND for each condition. Observers were able to reliably judge aspect ratios using only the monocular images: Weber fractions were 6–9% for the largest rectangles, increasing to as high as 17% for small rectangles with high simulated slant. Overall, the contours judged to be squares were taller than the projections of actual squares, consistent with perceptual underestimation of depth. Judgments were modulated by image size in the direction expected from perspective geometry, but the effect of size was smaller than predicted (20–40%).
Supported by NIH EY-013988