Because horizontal slants of the screen lead to the greatest length compression along the frontoparallel axis (
Figure 1), the question arises whether the decrease in perceived length is explained completely by the shorter projected length, or whether the visual system does compensate partially for this compression. On the dark ground, the retinal image of the object holds the only information available for size estimation.
Koch, Baig and Zaidi (2018) showed that in perception of poses in oblique views of pictures, the same observer-centered back-transform is used as for real scenes, thus leading to pose estimates that are a rigid rotation of actual scene poses by an angle equal to the viewing azimuth.
Maruya and Zaidi (2020) showed that perceiving 3D sizes in real scenes also uses the optimal geometric back-transform for sizes, but estimates are suboptimal because of a slant illusion that makes longer objects appear more slanted, which leads to less correction than required. The results in
Figure 2 show that observers do not make veridical estimates of 3D size. It is still possible that they use the optimal geometrical back-projection from the retinal image but misestimate one or more parameters that are part of the back-transform expression. The physical 3D length divided by the length projected on the retina (
\({L_{3D}}/{L_r})\) gives the Optimal Length Correction index (OLC) for each pose in each viewpoint (plotted as solid curves in
Figure 2c, color-coded similar to symbols for the azimuth of the screen). The symbols in
Figure 2c plot the perceived lengths from
Figure 2a divided by projected lengths giving the measured length corrections (MLC). OLCs are the highest at line of sight (90° and 270°) with the same values for all azimuths of the display. The largest values of MLC also correspond to the poses pointing towards or away from the observer, with similar values for across the different azimuths of the display, but they are lower than what is required for veridical estimates, especially for the longer lengths. OLCs at frontoparallel poses (0° and 180°) increase with the slant of the display. Similarly, MLCs at these poses increase with the slant of the display, but not enough.
Figure 2c enables us to reject the hypothesis that size perception in pictures uses the same back-transform as for real scenes, irrespective of the azimuth of the monitor, which makes it different from pose perception. If this hypothesis was correct, the frontal view OLC, which is also the OLC for real scenes, should fit the data with some multiplicative scaling of the red curve, but that cannot happen because OLC for 0° and 180° poses is anchored at 1.0 when display azimuth is fixed at 0°, no matter what the values for other parameters such as camera elevation, focal length or distance, whereas the MLC for 0° and 180° is higher for the obliquely viewed displays showing increasing length correction with increasing azimuth of the display. This increase is still substantially less than required by the OLC computed for the ±60° display azimuth. In general, the form of measured length correction as a function of 3D pose is similar to the optimal length correction curve, suggesting that observers may be using the optimal back-transform, but with additional multiplicative factors leading to the suboptimality. We test this hypothesis by fitting a modified back-transform model.