We can now explain why focal length affects apparent depth in pictured scenes and facial appearance in portraits. Recall that long- and short-focal-length pictures look respectively compressed and expanded in depth (
Figures 1,
2, and
3). We propose that people's preferred field of view when looking at most pictures leads them to view long-focal-length pictures from too near and short-focal-length pictures from too far. Perceptual compression and expansion occur because people do not take their incorrect viewing distances into account. Thus, scenes captured with long lenses look compressed in depth, which makes faces apparently flatter. Likewise, scenes captured with short lenses appear expanded in depth, which makes faces look rounder.
However, this does not tell us why pictures created with a 50-mm lens look most natural, i.e., neither expanded nor compressed. To investigate this, we calculated for each picture size the focal length for which the subjects' average preferred viewing distance would be equal to the COP distance. We call this the
recommended focal length:
where
dpref is the average preferred viewing distance,
lp is the diagonal length of the picture, and 43.3 is the diagonal length of standard 35-mm film in millimeters. The recommended values from our data, calculated by averaging the preferred viewing distance across all focal lengths for each picture size from
Experiment 2, are plotted in
Figure 11. The regression line from
Figure 9b is also replotted in terms of recommended focal length. The equation for the line is:
Thus, for prints 35 cm or larger, the recommended focal length is ∼50 mm. Most prints, particularly professional ones, are at least that size. We claim therefore that following the 50-mm rule of thumb maximizes the odds of a viewer looking at the photo from the COP distance and thereby makes it most likely that the percept will be undistorted. This rule has presumably evolved over time based on collective experience. Similar recommendations apply for cinematographers, computer-graphics engineers, and painters of realistic images. Some typical image sizes for various formats (Take,
2003) are superimposed as vertical bands in the figure. For most venues, the recommended focal length is ∼50 mm (35-mm equivalent). With the small screens of mobile devices, longer focal lengths should be used. If image creators know the size of a typical print or projection of their work, they can use
Equation 5 to make a better choice of focal length or to change the distance of the COP in postprocessing (Carroll, Agarwala, & Agrawala,
2010).
Most photography texts advocate the 50-mm rule (Kingslake,
1992; Belt,
2008; Modrak & Anthes,
2011; London et al.,
2010), but we wondered whether the rule is actually used in practice. To find out, we collected 3,930 photographs from the website Flickr that were taken with single-lens reflex (SLR) cameras. (These cameras tend to be used by professionals and serious hobbyists.) We obtained the 35-mm-equivalent focal length for those photos from their EXIF data. The median is 68 mm (50% quantile horizontal line in
Figure 11). Interestingly, 68 mm is closer than the advocated 50 mm to our recommended focal length for a wide range of sizes. Thus, current practice deviates slightly from the 50-mm rule, but is more consistent with our experimental data.
Our recommended focal length is much longer for small picture sizes, such as those on mobile devices. The viewing of images on mobile devices is becoming much more common (Choney,
2009; Carlsson & Walden,
2007). People tend to view smart phones from ∼30 cm (Knoche & Sasse,
2008). When standard content is viewed at that distance, the smart-phone user is generally much farther from the display than the COP distance, making the images of objects subtend small angles and producing expansion in apparent depth. Interestingly, smart-phone viewers prefer standard content to be magnified and cropped (Knoche et al.,
2007; Song et al.,
2010), which increases the COP distance, much like increasing focal length; this practice should make the viewed content appear less expanded than it otherwise would.
Focal length has a strong effect on the perceived personality of subjects in portraits (Perona,
2007). We speculate that such effects derive from correlations between people's actual facial dimensions and personality traits. For example, faces appear narrower when photographed with short lenses and wider when photographed with long lenses (
Figures 1c and
3). The actual width-to-height ratio of male faces is positively correlated with aggressive behavior (Carre & McCormick,
2008), so attributions made from apparent ratio changes probably derive from correlations with real ratios. It would be interesting to examine the relationship between other facial dimensions affected by focal length (e.g., nose length, face roundness) and personality traits.
Pictures are useful in part because viewers can gain a faithful impression of the pictured content even when they are not positioned precisely at the COP. However, the ability to compensate for incorrect viewing position differs between being off-axis (i.e., off to the side) and at the wrong distance. For off-axis compensation, the visual system estimates the slant of the picture surface and corrects for the expected foreshortening (Pirenne,
1970; Vishwanath, Girshick, & Banks,
2005; Rosinksi et al.,
1980). This compensation process is shape constancy (Wallach & Marshall,
1986), which allows one to perceive the dimensions of real objects from various viewpoints. (The observation of nearly complete compensation has occurred when the image content being judged is roughly parallel to the picture surface; when the content is roughly perpendicular to the surface, compensation is much less complete; Goldstein,
1987; Todorović,
2008.) To compensate for incorrect distance, the visual system would have to estimate the correct distance from the picture's contents, but such estimation is prone to error (La Gournerie,
1859; Kubovy,
1986; O'Brien & Farid,
2012). Thus, we argue that compensation for off-axis viewing occurs because the computations involved are useful in everyday vision and generally not prone to error. We argue further that compensation for incorrect viewing distance does not occur because the required computations are not useful in everyday vision and are prone to error.