Some studies have shown that image cues, such as two-frame motion (e.g., optical flow; Kawabe, Maruya, Fleming, & Nishida,
2015; Kawabe & Nishida,
2016), local 3-D structure (Giesel & Zaidi,
2013), and shape deformations (Paulun, Kawabe, Nishida, & Fleming,
2015; Kawabe & Nishida,
2016; Paulun, Schmidt, van Assen, & Fleming,
2017; Schmidt, Paulun, van Assen, & Fleming,
2017), can affect the perception of mechanical properties. However, static cues and two-frame motion cues can be conflicting and accidental, and therefore they are sometimes insufficient to capture the impression of mechanical properties. For example,
Figure 1A shows two static images of the same fabric. From the static images, we can already tell a lot about the material properties of the cloth (e.g., transparency and shape deformation). However, we might perceive the cloth in the left image to be stiffer than the one in the right. In contrast, when we view video sequences of how the fabric moves under a wind force (
Figure 1B; for a video, see Malcolm,
2017), we might achieve a more consistent impression of its stiffness. This indicates that multiframe motion may help disambiguate the conflicting information and help observers achieve a consistent judgment. In this article, we investigate the role of such long-range motion information, characterized as spatiotemporal coherence over multiple frames, on the perception of mechanical properties of cloth from videos.