Many objects in the world move; even motionless objects constantly move relative to our eyes as we navigate and explore the world. Numerous previous investigators have used psychophysical, neuroimaging, or electrophysiological methods to explore how different kinds of motion can support object segmentation and form perception. The appearance of motion can also arise from static form, for example when viewing a flipbook or a cartoon. Our perception of “something moving” is generally unified and coherent, but for experimental purposes it can be very useful to dissociate the “something”—that is, the form—from how it is moving.
Structure-from-motion stimuli, in which objects with ambiguous structure become unambiguous when set into motion, have provided many insights into the relationship between motion and form. Wallach and O'Connell (
1953) described the “kinetic depth effect,” in which flat projections of wireframes spring into three dimensions when the wireframes are rotated. Later studies have used random dot patterns applied to the surfaces of objects. As the objects move, each dot follows the trajectory of the underlying object at that point, and the collection of dots gives roughly the same motion impression as the original object. Such stimuli contain enough information to judge relative surface slant (Domini & Caudek,
1999), curvature (van Damme & van de Grind,
1996), and form (Doner, Lappin, & Perfetto,
1984). When uniform dot textures are mapped onto the surfaces of objects, it does not necessarily translate to a uniform distribution of dots as seen by the viewer. As a surface slants away from frontoparallel, apparent dot density increases. This can give significant form cues even in the absence of motion (
Figure 1). This can be ameliorated by evaluating dot distribution in each frame and adding and removing dots to bring the distribution closer to uniform (as in Sperling & Landy,
1989). However, this increases scintillation at the edges of rotating objects and can be computationally expensive.
Reducing the number of dots can also help to decrease the contribution of static form. One prominent example is the study of perception of biological motion using point-light walkers. These stimuli, first used by Johansson (
1973), are created by placing a small number of dots at points on an otherwise invisible moving human body. Still images from the resulting movies often appear to be formless collections of dots but when viewed in motion the actions of the underlying body are strikingly clear. Several experiments have attempted to distinguish between motion-based and form-based processing of these stimuli. It is possible that the strong perception of animate motion arises from particular patterns of motion in the stimulus, regardless of the form that is generating that motion (Casile & Giese,
2005; Hoffman & Flinchbaugh,
1982). It may also be that as information accrues over time, the dots in each individual frame obey the location constraints imposed by a particular motion, activating a form-based template of that motion (Beintema & Lappe,
2002). Some arguments have been for a purely form-based or a purely motion-based source for the vivid perception of biological motion; as one recent model (Giese & Poggio,
2003) formalized, both processes could combine to yield the final coherent percept.
Work outside the field of biological motion has also offered insights into the connections between apparent motion and apparent form. A recent study suggests that low-level motion information can give rise to a form-based representation that then affects how the motion is perceived (Caplovitz & Tse,
2007). Similarly, apparent motion caused by sequential static views only develops after those static views are parsed for form (Tse & Logothetis,
2002). At least one neurological patient is unable to perceive structure from motion, despite being able to recognize static objects and to perceive basic properties of motion (Cowey & Vaina,
2000); another is able to perceive structure from motion, despite severe impairment of low-level motion perception (Vaina, Lemay, Bienfang, Choi, & Nakayama,
1990).
This article describes a technique for the generation of stimuli that contain motion information but essentially no static form information. These formless dot fields differ from point-light walkers in that they can be made of thousands of dots, and they are free from the variations in dot density seen when uniform dot textures are mapped onto three-dimensional models. This technique is not the first with these properties (e.g., Sperling & Landy,
1989; Treue, Husain, & Anderson,
1991), but due to the use of hardware graphics acceleration, it is very fast. It is able to transform a modeled action sequence involving thousands of polygons into a structure-from-motion stimulus involving thousands of dots in real time. This makes it possible to present parametrically varied, actively controlled, or large numbers of individual objects without needing to pre-render an inordinate number of movies. An example of a formless dot field stimulus, created from a dancing humanoid, is shown in
Movie 1.