Humans are extremely adept at recognizing the actions of others, even when the human form is represented by only a handful of tokens or “point-lights.” Point-light biological motion animations depict complex human actions through joint kinematics, without explicit representation of body shape (Johansson,
1973). From these animations, observers can recognize a variety of actions such as walking, running, and dancing can identify the gender of an actor (Cutting,
1978) and recognize emotional expressions conveyed by body movements (Dittrich, Troscianko, Lea, & Morgan,
1996; Pollick, Paterson, Bruderlin, & Sanford,
2001).
Current computational models of biological motion perception differ in the extent to which they emphasize the importance of dynamic and structural information. That motion cues would be critical for point-light biological motion perception is intuitive. Take, for example, initial reports that describe single, static frames as a meaningless cloud of dots that emerge into a human figure only when set into motion (Johansson,
1973). Also disrupting the temporal ordering of point-light frames, or skipping frames altogether, impairs individuals' ability to recognize biological motion (Cutting,
1981; Mather, Radford, & West,
1992).
Point-light biological motion perception, however, is not disrupted by a number of manipulations that render local motion cues difficult to analyze. Point-light animations constructed with limited lifetime dots and dots that jitter spatial position from frame to frame should have minimal local image motion. And yet these displays are easily recognized as human actions (Beintema & Lappe,
2002; Neri, Morrone, & Burr,
1998).
What then is the relative importance of motion and form cues for point-light biological motion perception? To address this issue, we set about identifying the key features for recognition of biological motion using an adaptation of the “Bubbles” technique. Bubbles is a human observer-based feature extraction algorithm that identifies the diagnostic information used for image categorization tasks (Gosselin & Schyns,
2001). In this paradigm, randomly placed Gaussian windows partially reveal an image behind a dense mask. Observers make forced-choice discriminations and performance varies depending on the quality of the information revealed. Discriminations are most accurate when the sampled regions contain task-relevant information and less accurate when the critical information is inadequately sampled. Thus, based on observer performance, one can estimate the task-relevant, or diagnostic, regions of the stimulus space. The Bubbles technique is most commonly used to study the critical information in face discrimination (Gosselin & Schyns,
2001), object recognition (Gibson, Lazareva, Gosselin, Schyns, & Wasserman,
2007; Schyns & Gosselin,
2002), natural scene perception (McCotter, Gosselin, Sowden, & Schyns,
2005; Nielsen, Logothetis, & Rainer,
2006), and the perception of ambiguous images (Bonnar, Gosselin, & Schyns,
2002).
To identify diagnostic features in point-light animations, we have created temporal “Bubbles.” In this paradigm, randomly selected intervals of biological motion are revealed within a motion–noise sequence. From the distribution of diagnostic information across the entire action sequence, we can deduce (1) whether some moments in biological motion are more diagnostic than others, and if so, (2) the features of point-light animations that correlate with these diagnostic moments. The results from these experiments dissociate three candidate key features for point-light biological motion perception, namely, global form, joint velocity, and opponent motion.