Ever since Johansson (
1973) developed point-light displays, numerous studies have reported that humans from a very young age exhibit an exquisite sensitivity to biological motion (see, e.g., Blake & Shiffrar,
2007). The exceptional ability of the human visual system in perceiving actions is revealed by the three major findings. (1) Good recognition performance is obtained for sparse visual stimuli. In particular, point-light displays, which eliminate most form/structural information by reducing the actor to a set of 11 moving dots, still allow for meaningful interpretation (Johansson,
1973). In addition to the point-light display, researchers also found that observers are able to recognize actions with partial information due to occlusions, for example, when actors are viewed through a set of apertures (Lu,
2010; Shiffrar, Lichtey, & Heptulla Chatterjee,
1997). (2) Recognition performance is robust with noisy displays. For example, humans can still recognize point-light actions embedded in a noisy background (Bertenthal & Pinto,
1994; Cutting, Moore, & Morrison,
1988; Neri, Morrone, & Burr,
1998; Thompson, Hansen, Hess, & Troje,
2007), assigned with random contrasts (Ahlstrom, Blake, & Ahlstrom,
1997) or jittered positions (Beintema & Lappe,
2002), associated with scrambled depth (Bulthoff, Bulthoff, & Sinha,
1998; Lu, Tjan, & Liu,
2006), or assigned with a general transformation such as inversion (Bertenthal & Pinto,
1994; Bertenthal, Proffitt, & Cutting,
1984; Fox & McDaniel,
1982; Pavlova & Sokolov,
2000; Pinto & Shiffrar,
1999; Simion, Regolin, & Bulf,
2008; Sumi,
1984), and even when spatially scrambled (Chang & Troje,
2009; Troje & Westhoff,
2006). (3) Humans exhibit fine discrimination ability in perceiving actor characteristics such as gender (Cutting & Kozlowski,
1977b; Mather & Murdoch,
1994), identity (Cutting & Kozlowski,
1977a), emotion (Dittrich, Troscianko, Lea, & Morgan,
1996; Roether, Omlor, Christensen, & Giese,
2009), and sign language meaning (Poizner, Bellugi, & Lutes-Driscoll,
1981). These data have been incorporated into various theories that aim to explain why the visual system is so adept at interpreting biological motion (e.g., Giese & Poggio,
2003; Troje & Westhoff,
2006).