We have all experienced the difficulty of trying to locate a face in a crowd. The fact that a face is much easier to recognize when it is located in the central visual field than when it is located in the periphery is not entirely due to poor visual acuity in the periphery, but also due to the presence of internal as well as surrounding features that interfere with the identification of the target face. This effect is called crowding (Bouma,
1970; Field, Hayes, & Hess,
1993; He, Cavanagh, & Intriligator,
1996; Intriligator & Cavanagh,
2001; Latham & Whitaker,
1996; Levi, Klein, & Aitsebaomo,
1985; Martelli, Majaj, & Pelli,
2005; Pelli, Palomares, & Majaj,
2004; Strasburger, Harvey, & Rentschler,
1991; Toet & Levi,
1992; Westheimer & Hauske,
1975). Unlike traditional masking, in which a signal (e.g., the face) is rendered invisible, crowding occurs when the signal is still visible but its features blend with its neighbors (Martelli et al.,
2005; Pelli et al.,
2004). Integrating surrounding features with those of the signal results in the inability to scrutinize or identify the target. According to most models, crowding occurs because of interference or pooling among low-level features, which likely happens at a single, relatively early stage in visual processing (Ariely,
2001; Chung, Levi, & Legge,
2001; He, Cavanagh, & Intriligator,
1997; Levi et al.,
1985; Parkes, Lund, Angelucci, Solomon, & Morgan,
2001; Pelli et al.,
2004). To date, however, it has not been tested whether crowding can occur selectively at higher levels in the visual system—only crowding of low-level features has been demonstrated. The possibility therefore remains that crowding may operate at multiple levels in the visual system; for example, even between high-level representations of objects.