This basic result—ultra rapid and accurate animal detection in natural scenes—has been replicated reliably many times: in non-human primates (Fabre-Thorpe, Richard, & Thorpe,
1998; Vogels,
1999a,
1999b), using gray-scale instead of color images (Delorme, Richard, & Fabre-Thorpe,
2000), using different response paradigms and modalities (yes-no or go-no-go versus forced-choice; eye movements versus button presses; e.g. Kirchner & Thorpe,
2006), and while measuring neurophysiological correlates (ERPs; Rousselet, Fabre-Thorpe, & Thorpe,
2002; Thorpe et al.,
1996; MEG, Rieger, Braun, Bülthoff, & Gegenfurtner,
2005). Ultra rapid animal detection is even robust to inversion (180 deg rotation) and nearly orientation invariant (Kirchner & Thorpe,
2006; Rieger, Köchy, Schalk, Grüschow, & Heinze,
2008; Rousselet, Macé, & Fabre-Thorpe;
2003; but note that Rieger et al.,
2008 found a slight performance decrement for intermediate rotation angles but none for 180 deg inversions). Finally, there are suggestions that rapid animal detection may not require focused attention (Fei-Fei, VanRullen, Koch, & Perona,
2002; Rousselet et al.,
2002), although this is not uncontroversial (c.f. Evans & Treisman,
2005) and some performance limitations in the complete absence of attention have emerged (Evans & Treisman,
2005; Rousselet, Thorpe, & Fabre-Thorpe,
2004; VanRullen, Reddy, & Li,
2005). Recently Kirchner and Thorpe (
2006) showed that, in a spatial two-alternative forced-choice paradigm, human observers could initiate saccades to the image containing the animal even as short as 120 msec after stimulus onset, while Rieger et al. (
2005) measured the magnetoencephalogram (MEG) while studying natural scene perception and concluded that only 90 msec of undistorted processing—processing without a backward mask—enabled their human observers to recognize natural scenes in a match-to-sample task.