One would therefore expect that the same fixations on specific objects or locations that the viewer made during image or scene learning or encoding would also be repeated by this viewer during image or scene recognition (cf. Noton & Stark,
1971; Stark & Ellis,
1981). However, at variance with this hypothesis, scene recognition can be accomplished with even a single fixation, too (cf. Helmholtz,
1867; see also e.g., Intraub,
1981; Potter,
1976; Schyns & Oliva,
1994; Thorpe, Fize, & Marlot,
1996). Presenting scenes for only a few tens of milliseconds is enough that participants recognize important characteristics of scenes (e.g., Bacon-Macé, Kirchner, Fabre-Thorpe, & Thorpe,
2007), although this time is too short to allow for more than one initial fixation (for reviews, see Intraub,
2012; VanRullen,
2007). The reason for human single-glance scene-recognition ability is that scene-specific characteristics are often contained in the low-spatial frequency band of a scene's image, so that participants can extract much of the gist of a scene from an image's periphery within a single fixation within the scene image (Oliva & Torralba,
2006). In addition, the supportive role of fixations for memory is doubtful in light of findings that participants fail to notice scene changes (e.g., made across single saccades) even after they have spent substantial time on the inspection of the scenes and have made many fixations within the scene (Ballard, Hayhoe, Pook, & Rao,
1997; Droll, Hayhoe, Triesch, & Sullivan,
2005; Friedman,
1979; Irwin & Zelinsky,
2002; McConkie & Currie,
1996). Also, research on the phenomenon of change blindness suggests that detailed memory representations of scenes, objects, or features perceived from the visual environment are incomplete, short-lived, and prone to dynamic overwriting (Rensink,
2002; Simons & Levin,
1997; Simons & Rensink,
2005).