Two important issues raised by the change blindness literature are what object properties attract attention in a scene and whether top–down and/or bottom–up factors influence our allocation of attention when perceiving natural scenes. Studies investigating scene perception have suggested that we acquire information about the general “gist”, or scene-schema, early on in viewing, within the first 100 ms (Biederman,
1972; Intraub,
1981; Potter,
1976; Venturino & Gagnon,
1992). Once the general scene-schema has been extracted, knowledge-based information can be used to help guide attention and control gaze (see Henderson,
2003). Scene-schema knowledge can provide information about categories of objects that might be expected within a specific scene, for example, shower gel, sponge, and soap are items that one might expect to see in a bathroom scene. It might also provide likely candidate regions for the locations of such objects (e.g., soap is likely to be on a surface, close to a sink). Some studies have indicated that objects that are inconsistent with the scene-schema (e.g., a can of beans in a bathroom scene) attract early fixations during viewing. Loftus and Mackworth (
1978) presented participants with black and white line drawings of scenes sometimes containing scene-schema-inconsistent objects. These were not only fixated earlier than consistent ones but also received longer fixation durations, suggesting that attentional disengagement is more difficult (Henderson, Weeks, & Hollingworth,
1999). When photographs of real-world scenes are used, inconsistent objects attract eye fixations earlier than their scene-consistent counterparts do (Gordon,
2004; Underwood & Foulsham,
2006; Underwood, Humphreys, & Cross,
2007). Hollingworth and Henderson (
2000) presented participants with pairs of gray-scale line drawings within the flicker task (Rensink et al.,
1997), which contained changes to semantically consistent or inconsistent objects. Visual salience was partially controlled by swapping target objects between different scenes. Their results supported previous eye-tracking studies, showing that changes to inconsistent objects in the scenes were detected faster and with greater accuracy than changes to consistent ones. This pattern of results emerged even when the possibility of using a specific encoding strategy was discouraged.