For a complete model of bottom-up gaze control, the “where” and the “when” of the next fixation have to be determined. The majority of studies focused on the spatial aspect of the next fixation. Gaze positions in free viewing can be explained to a certain degree by salience models that combine salient stimulus features like color, luminance, and orientation in a priority map (Itti & Koch,
2001). In these models, the gaze is shifted to the peak of activation in the priority map, which has been identified in different brain areas like visual areas V1 (Li,
2002), V4 (Mazer & Gallant,
2003), the lateral intraparietal area (LIP) (Goldberg, Bisley, Powell, & Gottlieb,
2006), the frontal eye fields (FEF) (Thompson & Bichot,
2005), and the the superior colliculus (SC) (McPeek & Keller,
2002). These low-level salience models have been challenged by the finding that objects and faces clearly attract gaze (Einhäuser, Spain, & Perona,
2008; Cerf, Frady, & Koch,
2009; Nuthmann & Henderson,
2010), showing the important influence of higher-level features. Visual salience in the retinal periphery also affects fixation locations in dynamic scenes. In a dynamic 2-D bar code pattern, gaze is attracted by a dark spot (Rasche & Gegenfurtner,
2010). When observers are watching movies of natural scenes, moving objects are especially effective in attracting gaze (Dorr, Martinetz, Gegenfurtner, & Barth,
2010; Mital, Smith, Hill, & Henderson,
2011), showing that spatiotemporal stimulus aspects are also an important determinant of gaze location.