When viewing complex stimuli, human observers sequentially shift their attention (James,
1890). In natural vision, shifts in eye position correlate tightly with such attentional shifts (Rizzolatti, Riggio, Dascola, & Umilta,
1987). Various factors guide this “overt” attention, such as the features of the stimulus, the observer's experience, and the task (Buswell,
1935; Yarbus,
1967). Most models of human attention focus on the former “bottom–up” signals, resting upon the concept of a “saliency map” (Koch & Ullman,
1985). According to this scheme, stimuli are processed in multiple independent feature channels, local differences (“contrasts”) in these channels are summed, and the activity in the resulting saliency map reflects the probability of a location to be attended. Various implementations of the saliency-map scheme predict human fixation behavior in natural scenes better than chance (Itti & Koch,
2000; Parkhurst, Law, & Niebur,
2002; Peters, Iyer, Itti, & Koch,
2005; Tatler, Baddeley, & Gilchrist,
2005). One of the model's features—luminance contrast—is elevated at fixation, as compared with random locations (Krieger, Rentschler, Hauske, Schill, & Zetzsche,
2000; Reinagel & Zador,
1999). This effect, however, is contingent on correcting for a general fixation bias toward the image center (Mannan, Ruddock, & Wooding,
1996,
1997) or on restricting analysis to certain spatial frequencies (Einhäuser & König,
2003; Tatler et al.,
2005). In addition, this correlation does not imply a causal contribution of luminance contrast to fixation but rather reflects the correlation of both with a higher order stimulus property (Einhäuser & König,
2003). This raises the question to what extent the effect of a low-level feature, such as luminance contrast, on attention depends on higher order stimulus statistics.