Several studies have shown this model to be a useful predictor of where people fixate in natural scenes. Itti and Koch (
2000) themselves found a close match between the number of fixations the model predicted in a complex search before reaching the target and the time taken by real observers. Parkhurst, Law, and Neibur (
2002) confirmed that saliency at fixation was significantly greater than chance. Adding more realistic features (such as a simulated retina with declining contrast sensitivity at greater eccentricity) makes the saliency map model more predictive, even in dynamic scenes, than the standard model and greater than chance (Itti,
2006). More recently, several authors have argued that saliency can explain little or none of the variance in fixation locations in natural tasks such as walking (Jovancevic, Sullivan, & Hayhoe,
2006; Turano, Geruschat, & Baker,
2003). In realistic visual search, top-down guidance (for example, knowledge of a target's appearance) tends to dominate, and participants are rarely distracted by visually salient but irrelevant items in these tasks (Chen & Zelinsky,
2006). Recent updates of the model include top-down information (Navalpakkam & Itti,
2005). It is true that even when free-viewing static scenes, the correlation between saliency and fixation locations tend to be small; Parkhurst et al., for example, estimate mean correlations of 0.55 and 0.45 for fractals and natural photographs respectively.