Two primary decision processes occur in parallel: One decides when to look, the other where to look. A lot of work has been invested in understanding the selection of the next fixation location. Clearly, many different factors contribute to the decision of where to look next (Kollmorgen, Nortmann, Schröder, & König,
2010; König et al.,
2016). Most notably, task-dependent factors (Buswell,
1935; Hayhoe & Ballard,
2005; Rothkopf, Ballard, & Hayhoe,
2007), stimulus dependencies (Einhäuser, Spain, & Perona,
2008; Foulsham & Underwood,
2008,
2009; Koehler, Guo, Zhang, & Eckstein,
2014), and geometric dependencies of the trajectory (Henderson & Smith,
2009; Hooge, Over, Van Wezel, & Frens,
2005; Kaspar & König,
2011b; Motter & Belky,
1998; Tatler, Baddeley, & Vincent,
2006; Tatler & Vincent,
2009) exist. For the future fixation location, some of these factors have been summarized in the saliency model (Itti, Koch, & Niebur,
1998; Koch & Ullman,
1985). In recent years, the performance of saliency models has slowly converged to the interindividual noise ceiling. In other words, models based on features become as good as predictions based on other subjects, and therefore, only interindividual differences remain to be explained (Afsari, Ossandón, & König,
2016; Bylinskii, Judd, Oliva, Torralba, & Durand,
2016; Kümmerer, Wallis, & Bethge,
2015; Wilming, Betz, Kietzmann, & König,
2011). Furthermore, the concept of a saliency map is not just a computational convenience. Studies investigating neglect patients provide evidence for the existence of a saliency map in humans (Ossandón et al.,
2012), presumably in the superior colliculus (White et al.,
2017). Most models estimate saliency based on low-level stimulus properties, such as luminance, contrast, motion, or edges. In addition, high-level object-related features, such as the recall frequencies of objects in a scene, predict eye locations even better (Einhäuser et al.,
2008). In recent years, geometric dependencies on the trajectory have increasingly been included in these models. Spatial bias (Tatler & Vincent,
2009), saccadic momentum (Posner & Cohen,
1980; Wilming, Harst, Schmidt, & König,
2013), and horizontal asymmetries (Ossandón, Onat, & König,
2014) are incorporated not only to predict average fixation locations, but to model whole gaze paths (Schütt et al.,
2017). Taken together, quite extensive literature and profound insights exist on the decision of where to look next.