Previous studies have investigated the impact of reinforcement learning on gaze control both in experimental and natural settings (for reviews, see Glimcher,
2003; Hayhoe & Matthis,
2018; Madelain, Paeye, & Darcheville,
2011; Trommershäuser, Glimcher, & Gegenfurtner,
2009). Most learning experiments use extraneous consequences as reinforcers for saccades (e.g., food or liquid with nonhuman animals or money with humans), while the real-life benefit of saccades is associated with changes in the visual perception of the targeted object. It has been well documented that extraneous reinforcers such as monetary profits affect saccade target selection (e.g., Chen, Mihalas, Niebur, & Stuphorn,
2013; Liston & Stone,
2008; Lou, Hsu, & Sajda,
2015; Schütz, Trommershauser, & Gegenfurtner,
2012; Stritzke, Trommershäuser, & Gegenfurtner,
2009) and influence saccade latencies (e.g., Madelain et al.,
2007; Milstein & Dorris,
2007; Rothkirch, Ostendorf, Sax, & Sterzer,
2013; Vullings & Madelain,
2018). However, limited effects have often been reported when using arbitrary reinforcement for saccade latency (Madelain et al.,
2007; Vullings & Madelain,
2018) or saccade amplitude (Madelain, Paeye, & Wallman,
2011; Meermeier, Gremmler, Richert, Eckermann, & Lappe,
2017). Because biological constraints are known to have a substantial impact on operant conditioning (e.g., Domjan & Galef,
1983), one explanation for these effects is that the extraneous reinforcing consequences commonly used so far might be less biologically relevant than the ones obtained in natural settings (i.e., foveating the targeted object). In line with the fact that acquisition of information is reinforcing (Wyckoff,
1952), some studies demonstrated that visual consequences could as well act as a reinforcer for saccadic amplitude (Madelain, Paeye, & Wallman,
2011; Meermeier et al.,
2017; Paeye & Madelain,
2014). It has also been shown that humans exhibit shorter latencies when the target remains visible after the saccade than when it is systematically extinguished (Collins,
2012). Furthermore, Montagnini and Chelazzi (
2005) elegantly demonstrated that making the visual perception of a target contingent on short latencies has a dramatic effect on saccadic latency distributions in a visual-discrimination task. Taken together these results imply that saccades are reinforced by the ability to carry on a visually guided task. That a reinforcer might be a behavior rather than an object (such as food or money) has been established before (e.g., Timberlake,
1995; Timberlake & Allison,
1974). In the case of saccades, the outcome is information: Saccades enhance the perceived high spatial frequency of a visual object by placing its retinal image on the center of the retina. One could argue that whether this information is meaningful depends on the flow of activities in which the organism is engaged. In other words, changing what one sees constitutes a favorable outcome if the ongoing task benefits from this additional information. Paradigms such as ours (see also Montagnini & Chelazzi,
2005) attempt to mimic this natural state of affair by establishing relations between a specific dimension of saccadic eye movements, in our case their latency, and the ability to perform a visually guided task. That these relations come to change the saccades reveal the reinforcing effect of these outcomes.