Our data suggest that the functional significance of implicit VSL may be very different than previously supposed. Implicit statistical learning is often considered an important mechanism for extracting statistical regularities that are stable in the environment, such as frequently co-occurring sounds (Saffran et al.,
1996) or consistently paired objects (Fiser & Aslin,
2001; Turk-Browne,
2012). As a result, unsupervised VSL may be useful for representing the hierarchical structure of features, objects, and their relationship in the external world (Fiser & Aslin,
2005; Orbán, Fiser, Aslin, & Lengyel,
2008). Although the view of VSL as a mechanism for “extracting what is out there” is satisfactory in paradigms that involve perception only, it is more difficult to resolve with tasks that involve covert or overt action. Visual search, for example, is more than just perceiving what is in the visual world. It also involves actively shifting covert and overt attention between items until the target object is found. Unlike visual perception (which may be centered on objects or environmental locations), visual action is inherently egocentric (Goodale & Haffenden,
1998). Consistent with this claim, spatial attention is referenced primarily egocentrically (Golomb, Chun, & Mazer,
2008; Jiang & Swallow,
2013b; Mathôt & Theeuwes,
2010). Therefore, the function of VSL in active tasks is unlikely to be simply about perception. Rather, VSL may serve to increase the likelihood that successful actions will repeat in the future. On this interpretation, VSL did not accumulate in
Experiments 1 and
2 because the target-rich region was randomly located in a viewer-centered (possibly even action-centered) reference frame. In contrast, VSL was present in
Experiment 4 because the statistical regularities were stable relative to the viewer. As a result, successful movements of attention through space were relatively consistent across trials.