Another hypothesis is that the relative localization judgment is made after the information about the stimulus positions has been transformed from an eye-centered to a head-centered (or even body- or world-centered) coordinate system (e.g.,
Zipser & Andersen, 1988;
Bremmer et al., 1998). Such reference frame transformations at the single cell level have been shown at later stages in the processing of visual information in monkeys, i.e. VIP area;
Duhamel et al., 1997;
Schlack et al., 2005. If the ongoing pursuit in general and the shift of the eye position during the interstimulus interval in particular would be completely compensated for by this transformation, the visual system would use veridical information about each flash's position in screen coordinates for the relative localization. Thus, the PSEs in the pursuit conditions would equal the respective ones in the fixation task without any additional effects (Model A). However, our results showed that the values differed significantly, with systematic shifts of the PSEs depending on the targets' retinal eccentricity. This shows that the eye movement still had an effect on the relative localization, even if the spatial information was transformed into another coordinate system. Furthermore, it indicates that the eccentricity effect, as observed in the absolute localization task, also needs to be considered for the relative localization. Our Model C, employing a linear combination of the PSEs in the fixation task with shifts to compensate for the eccentricity effect, predicted the behaviorally measured pursuit data the best. In all conditions, the mean deviation between the predicted and the behaviorally measured PSEs was the smallest compared to the other models. Thus, our results suggest that the relative localization judgment occurs at a stage of the visual processing, where spatial information is available in nonretinal frames of reference. This could be an early stage, employing a nonretinocentric encoding via an implicit, i.e. population code (e.g.,
Bremmer et al., 1998;
Boussaoud & Bremmer, 1999). Or it could be based on an explicit encoding at the single cell level, as found in the monkey VIP area (
Duhamel et al., 1997). Within this transformation, the representation of each stimulus had been shifted in accordance with its eccentricity, when presented during ongoing SPEMs. Yet, it might be possible that the relative position information had been encoded neither in an eye-centered nor in a head-centered frame of reference but in an intermediate reference frame as also found in the VIP area (
Duhamel et al., 1997). A model using only a 1° shift during the interstimulus interval without considering the eccentricity effect was also able to predict the experimentally measured PSEs in three of the four examined pursuit conditions (data not shown). Hence, this model and Model C performed equally well. Nevertheless, both models suggest that the relative localization judgment relies on nonretinocentric information and, therefore, is likely to be performed at a stage of the visual processing in which retinal and nonretinal information is available. Importantly, studies suggest that shorter SOAs can lead to a localization in a retinal reference frame (
Brenner & Cornelissen, 2000). In a follow-up study, these authors showed that during saccades, relative localization occurred in a nonretinocentric reference frame for SOAs longer than 200 ms but in a retinocentric reference frame for smaller SOAs (
Brenner et al., 2005). These findings were backed up by behavioral studies (
Zimmermann et al., 2013) and neurophysiological studies in two gaze control centers of the macaque monkey, that is, the superior colliculus (
Sadeh et al., 2020) or frontal eye field (
Sajad et al., 2016), which suggest that building nonretinocentric spatial representations requires time on the order of a couple of hundred milliseconds. Taken together, these studies might imply that also in the current experimental approach, mislocalization could have occurred in an eye-centered rather than in a world-centered reference frame, if SOAs would had been shorter. Further experiments, however, are necessary to test this exciting hypothesis. In principle, nonretinal localization can happen already at the level of primary visual cortex (
Trotter & Celebrini, 1999;
Morris & Krekelberg, 2019), but we consider it more likely to happen in downstream areas of extrastriate or parietal cortex (
Duhamel et al., 1997;
Schlack et al., 2005). Additional experiments varying head and body positions during localization, however, would be required to differentiate between head-, screen-, body-, and world-centered reference frames.