Vision starts with a retinotopic representation of the visual scene: Neighboring points in the scene are mapped onto neighboring neurons in the retina, and this retinotopic encoding prevails in many visual brain areas (Amano, Wandell, & Dumoulin,
2009; Gardner, Merriam, Movshon, & Heeger,
2008; Sereno et al.,
1995). Our perception of the world, on the other hand, is clearly not anchored to retinotopic coordinates. For example, the image projected on the retina is shifted with every saccadic eye movement, causing a shift of the representation according to retinotopic coordinates. Yet these shifts are not perceived; our perception of the world remains stable across saccades. The key mechanism mediating perceptual stability across saccades is thought to be an efference copy of the motor command, which allows us to foresee and discount the displacements of the retinal image (Sperry,
1950; von Holst & Mittelstaedt,
1973). The transformation of the retinotopic representation by use of efference copy signals results in a representation based on a nonretinotopic (more specifically, in this case, a spatiotopic) reference frame. Another aspect of vision relying on nonretinotopic processing is motion perception. For example, motion is often perceived relative to moving reference systems (Duncker,
1929; Johansson,
1950; for review, see Öǧmen & Herzog,
2015). One example comes from gait perception: We perceive the limb movements of a walking person relative to his or her body. The body serves as a reference system for computing the relative motions, making the “true” retinotopic trajectories of the arms and legs invisible. The overall translational motion of the body is discounted from the motion of the limbs, similarly to the discounting of the retinal shifts caused by saccades, giving rise to a percept of pendular arm and leg motions. However, an important difference is that efference copies cannot aid the discounting of the motion. Perception of motion relative to moving reference systems has been studied extensively psychophysically and modeled, for example, by perceptual vector analysis (see, for example, Johansson,
1973). However, almost nothing is known about the underlying neural correlates.