Explanations for reverse-phi motion so far were based on excitation of motion detectors tuned to the direction opposite to the displacement. Clearly, at a high level of analysis, this must be the case. It has been shown that the level of activity in monkey area MT (middle temporal), an area dedicated to motion analysis, corresponds to the monkeys motion percept (Britten, Shadlen, Newsome, & Movshon,
1993; Newsome, Britten, & Movshon,
1989; Newsome, Britten, Salzman, & Movshon,
1990; Shadlen, Britten, Newsome, & Movshon,
1996). Microstimulation in MT may bias the monkey's perceptual choice depending on the tuning properties of stimulated neurons (Salzman, Murasugi, Britten, & Newsome,
1992). Krekelberg and Albright (
2005) have shown that MT activity for reverse-phi stimuli also corresponds to the monkey's percept. At the level of MT reverse-phi causes activation of cells tuned to the direction opposite to the displacement.
These findings, however, do not reveal how the two types of motion are initially detected. The initial stage of motion detection in primates is located in primary visual cortex (V1), where simple and complex cells derive directional selectivity from nondirectionally sensitive geniculate cells. Contrast information in the retina and LGN is divided across two different channels: ON cells most effectively signal positive contrasts, whereas OFF cells signal negative contrasts. High sensitivity for reverse-phi stimuli suggests that low-level motion detectors specifically combine information from ON and OFF cells (Mo & Koch,
2003). These authors proposed a model in which such opposite contrast correlations excite detectors tuned to the direction opposite to the displacement (
Figure 1B). However, here we show evidence that the alternative, as shown in
Figure 1A, is more plausible. Motion signals based on correlations between ON and OFF channels probably cause inhibition of motion detectors tuned to the direction of displacement.
The critical observation to support this conclusion is the similarity between motion aftereffects and reverse-phi motion. Directional interactions differ between direct motion percepts and motion aftereffects due to differences in directional tuning width for excitations and inhibitions. Low-level excitations remain narrowly tuned and as a result multiple components remain segregated. Response reductions after adaptation, however, induce the opposite percept through disinhibition (van de Grind et al.,
2003, van de Grind, van der Smagt, & Verstraten,
2004). If inhibitions are more broadly tuned, this leads to broad directional tuning for aftereffects (Grunewald & Lankheet,
1996). It explains the orthogonal motion aftereffect and the fact that adaptation to multiple motion components mostly results in a single, unified motion aftereffect. We propose that the same interactions may be responsible for the behavior of reverse-phi percepts. This would imply that reverse-phi, like motion adaptation, causes a response reduction at the detection level, which through disinhibition transforms into a motion percept in the opposite direction.
The proposal is further supported by results for the nulling experiment. Whereas two regular motion patterns are readily seen to move in opposite directions, one regular and one reverse-phi component always canceled each other. This shows that reverse-phi is fundamentally different from regular motion in the opposite direction. The behavior is similar to that observed for motion aftereffects, but the effect is much stronger. Motion aftereffects can also be nulled with real motion but the motion strength required is much lower (Blake & Hiris,
1993; Lankheet & Verstraten,
1995; van de Grind et al.,
2003). The fact that nearly equal motion strengths were required to cancel the percept shows that cancellation was not due to a relatively small sensitivity for reverse-phi motion. In fact, in pilot experiments, we found observers to be equally sensitive to the two types of motion. Motion coherence thresholds measured for the same motion settings as in the nulling experiments did not differ significantly between regular and reverse-phi motion.
Similarity of motion strengths required for nulling and quantitative similarity of motion sensitivity for regular and reverse-phi motion also rule out that the lack of same-contrast correlations in the reverse-phi stimulus can account for the present findings. If the main effect of reversing the contrast would be removing same-contrast correlations at the specific spatiotemporal offset, we should expect large differences in sensitivity. In this case, regular motion would result in a large bias in one specific direction whereas reverse-phi would correspond to incoherent noise in all directions, except one. Among other things, such a broad distribution of directions would largely cancel because most components would be balanced by their opponent counterpart. Equal sensitivity for regular and for reverse-phi motion thus strongly suggests that opposite polarity correlations are actually used for motion detection.
The most likely explanation for the nulling results is that regular and reverse-phi motion cause excitations and inhibitions of similar strength, which are added at the very first level of motion detection.
Clearly, this proposal does not deny the observation that the difference between regular motion and reverse-phi motion is embedded in the stimulus. Nor does it deny that ideal, linear processing of Fourier components could predict the reversal (Adelson & Bergen,
1985; Edwards & Nishida,
2004; Krekelberg & Albright,
2005). It rather addresses the level at which the reversal takes place. Our experiments show that we can differentiate between different schemes (
Figure 1) of combining positive and negative contrast signals. We propose that, at low levels of motion detection (V1), contrast reversals reverse the sign of the response, not the direction. At the next level of global direction integration (area MT) this then gives rise to a direction reversal.
Once we realize that contrast reversals may lead to perceived direction reversals through inhibitions rather than excitations, we may also appreciate the reason for this specific sensitivity. Combining opposite contrasts with a negative sign makes optimal use of all available information. If spatiotemporal correlations between pairs of ON cells or pairs of OFF cells signal coherent motion from one receptive field to the other, then correlations between ON and OFF cells signal the absence of coherent motion. Thus, correlations between equal contrast polarities provide positive evidence whereas correlations between opposite polarities provide negative evidence for the same motion. Weighing positive and negative evidence for the same motion at the first detection stage efficiently improves signal-to-noise ratios.
It is interesting to notice that the space–time representation of our motion stimuli, as presented in
Figure 2, is similar to Glass patterns that have been used to study orientation sensitivity. Results for orientation detection are, however, less consistent, and aftereffects and contrast reversals may yield different results. Burr and Ross (
2006) showed that contrast reversals for Glass patterns can counteract the effect of equal-polarity correlations but do not result in orientation percepts. This would suggest inhibition without opponency. However, Clifford and Weston (
2005) used a nulling procedure for adaptation to Glass patterns similar to the procedure we used for motion. They found clear orthogonal aftereffects, indicating opponency between orthogonal orientations. Dakin (
1997) also reported induction of orthogonal percepts from contrast reversed Glass patterns. These different findings show an essential difference between orientation analysis and motion analysis. For orientation detection, aftereffects and reversed contrast stimuli may yield different results. For motion, however, aftereffects and reverse-phi percepts behave in similar ways.
Our hypothesis is in line with neurophysiological results in the primate and feline visual system. Directional selectivity, the corner stone of motion sensitivity, arises in primary visual cortex, especially in complex cells. This is also the level were information from ON and OFF pathways merges (Schiller,
1992; Schiller, Sandell, & Maunsell,
1986). V1 complex cells in both cat (Emerson, Bergen, & Adelson,
1992) and macaque (Livingstone & Conway,
2003) respond with relative inhibitions to reversed contrast correlations. At this level, there is relatively little opponency (Snowden, Treue, Erickson, & Andersen,
1991). At the level of area MT, however, negative modulations have been transformed into positive modulations, and perceived motion corresponds to excitations (Krekelberg & Albright,
2005). Thus, it seems likely that directional interactions translating inhibitions to excitations (Grunewald & Lankheet,
1996) are effectuated in the projection of V1 to MT.