The spatial frequency doubling illusion (FDI) occurs when the contrast of a low spatial frequency sinusoidal grating is modulated at high temporal frequencies—its apparent spatial frequency increases (Kelly,
1966). Maddess and Henry (
1990,
1992) suggested that the illusion arises due to the transduction properties of a specific class of magnocellular ganglion cells, M(y) cells, which resemble cat Y cells in their non-linear spatial summation response characteristics (Blakemore & Vital-Durand,
1986; Kaplan & Shapley,
1982; Marrocco, McClurkin, & Young,
1982; Shapley, Kaplan, & Soodak,
1981). However, this notion is contentious since several studies have reported that in both macaque retina and LGN, there are very few magnocellular ganglion cells that demonstrate the required non-linearity in spatial summation (Derrington & Lennie,
1984; Kaplan & Shapley,
1986; White, Sun, Swanson, & Lee,
2002). White et al. (
2002) specifically demonstrated that magnocellular ganglion cells have very low non-linearity indices and are not capable of generating the sort of non-linear responses required to explain the FDI, especially at the low spatial frequencies for which the illusion is seen. To explain the perception of this illusion, White et al. proposed a new theory whereby spatial frequencies appear doubled in the FDI because at high temporal frequencies we lose our ability to precisely encode the timing of light modulations, i.e., the change from one luminance polarity to the other occurs so fast that we see both the polarities simultaneously and it gives the perception of doubled frequency. Nevertheless, their physiological measurements from magnocellular ganglion cells showed that these cells can adequately encode temporal phase of the fast flickering gratings; hence, the locus of any psychophysically identified loss of temporal phase encoding must be due to more central limitations on visual information processing.
While this theory is successful in explaining the perception of doubled frequencies in the FDI, it fails to explain why the FDI is perceived only for low spatial frequencies. To examine the spatiotemporal relationship between the FDI and loss of temporal phase encoding, in this study we measured the temporal phase discrimination (TPD) thresholds over a range of spatiotemporal parameters. To measure the temporal phase encoding ability, we modified the stimulus paradigm used by White et al. (
2002). Their stimulus consisted of two sets of horizontal grating pairs with 30′ separations between the gratings in each grating pair. In one grating pair, both gratings were locked in phase but in the other grating pair, gratings were flickering at 180° temporal phase difference. With this stimulus, they psychophysically measured the contrast thresholds for discriminating a pair of flickering grating locked in phase from a pair of flickering grating at opposite temporal phase. Recent reports suggest that the TPD thresholds are affected significantly if the two targets are separated by a spatial gap (Forte, Hogben, & Ross,
1999; Victor & Conte,
2002), so it is likely that the TPD thresholds measured by White et al. may be artificially elevated because of the 30′ gap between their gratings.
To circumvent this problem, we adopted the stimulus used by Mechler and Victor (
2000) (with modifications—explained below) which consisted of two horizontally oriented gratings with similar spatiotemporal configurations except that the temporal phase of one grating was advanced. These gratings were placed adjacent to each other so that there was a temporal phase offset across the edge, which observers had to identify. As there is no separation of the gratings, this kind of stimulus design is an advantage over White et al.'s (
2002) stimulus design. But, as Mechler and Victor accepts, observers found this task difficult (due to only one edge between the two regions) and required extensive practice to acquire a reliable cue. They could perform the task only because of the apparent motion cue across spatially aligned abutting horizontal gratings.
To avoid this motion cue, we modified the stimulus design used by Mechler and Victor (
2000) and generated a novel stimulus (explained below). Our stimulus design had two main advantages. Firstly, the phase offset could be varied to any value in the range of 0° to 180° while White et al. (
2002) could only use 180° phase offset. (In this kind of task, the experimental parameter (phase-offset) can only vary between 0° and 180° as temporal cycle repeats itself in reverse order after 180° and thus 180° is the upper bound to phase offset thresholds.) Secondly, TPD thresholds could be measured at any contrast level whereas White et al. were limited to measuring the contrast required to discriminate temporal phase offset of only 180° between flickering grating pairs.
With our stimulus design, we could characterize the temporal phase encoding for each spatial and temporal frequency at higher multiples of contrast threshold levels (measured as the contrast required to identify orientation of the sinusoidal gratings—orientation identification thresholds). Our results show that the loss of temporal phase encoding ability is dependent on the spatial and temporal frequency of the flickering gratings and only occurs principally at those spatiotemporal parameters associated with the appearance of the FDI.