The results of
Experiment 2 show that the spatial limitations of fast temporal segmentation are consistent with the spatial limitations of typical neurons in V1. This suggests that temporal-phase segmentation and the perception of phantom contours may be mediated by early cortical visual neurons. Indeed, it has already been suggested that spatiotemporal filtering by neurons in the early stages of visual processing can account for segmentation in other forms of temporally structured display. For example, Lee and Blake (
1999) found that in a dynamic texture, a target region in which all elements reversed their motion in synchrony appeared distinct from a background in which reversals were uncorrelated between elements. In the apparent absence of cues from luminance, contrast, and coherent motion, this suggested that observers could rely on “temporal microstructure” to perform spatial grouping and segregation. Adelson and Farid (
1999), however, showed that the temporal filtering properties of early vision would reveal static form in such displays, at least in their original form (see also Farid & Adelson,
2001).
If temporal segmentation in phantom-contour stimuli is also achieved by spatiotemporal filtering, how might a single cell extract the modulation energy falling within its RF? A counterphasing edge may be characterized in two dimensions: one dimension of space perpendicular to the edge (
x), and one dimension of time (
t). The spatiotemporal (
x–
t) profile comprises a step function in the
x dimension multiplied by a sinusoid in the
t dimension. The RF of an ideal detector will have a sensitivity profile that matches the stimulus (Watson, Barlow, & Robson,
1983). As noted by Forte et al. (
1999), this ideal
x–
t profile is well approximated by linear spatiotemporal separable RFs such as those initially mapped in cat striate cortex (DeAngelis, Ohzawa, & Freeman,
1993a,
1993b,
1995). Such RFs are also present in V1 of macaques: Pack, Conway, Born, and Livingstone (
2006, figure 12a), for example, present an RF in which the spatial (
x) profile reverses in polarity over approximately 25 ms. This interval is ideal for the detection of modulation at 20 Hz, the rate used in the current study.
It would thus appear that basic neural mechanisms found in primary visual cortex are suited to the detection of counterphasing luminance modulation. Given the results of
Experiment 2, it is reasonable to suggest that these units might underlie fast temporal segmentation and the perception of phantom contours. If this is the case, it should be feasible to obtain independent psychophysical estimates of RF size using spatial-summation techniques (Anderson & Burr,
1987,
1991; Gorea,
1985; Howell & Hess,
1978). We did so in
Experiment 3, providing further support for our hypothesis that temporal segmentation is mediated by an early cortical mechanism.