If duration dilation occurs, ROs should be greater than 1. Moreover, if Brown's (
1995) finding could be extrapolated to a subsecond timeframe, ROs of faster stimuli should be larger than those of slower ones, suggesting two alternative predictions (
Figure 3). On the one hand, if the temporal frequency of the stimulus determines the illusion, ROs would be best described as an increasing function of temporal frequency and would show no differences across spatial frequency conditions (
Figure 3a). On the other hand, if the speed of the stimulus determines the illusion, ROs would be influenced by both temporal and spatial frequencies because the speed of a sinusoid is defined as temporal frequency divided by spatial frequency (
Figure 3b).
We calculated each RO for each subject and each spatiotemporal frequency condition and then averaged the data across all subjects for each condition. These results, shown in
Figure 4, indicate that RO increased as temporal frequency increased and as spatial frequency decreased. Clearly, RO was influenced by both temporal and spatial frequencies.
The main question we asked was whether the magnitude of this illusion is determined by temporal frequency or by speed. To address this question, we performed a multiple linear regression analysis, with spatial frequency and temporal frequency as the independent variables. The function is written as RO =
β 0 +
β 1 S +
β 2 T, where
S indicates log2(spatial frequency) and
T indicates log2(temporal frequency). The best-fit function, which defines a slanted plane above the two-dimensional spatiotemporal frequency axes, was plotted as a mesh plot (see
Figure 4). The corresponding equation is given as follows:
If either the
S or the
T axis alone were the determinant factor of the illusion, we should obtain a slanted plane passing through the
T or
S axis, respectively (
Figure 3a).
If, on the other hand, stimulus speed governed the illusion, we should obtain a slanted plane with a 45 deg tilt, or equivalently, RO =
k +
aV =
k −
aS +
aT, where
V indicates log2(speed) and
k and
a are constants (
Figure 3b). The best-fit function supported the second prediction, i.e., that speed governs the illusion. The coefficients of
S and
T were both significant,
t(153) = −2.99,
p < 0.05 for
S;
t(153) = 2.98,
p < 0.05 for
T. This finding confirms that both frequency components contribute to the magnitude of the illusion. Furthermore, the values of the two coefficients were similar in size but with opposite signs. In fact, the value of each coefficient fell within the confidence limits of the other, reflecting no significant difference between them. Since
S and
T are logarithmic,
T −
S provides a logarithmic measure of stimulus speed, or
V. Therefore,
Equation 2 can be rewritten as
This indicates that stimulus speed was the major determinant of RO; as speed increased, RO increased proportionally with log speed. The influence of S was unclear and did not reach statistical significance.
In the above, we performed the linear regression using the linear plane model, but we also tested whether functions with nonlinearity could yield better fits to our data. We tested the quadratic function, RO =
β 0 +
β 1 S +
β 2 T +
β 3 S 2 +
β 4 T 2 +
β 5 ST, and compared the goodness of fit in terms of Bayesian Information Criterion (BIC)
1. The best-fit parameters were (1.283, −0.006, 0.028, −0.020, 0.007, −0.017), and BIC = 71.19, whereas BIC = 58.64 in the linear fit. We next tested a linear plane model with a floor nonlinearity, RO = max(
β 0 +
β 1 S +
β 2 T, β 6), but the best-fit parameters (1.258, −0.062, 0.051, 1.185) resulted in a poor fit, namely BIC = 63.01. Therefore, we concluded that the linear plane model without nonlinearity was the most reasonable one.
Having demonstrated the dependency of RO on speed, we replotted RO data against stimulus speed;
Figure 5 shows the results of these calculations. Again the RO data were well described as an increasing linear function of log speed with spatial frequency having little influence. We also tested a model with a quadratic term, RO =
β 7 V 2 +
β 8 V +
β 9. The best-fit parameters were (0.006, 0.033, 1.247), but only
β 8 and
β 9 were significantly different from 0 (
p < 0.05), which confirmed the claim that the RO increased proportionally with log speed.
One might argue that the data points at high temporal frequencies might construct a steeper slope than the slope at low temporal frequencies and that, if high temporal frequency conditions were excluded, the best-fit function for the remaining data points might become flat. Did high frequency conditions distort the overall data shape? To address this question, we performed the simple linear regression analysis on the data shown in
Figure 5 excluding 8 and 16 Hz conditions. The result showed that the slope of the best-fit line became less steep, 0.032, but it was still significantly greater than 0 (
p < 0.05).
We must note here that in some parts of the data it was not clear whether only stimulus speed governed RO. Let us focus upon the 1 Hz conditions in
Figure 4. Within these conditions, there seemed to be only small differences of RO values among the four spatial frequencies. When one-way ANOVA was performed within these conditions, we could not find a statistically significant effect of spatial frequency (
F(3,21) = 0.43,
p > 0.05). The reason for this is not clear, but we think that there might be two possible reasons. One possibility is that the speed dependence of the illusion that we propose here exists only in the frequency area higher than 1 Hz. Since the aforementioned linear regression analysis on the data excluding 8 and 16 Hz showed a shallower slope, we cannot reject this possibility. The other possible reason is the noisy nature of the data. We noticed that even at 1 Hz, some tendency of speed dependence was seen for the data at 1, 2, and 4 c/deg, but the point at 0.5 c/deg was located irregularly against the speed dependence scenario. At this low spatial frequency, subjects might be able to track the displacement of one of luminance stripes in the Gabor patch to judge duration based on displacement information. In any event, currently we cannot resolve whether the conclusion of speed dependence is applicable to all data points with a conditional statement about random noise, or whether the conclusion is applicable to a subset of parameter space without 1 Hz conditions. We do not emphasize that duration dilation should strictly obey a linear function of log speed at every spatiotemporal frequency point; a rising function with some nonlinearity and floor/ceiling effects would be more realistic. However, since the present study lacks strong evidence for these claims, it is concluded that linear speed dependence is the best summary of the present data.
These results are consistent with previous studies demonstrating that the duration of a moving stimulus is perceived to be longer than that of a stationary stimulus (Brown,
1995; Lhamon & Goldstone,
1975; Mitrani & Stoyanova,
1982). Our results demonstrating the importance of stimulus speed are also consistent with previous studies that focused on longer time intervals (Brown,
1995; Tayama et al.,
1987).
In addition, we demonstrated that a Gabor patch, with a moving carrier within a stationary contrast envelope that never changes its overall position in the visual field, also creates motion-induced duration dilation. This new finding implies that a motion trajectory is not required for the illusion to occur.