Here we provide the details of the spatiotemporal filtering model we developed. It is designed to predict the color-opponent response modulation that the stimuli in our experiments would create. We assume that the visibility of color breakup is monotonically related to the response modulation.
The stimulus was a bright achromatic rectangle on a dark background. The rectangle moved horizontally, so we considered only the horizontal dimension of space. We first defined the input stimulus in two-dimensional space-time coordinates as in
Figure 2. The input is in retinal coordinates. We were interested mainly in the chromaticity of the percept, so we converted the RGB values of the input stimuli into values in opponent color space, which approximates the luminance and chromatic channels of the visual system (Poirson & Wandell,
1993; Zhang & Wandell,
1997). First, we linearized the RGB values. Because the stimulus in our simulation was a white target on a black background, we set the linearized RGB to [1 1 1] in the target and [0 0 0] in the background. We then transformed the linearized RGB values into tristimulus values in the CIE XYZ color space. If the values conform to sRGB IEC61966-2.1, a standard color space, the transformation is (Winkler,
2005):
The transformation from CIE XYZ color space to opponent color space is (Zhang & Wandell,
1996):
As the displayed image changes over time, the luminance of pixels changes accordingly. Ideally, temporal changes would be instantaneous, but the display has its own temporal response. We incorporated the temporal properties of the display (ViewSonic G225f) by measuring its temporal impulse-response function (IRF). The function was very close to an ideal exponential decay with a time constant of 1.5 msec. All simulation results included the display's IRF. (When we replaced the IRF with a delta function, the simulation results were nearly identical, which means that the display's IRF was short relative to the assumed human IRF.)
The resulting values in opponent color space, passed through the display's IRF, were then convolved with the IRFs of the two chromatic channels. To calculate IRFs, we adopted the red-green channel's contrast sensitivity function (CSF) as described by Burbeck and Kelly (
1980) and Kelly (
1983) (
Equation A3). The CSF is the sum of excitatory (
E) and inhibitory (
I) components; it is not separable in the space-time domain even though the excitatory and inhibitory components are separable.
Es,
Et,
Is, and
It are defined, respectively, as the spatial response of the excitatory component, temporal response of the excitatory component, spatial response of the inhibitory component, and temporal response of the inhibitory component. The last equation under A3 is commonly used in deriving the other equations in A3.
fs1,
fs2,
ft1, and
ft2 are constants that depend on individual variances. We adopted values for
fs1,
fs2,
ft1, and
ft2 measured by Burbeck and Kelly (
1980): 10 cpd, 0.5 cpd, 19 Hz, and 1 Hz.
The CSF has no phase information, so we had to reconstruct phase, which we did by extending the assumption of a minimum-phase filter (Stork & Falk,
1987; Kelly,
1969;
Mason & Zimmermann, 1960) to the space-time domain. Such an assumption is reasonable for chromatic channels (Burr & Morrone,
1993). We assume a complex transfer function,
H, which is the Fourier transform of the IRF of the visual system,
h.
Here, the modulus is the same as the CSF. Our goal was to estimate
θ(
fx,
ft) in order to reverse-engineer
h. First, we computed the logarithm of the transfer function. The log of the modulus and the unknown phase become, respectively, the real and the imaginary parts:
If these real and imaginary parts are a Hilbert transform pair, then
h is real and causal (Mason & Zimmermann, 1960). The Hilbert transform pair is not the only solution for causal and real
h, but it is the solution that satisfies minimum phase. For a signal
k whose independent variable is
t, the Hilbert transform is
where the Cauchy principal value takes care of the integration around the singularity at
t =
t′. In our case, we want a causal signal in time such that
However, we do not need such a constraint in the space dimension. Thus we apply the Hilbert transform along the temporal frequency dimension only. The convolution and transforming integration that we used were:
where
δ is the Kronecker delta function. With the phase term estimated, we calculated the IRF
h(
x,
t) by taking the inverse Fourier transform of
H(
fx,
ft). The calculated IRF was causal and real. We also used the same IRF for the yellow-blue channel because it is very similar to its red-green analog (Mullen,
1985; Tong, Heeger, & van den Branden Lambrecht,
1999).