Ψ was observed.
Φ is given.
K is the solution to the behavior of the “black box” process. Unfortunately, however, the kernel
K shown in
Figure 3C is not the only correct solution, but just one of many. Why? Because bias and uncertainty can also occur along
time.
In
Figure 4A, the data in
Figure 2E are reproduced as a spatiotemporal plot. The diagonal line indicates the trajectory of the moving bar (rightward at 1 pixel/frame). Each colored point is the observer’s response to the flash; the origin is set at the moving bar that was presented simultaneously with the flash. At first glance, one might be disappointed with the apparent scarcity of data points. What will be seen if the flash is presented somewhere other than along the abscissa? Why not get more data?
Actually, all data are already available. Recall that the moving bar was in continuous translation and that the flash was presented at a random timing during the translation of the moving bar. To the observer, there was no visible indication (e.g., change in direction) of the moving bar at the frame when the flash was presented. Therefore, the moving bar at the origin of
Figure 4A has no special meaning. One can plot the observer’s response to the flash that was presented at, for instance, +5 frames with respect to the coordinates in
Figure 4A, simply by looking at the data structure of
Figure 4A from the viewpoint of the moving bar 5 frames before the flash’s presentation. In
Figure 4B, these responses are plotted simply by shifting the spatiotemporal coordinates and setting their origin at the moving bar 5 frames before the flash. Relative to this particular moving bar, all the flashes were presented at 5 frames in the future. One can repeat the same procedure by setting the origin at each spatiotemporal instance of the moving bar and by plotting responses according to the new coordinates. The superposition of these plots is shown in
Figure 4C. Note that in plotting them there is neither interpolation nor extrapolation of raw data: each point is not an expected hypothetical result that would have been obtained if measured actually, but a result of
actual measurement at each spatiotemporal position. By applying the best-fit cumulative Gaussian shown as the green curve in
Figure 2E, one gets a smoothly curved surface shown in
Figure 5A. Each point on this surface indicates the percentage of “right” responses to the flash presented at each spatiotemporal position. Let us call this surface a spatiotemporal psychometric function.
The profiles shown in
Figure 4A and B are the spatiotemporal events that happen in actual space-time. The difference between these figures is that only the origin of space-time is set at a different instance of the moving bar. Specifically, the color profile in
Figure 4A can be written as
p =
f(
x,
t), where
f indicates the percentage of “right” responses to the flash at a spatiotemporal point (
x,
t). Likewise, the profile in
Figure 4B is
p =
f(
x − 5,
t − 5), plotted relative to the moving bar presented at (−5, −5). Then the superposition, shown in
Figure 4C, is to add
f(
x +
χ,
t +
τ) if and only if (
χ,
τ) is along the motion trajectory (rightward at 1 pixel/frame). As the motion trajectory in space-time can be written as
m(
x,
t) = bool[
x =
t] (where bool[
Q] is 1 if
Q is true, 0 if false), the profile in
Figure 4C is simply written as
. This equation is the definition of spatiotemporal correlation between
f(
x,
t) and
m(
x,
t). In this context, the trajectory of the moving bar (the white diagonal line) can also be called the autocorrelation of the moving bar’s position. Now the abscissa and ordinate of
Figure 4C can be viewed as the relative position and time, respectively, between the flash and moving bar, the latter of which is always located at the origin. Let us call this format a spatiotemporal correlogram.
What does the spatiotemporal correlogram tell us? This 2D data structure makes it clear that the observed psychometric function Ψ, the perfect psychometric function Φ, and the internal kernel K introduced in the previous section are spatiotemporal functions (see Figure 5A–C), Ψ(x, t), Φ(x, t), and K(x, t). The shape of Φ is the performance of a hypothetical noise-free system in which everything is always registered with perfect accuracy and precision: the percentage of “right” responses is always 100% for every flash on the right of the motion trajectory, whereas it is always 0% on the left. The goal of analysis is to discover the internal kernel K that satisfies the relationship Ψ = Φ * K. There is a problem, however: K is not determined uniquely.
Let us begin with formulating the generic form of K(x, t). In the previous section, K(x) was assumed to be a Gaussian function of space, with its μx and σx characterizing spatial bias and uncertainty, respectively. However, if the visual system somehow produces spatial bias and uncertainty, for the same reason it should produce temporal bias and uncertainty as well. When the observer is supposed to judge the relative position between a simultaneously seen pair of motion and flash, perceptual simultaneity might be biased so that it tends to be between a moving bar at the present and a flash in the past—moreover, to an uncertain extent in the past. As in the case of spatial bias and uncertainty, let us assume that temporal bias and uncertainty are characterized by a Gaussian function (with μt and σt). Putting them together, the kernel K(x, t), which is the probability density function of perceptual spatiotemporal alignment, forms a 2D Gaussian in space-time. To avoid further complication, its covariance is hereafter assumed to be zero, i.e., its spatial and temporal components are independent of each other. (In fact, in a preliminary version of the fitting analysis described below, the space-time correlation coefficient ρ was also included as one of free parameters, but it yielded the best-fit ρ of only 0.079.)
The kernel
K(
x) illustrated in the previous section is now described as a special case of the 2D Gaussian, when
μt = 0 and
μt → 0. Indeed, convolution of
Φ with this particular
K equals
Ψ. Another extreme example of
K is a pure temporal function.
K could also be other shapes in between. Note that all these candidates equally satisfy the relationship
Ψ =
Φ *
K. Therefore, although the above analysis clearly proposes that the flash-lag effect is viewed as a spatiotemporal correlation structure, the particular data set used in
Figure 5 is not informative enough to determine the shape of
K.