Here, we provide a quantitative account for the reference field theory with motion-nearest-vector metric. We used the same modeling approach as in our previous study (Clarke, Öğmen, & Herzog,
in press). In order to model the experimental data, we first created movies of the stimulus for each and every stimulus condition. The first stage of the model consists of filters that extract local motion vectors. In general, any number of motion-extraction algorithms suffice to find the appropriate motion vectors (e.g., Adelson & Bergen,
1985; Simoncelli & Heeger,
1998; Watson & Ahumada,
1985) and the choice of motion vector extraction algorithm is not crucial to model performance. Here, since we generated the stimuli artificially, we know the motion vectors exactly, and to save computational time, we replaced the outputs of a filtering stage with the exact, known motion vectors. Since all of our stimuli involve circular motion, the motion vectors were coded in terms of their angular velocity around the fixation point. Following this stage, a reference field is established around each motion vector for each object. The field follows a Gaussian weighting function of the form:
where
dij is the Euclidian distance between a pair of motion vectors' spatial locations (i.e.,
where
x and
y represent the horizontal and vertical coordinates of a motion vector),
σ is a constant specifying the spatial extent of each vector's influence,
G is a gain factor which accounts for imperfect interactions, and
C is a small constant representing the default long-range interactions between motion vectors. Since our results suggest that the motion-nearest-vector distance metric is the best among all four considered, the model computes the distance between each pair of motion vectors at every point in time. The weight fields of the motion vectors that yield the smallest distance determine the strength of interaction. In quantitative terms, this is equivalent to
r(
t) =
mi −
wijmj, where
r(
t) is the perceived motion of the target arc at time
t,
mi and
mj are instantaneous motion vectors of the closest points on the target and reference arcs, respectively, and
wij is the instantaneous weight of the velocity field between the two closest motion vectors. The model finds the modulation amplitude that is necessary to get an instantaneous sign change in
r(
t). This amplitude corresponds to the empirically measured baseline-subtracted PSS. Model fitting was carried out by varying three parameters,
G,
C, and
σ. The best-fitting values were 0.4 for
G, 85 pixels (2.41 deg) for
σ, and 0.15 for
C. The same values of these parameters are used to fit the data from all experimental data in this study. Simulation results are plotted in
Figure 8. In short, the reference field theory with a nearest-motion-vector metric provides a good account for all of our results. It should be noted, however, that the model slightly overestimates the effect size for large velocity modulation factors in
Experiment 4. Note also that alternatively, the combined influence (through weighted averaging) of all neighboring motion vectors could also be subtracted from the motion of the target arc. However, that would indirectly use the form information, and hence, cannot account for all of our results.