After experimenting with different architectures, we found that a very simple model could account reasonably well for our data. The model is composed of three layers of units, and it has no dynamics; its output is an estimate of the magnitude of the OFR. The first layer simply represents the inputs and, more precisely, a thin horizontal strip on the screen. Each unit subtends the same visual angle Δ
x 1 (vertical extent), and its output is 1.0 if the strip is occupied by a drifting vertical grating and 0.0 if it is not:
Here
x 1j is the center of the “receptive field” of the
jth unit in the first layer. The second layer has many fewer units than the first one (i.e., larger “receptive fields”), and each unit computes a weighted sum of the outputs of the first layer. The weighting function is a Gaussian, with unitary height and coefficient of dispersion
σ e. All units in the second layer share the same
σ e, but they are shifted along the input layer, so that each unit in the second layer samples from a different subset of the first layer:
Here
x 2i is the center of the “receptive field” of the
ith unit in the second layer. The third layer has the same number of units as the second layer and carries out a sort of global divisive normalization (Grossberg,
1973; Heeger,
1992) on the output of the second layer:
Here
f(·) is a cumulative Gaussian function:
with parameters
μ n and
σ n . Since
f(·) varies between 0.0 and 1.0, the denominator in
Equation 5 varies between 1.0 and 2.0. Note that it only assumes one of two values: one when one strip is presented and another when a pair is presented, regardless of strip location or separation. The outputs of the third layer are then summed. Since all the model elements are identical, changing the location of a stimulus would only change which units in the third layer are activated but not how strongly. To enable the model to account for the observed dependency of the OFR on eccentricity, the sum of the outputs of the third layer is weighted, with each unit receiving a weight depending on its location:
where
x 3i =
x 2i is the center of the “receptive field” for the
ith unit in the third layer, and
τ is a space constant. Finally, a scaling factor (essentially a sensorimotor gain) is computed (linear fit with zero intercept) to best match the responses to single strips.