The cause of the model's instability is the relatively long visual delay. This is illustrated by simulations of the model in which the delay is set to 0 ms (
Figure 2C). Now, the vergence responses become fast and damped. The problem created by the delay is that disparity in combination with its time integral cannot produce the required stepwise changes of plant input. A solution for this problem is to use target vergence instead of disparity as the driving stimulus because target vergence of static targets is constant by nature. Since target vergence is not signaled by any individual visual or motor signal, it can only be represented by a combination of signals. Target vergence
θ may be represented by a signal
θ′ that is obtained by combining an estimate of disparity
d′ and a neural signal indicating ocular vergence
ϕ′. At the location in the model where
θ′ is computed,
θ′ is given by the following equation:
Corollary discharge, also called efference copy, of the vergence state is assumed to supply the signal
ϕ′ in the model (
Figure 3A). The delay of the corollary signal is indicated by Δ
t c. Since signal
d′(
t) represents disparity of the stimulus
d(
t − Δ
t v) and signal
ϕ′(
t) represents ocular vergence
ϕ(
t − Δ
t c − Δ
t e), the signal
θ′(
t) can only represent target vergence of the stimulus
θ(
t − Δ
t v) accurately if Δ
t v = Δ
t c + Δ
t e. To study stability effects due to variability in the delays, I have divided the total time Δ
t of the feedback loop into visual (Δ
t v) and motor delays (Δ
t m). Visual delays have been measured by Schmolesky et al. (
1998). Relative to stimulus onset, the authors measured delays of neural activity in V1 (44 ms), V2 (59 ms), V3 and MT (72 ms), and FEF (80 ms) in monkey. Assuming that target vergence is first represented at the level of the inferior temporal cortex where signals arrive slightly later than in MT and assuming that visual delays are similar in human and monkey, I have chosen Δ
t v = 75 ms. In accordance with Zee et al. (
1992), I assume that Δ
t e = 3 ms and thus much shorter than the delay of visual feedback. As a result, Δ
t c = 72 ms. In order to keep the duration of the total delay of the loop at 160 ms, the motor delay has to be chosen as Δ
t m = 85 ms. The term motor delay as it is used here is slightly deceptive because, among other processing times, the delay includes time needed for the decision-making process in area FEF. The model presented in
Figure 3A resembles the local feedback model for the generation of pure saccades, originally proposed by Robinson et al. (Robinson,
1975; Zee, Optican, Cook, Robinson, & Engel,
1976). The saccadic model was later extended by a similarly structured vergence model to describe pure vergence movements and to explain saccade–vergence interaction by shared omnipause neuron gating (Zee et al.,
1992). A major difference between the Zee model and the current model is the neural representation of target vergence. The current model enables vergence to respond continuously to visual feedback. In order to extract a disparity signal from the target vergence signal, I propose that the signal representing target vergence,
θ′, is compared with a local neural estimate of the vergence state,
ϕ′. Ideally, the resulting signal
d″ is a short-latency estimate of the retinal disparity signal
d. Similarly as
d′ in the previous model (
Figure 2A),
d″ drives a set of premotor neurons to produce the velocity command VC. VC's output is passed to the pulse command PC and both neural integrators NI. PC generates the initiation of vergence responses, whereas the NIs produce the eye position commands that are needed to keep vergence at the desired steady state. The vergence position command is fed back after a short time delay of Δ
t e to provide the local efference copy of
ϕ necessary for the computation of
d′. The plant input PI is described by the following equations:
where
and
where
and
where
where
and
Figure 3B shows simulations by the model for three VC and PC values. Vergence responses are damped but slow (red trace) for small VC values. Higher VC values result in underdamped responses (blue and green traces). The model's behavior is inappropriate because VC alone should produce slow and damped responses that by addition of the PC pulse component should become sufficiently fast. As mentioned before, input signal
d″ should be a short-latency estimate of the disparity stimulus
d. In other words, it is not the visual feedback signal
d′, with its long delay, driving the response but the short-delay estimate
d″. By doing so, the stability of the system should be greatly increased. The left graph of
Figure 3C shows that this is not the case due to the fact that
θ′ does not represent target vergence
θ well during the vergence response. The reason for the mismatch is that the chosen corollary discharge signal (i.e., the output of the vergence–velocity integrator NI) represents ocular vergence in static but not in dynamic conditions. The solution to this problem is to reconstruct an estimate of the actual eye rotations from the neural commands using a dynamic model of the plant (DP in
Figure 4A). The right graph of
Figure 3 shows that the signal Le′ − Re′ of this model is better than
ϕ′ in representing
ϕ. Now,
θ′ correctly signals the stepwise change in target vergence. The signal NI is not suited to serve as input for DP because DP requires the same input as does the plant, and therefore, its input should also include the PC signal. An appropriate input signal for DP is a signal that equals signal PI. Consequently, in the set of
Equations 6 to
13, which describes the plant input PI of the model,
Equation 11 is replaced by
Now, target vergence is accurately represented in both static and dynamic conditions (see signal
θ″ in
Figure 4B).
Figure 4C shows simulations of the model for various PC values. Both VC and PC contribute to the speed of vergence responses. The PC signal increases speed and decreases duration of the responses. High PC values generate overshoot of the vergence response, however, without causing further oscillations. After overshooting target vergence, responses decay gradually to their steady state (
Figure 4C, blue and green traces). The combination of overshoot and decay resembles vergence responses that have been observed in vergence adaptation experiments (Munoz et al.,
1999; Takagi et al.,
2001). The model cannot generate overshoot of the vergence responses if PC is absent, irrespective how high the value of VC. The reason is that without the contribution of PC, PI cannot surpass or even reach the stepwise change that is required for the production of realistic vergence responses (
Figure 4C, red trace). Plant input changes about stepwise for specific combinations of VC and PC values (
Figure 4D, red trace). This example shows that contributions of pulse-like (PC) and integrated (NI) input signals in parallel are needed to produce step changes of motoneuron activity. Higher PC values cause pulse-step-like plant input signals and some overshoot of the vergence responses (
Figure 4C, blue and green traces). Since delays related to different processes play a role in the model, it is necessary to simulate vergence responses for several combinations of the delays Δ
t c, Δ
t v, and Δ
t m. The model is only realistic if it allows for some natural variability in the delays. The combination Δ
t c and Δ
t v is critical because it determines how closely the neural representation of target vergence matches the stimulus.
Figure 4E shows simulations in which Δ
t c and Δ
t v differ by 10 ms from the ideal condition where Δ
t c and Δ
t v are identical. The mismatches between the delays cause only moderate deviations in the later part of the vergence responses. A striking property of the model is that neither motor delay Δ
t m nor visual delay Δ
t v affects the dynamics of the vergence responses, provided that the visual delay Δ
t v is about equal to the corollary delay Δ
t c. Visual, efferent, and motor delays only affect the latency of vergence responses. Local feedback determines the dynamics.