Article | September 2014
Parameter recovery for the time-window-of-integration (TWIN) model of multisensory integration in focused attention
Journal of Vision September 2014, Vol.14, 14. doi:https://doi.org/10.1167/14.11.14
Citation: Farid I. Kandil, Adele Diederich, Hans Colonius; Parameter recovery for the time-window-of-integration (TWIN) model of multisensory integration in focused attention. Journal of Vision 2014;14(11):14. https://doi.org/10.1167/14.11.14.

Abstract
In multisensory settings such as the focused attention paradigm (FAP), subjects are instructed to respond to stimuli of the target modality only, yet reaction times tend to be shorter if an unattended stimulus is presented within a certain spatiotemporal vicinity of the target. The time-window-of-integration (TWIN) model successfully predicts these observed cross-modal reaction time effects. It proposes that all the initially unimodal information must arrive at a point of integration within a certain time window in order to be integrated and thus to initiate response enhancements such as the observed reaction time reductions. Here we conducted a parameter recovery study of the TWIN model for focused attention tasks, with five parameters (the durations of the unimodal visual and auditory first stages and of the integrated second stage, the width of the time window, and the effect size). Results show that parameter estimates were highly accurate (unbiased, constant error less than 5 ms) and precise (variable error less than 8 ms) throughout, speaking to a high reliability and criterion validity of the procedure. Further analyses ensured that the estimation procedure is consistent and sufficiently robust against contamination (faulty integration). It can thus be used to estimate reliably the point of integration and the width of the time window.

Introduction
Evidence for multisensory integration is found in many different forms, notably as either facilitation or inhibition of responses to cross-modal stimulus sets compared to unimodal stimulation. Prime examples of facilitation include increased firing of multisensory neurons in the superior colliculus (Wallace, Wilkinson, & Stein, 1996), acceleration of manual or saccadic reaction time (Frens, Van Opstal, & Van der Willigen, 1995), and improved performance in audiovisual speech perception (Sumby & Pollack, 1954). The behavioral context considered here is the focused attention paradigm (FAP), in which a visual and an acoustic stimulus are presented to the participants under specified spatiotemporal conditions (Figure 1). Their task is to respond only to the onset of a stimulus from a predefined target modality (e.g., the visual) and to ignore the possible occurrence of a nontarget stimulus from the other modality (here, the auditory). It has nevertheless been shown that the nontarget stimulus can modulate the response, manual or saccadic, to the target. Depending on the spatiotemporal configuration of target and nontarget, this cross-modal effect results either in a speedup (facilitation) or a slowdown (inhibition) of the response (Amlôt, Walker, & Spence, 2003; Diederich & Colonius, 2007a). A prominent concept for describing the underlying dynamics is the spatiotemporal “window of integration” (e.g., Meredith, 2002). We have developed a quantitative model framework, the time-window-of-integration (TWIN) model, to describe and predict reaction time (RT) modulation under cross-modal stimulation (Colonius & Diederich, 2004; Diederich & Colonius, 2012). A series of experimental studies found support for the main assumptions of the model (Diederich & Colonius, 2008a, 2008b; Colonius, Diederich, & Steenken, 2009). The purpose of this study is to probe how well our model-fitting algorithm is able to identify the numerical parameters controlling the model predictions. To this end, data are first generated from the model using a range of model parameter values; second, the parameters estimated from the simulated data are compared with the parameter values actually used to generate them (parameter recovery).
Figure 1

Layout of and typical results for the focused attention paradigm (FAP). (A) Experimental setup. Sets of combined loudspeakers and LED lights are arranged in a semicircle around the subject. Subjects are instructed to focus their attention on the target stimulus and to identify at which side it appears by looking at it (saccadic response). In the example used throughout this study, the visual stimulus is the target and the acoustic stimulus is the nontarget. (B) Spatial layout. In more complex FAP settings, the visual target and the acoustic nontarget are presented either at the same side (ipsi) or at opposite sides (contra), thereby influencing whether the visual and the acoustic cue are perceived as emerging from one source or from different sources. In the evaluation study conducted here, however, we concentrated on the temporal cues, that is, as if visual and acoustic stimuli always emerged from the same position. (C) Temporal layout of a single trial. By convention, the visual target stimulus is presented at t = 0, whereas the accessory acoustic stimulus is presented either before (negative SOAs: τ < 0), in synchrony with (τ = 0), or after the visual cue (positive SOAs: τ > 0). In the sketch, τ = −100 ms. (D) Typical results of FAP experiments. In comparison to the condition in which the visual cue is presented alone (uniV, rightmost), reaction times are significantly reduced if the acoustic stimulus precedes the visual. This facilitation (gain) is generally interpreted as indicating successful integration of the two stimuli.
The TWIN model postulates that a cross-modal stimulus triggers a race mechanism in the very early, peripheral sensory pathways, which is then followed by a compound stage of converging subprocesses that comprise neural integration of the input and preparation of a response. Note that this second stage is defined by default: it includes all subsequent, possibly temporally overlapping, processes that are not part of the peripheral processes of the first stage. The central assumption of the model concerns the temporal configuration needed for multisensory integration to occur: multisensory integration occurs only if the peripheral processes of the first stage all terminate within a given temporal interval, the time window of integration (TWIN assumption). Thus, the window acts as a filter determining whether afferent information delivered from different sensory organs is registered close enough in time to trigger multisensory integration. Passing the filter is necessary, but not sufficient, for cross-modal interaction to occur because the amount of interaction may also depend on many other aspects of the stimulus set, such as the spatial configuration of the stimuli. The amount of cross-modal interaction manifests itself as an increase or decrease of second-stage processing time. Although this amount does not directly depend on the stimulus onset asynchrony (SOA) of the stimuli, temporal tuning of the interaction occurs because the probability of interaction is modulated by the SOA value. The five parameters to be recovered are (a, b) the parameters 1/λV and 1/λA determining peripheral visual and auditory processing time, respectively; (c) the parameter μ representing the average second-stage (central) processing time; (d) the parameter ω giving the width of the time window of integration; and (e) the parameter Δ denoting the size of the cross-modal interaction effect in milliseconds.
The model framework encompasses both the FAP paradigm and the redundant signals paradigm (RSP) where, in the latter, stimuli from either modality are defined as targets (e.g., Colonius & Diederich, 2012). Here we focus on TWIN modeling of FAP only. For this paradigm, it is additionally assumed that cross-modal integration only occurs if (a) a nontarget stimulus wins the race in the first stage opening the time window of integration, such that (b) the termination of the target peripheral process falls into the window. One interpretation is that a winning nontarget will keep the system in a state of heightened reactivity such that the upcoming target stimulus, if it falls into the time window, will trigger cross-modal interaction.1 If a stimulus from the target modality is the winner of the race in the peripheral channels, second-stage processing is initiated without any multisensory integration mechanism being involved. 
In order to derive quantitative predictions for an empirical test of the model, further assumptions about the probability distributions of the various process durations postulated in TWIN have been introduced.2 These distributions are specified by numerical parameters that have to be estimated from the observed RT data. The model has been probed in a series of experiments under a variety of empirical conditions (Colonius et al., 2009; Diederich & Colonius, 2007b, 2008a). While these tests have generally supported the essential features of the model, the quality of a model test critically depends on one's ability to obtain good estimates of the model parameters. The purpose of the current simulation study is therefore to probe whether, and how well, our parameter estimation method is able to recover the correct parameters. To this end, RT data sets are randomly generated from the model's probability distributions with fixed, a priori known parameter values. Next, estimates of these parameters are computed on these simulated data sets in order to check how well the original parameter values can be recovered. Obviously, the quality of the recovery results will depend both on properties of the model and on the efficiency of our estimation procedure. The next section provides details of the time window model and its simulation.
Time window of integration model
For concreteness, we assume the visual modality to be the target modality and the auditory to be the nontarget modality. The race in the first stage is represented by two statistically independent, exponentially distributed random variables V and A denoting the peripheral processing times for the visual and the acoustic stimulus, with density

f(x) = λe^(−λx), x ≥ 0, λ > 0,   (Equation 1)

and specific parameter values λ = λA and λ = λV for the auditory and visual modality, respectively.
With τ as the SOA value and ω as the integration window width parameter, the above requirement for multisensory integration to take place is the realization of the event

Iτ,ω = {A + τ < V < A + τ + ω},

i.e., the nontarget stimulus wins the race in the first stage, “opening the time window of integration,” such that the termination of the target peripheral process V falls into the window. Here, a positive τ value indicates that the visual stimulus is presented before the acoustic, and a negative τ value indicates the reverse presentation order. Observable total reaction time is the sum of the first-stage processing time of the target modality, here V, and the second-stage processing time, M, which is assumed to follow a normal distribution and comprises all subsequent processes including motor preparation and execution and, in the bimodal condition, multisensory integration. The mean of the distribution of M differs depending on whether a unimodal or a bimodal condition is considered. For unimodal trials, M has mean μ and variance σ² (Figure 2A). For bimodal trials, second-stage processing time depends on whether or not the condition for multisensory integration, Iτ,ω, is met in a given trial (Figure 2B, C). Therefore, reaction time in the bimodal condition is a mixture of two distributions with means μ and μ − Δ, respectively, and mixing parameter P(Iτ,ω). Thus, mean reaction times in the unimodal and bimodal conditions are, respectively,

E[RTV] = 1/λV + μ and E[RTVA] = 1/λV + μ − Δ · P(Iτ,ω),   (Equation 2)

with λV denoting the exponential parameter for V introduced above and E the expectation operator (mean) of random variables. From Equation 2 it is obvious that when the cross-modal effect Δ equals zero or when the probability of integration P(Iτ,ω) is zero, expected reaction times in the unimodal and bimodal conditions will be identical.3 The probability of integration, P(Iτ,ω), is a function of the exponential parameters λV and λA for V and A, the window width parameter ω, and the SOA value τ, which is determined by the experimental setup. Explicit expressions for P(Iτ,ω), depending on the signs of τ and τ + ω, can be found in the Appendix; see also Colonius and Diederich (2011).
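To make these mean-level predictions concrete, the following minimal sketch (in Python; the original implementation was in MATLAB, and all names here are ours) estimates P(Iτ,ω) by Monte Carlo sampling of the two exponential races and then evaluates Equation 2:

```python
import numpy as np

rng = np.random.default_rng(1)

def p_integration(lam_a, lam_v, tau, omega, n=200_000):
    """Monte Carlo estimate of P(I_tau,omega) = P(A + tau < V < A + tau + omega)."""
    A = rng.exponential(1.0 / lam_a, n)   # peripheral auditory processing times
    V = rng.exponential(1.0 / lam_v, n)   # peripheral visual processing times
    return np.mean((A + tau < V) & (V < A + tau + omega))

def mean_rt_bimodal(lam_v, mu, delta, p_int):
    """Equation 2: E[RT_VA] = 1/lambda_V + mu - delta * P(I)."""
    return 1.0 / lam_v + mu - delta * p_int

# a plausible vector: 1/lambda_A = 20 ms, 1/lambda_V = 50 ms, mu = 100, omega = 200, delta = 50
lam_a, lam_v, mu, omega, delta = 1 / 20, 1 / 50, 100.0, 200.0, 50.0
print(f"unimodal: E[RT_V] = {1 / lam_v + mu:.1f} ms")
for tau in (-200, -100, 0, 100, 200):
    p = p_integration(lam_a, lam_v, tau, omega)
    print(f"tau = {tau:+4d} ms   P(I) = {p:.3f}   E[RT_VA] = {mean_rt_bimodal(lam_v, mu, delta, p):.1f} ms")
```

The Monte Carlo estimate converges to the closed-form expressions for P(Iτ,ω) given in the Appendix; it is used here only to keep the sketch free of case distinctions over the signs of τ and τ + ω.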
Figure 2

Outline of the time-window-of-integration (TWIN) model. The TWIN model posits that the subject's reaction time critically depends on the time of arrival of the attended visual and the unattended acoustic cue at the point of integration. (A) The RT for the unimodal condition is the sum of the durations of the sensory processing time for the target visual stimulus (V, parameter 1/λV) and of the secondary common processing time (M, parameter μ). (B) & (C) In the bimodal case, a nontarget acoustic cue is presented in addition to the visual cue at a given SOA (τ). Upon arrival of the auditory cue at the point of integration (processing time A, parameter 1/λA), a time window for integration is opened for a period of length ω. (B) If the target visual information arrives at the point of integration while the time window is open, then the stimuli are integrated and the duration of the second stage is reduced by a given amount of time (parameter Δ). (C) However, the cues are not integrated and RT is not reduced if the visual information arrives either before the temporal window has been opened (e.g., due to a negative SOA like τ1 in the upper half of the panel) or after the window has already been closed again (due to a long positive SOA like τ2, lower half of the panel).
Choosing the exponential distribution in TWIN is mainly motivated by computational simplicity. Adding a normally distributed second stage duration then results in the ex-Gaussian distribution for the observable reaction times, a distribution that has found some empirical support (Luce, 1986). Given the somewhat arbitrary choice of the ex-Gaussian, one would not expect the model to validly represent variability in the data and, thus, model fitting for TWIN has up to now been limited to the level of mean RTs.4 
Parameter recovery and simulation of TWIN
An experiment with simultaneous presentation of the visual target and the auditory nontarget (i.e., τ = 0) yields the following model equation for the bimodal mean:

E[RTVA] = 1/λV + μ − Δ · P(I0,ω), with P(I0,ω) = P(A < V < A + ω) = [λA/(λA + λV)] (1 − e^(−λV ω)).
There are five parameters to be estimated, λA, λV, μ, ω, and Δ, but only two “observables” are available, i.e., the unimodal and bimodal sample means for RTV and RTVA. Thus, the model and its parameters are clearly not identifiable. However, the situation can easily be remedied by increasing the number of SOAs. With four SOA values there will be model equations for four bimodal means and one unimodal mean, i.e., an equal number of “observables” and parameters (a “saturated” model). Importantly, however, note that here we are not investigating formal model identifiability of TWIN using theoretical approaches as developed, for example, in Bamber and van Santen (2000). Rather, we want to ascertain in a pragmatic way whether our parameter estimation method is able to recover the model parameter values from data sets that have been generated under the model with those same parameters. Before we present the details of the simulation procedure in the next section, two more issues have to be addressed.
First, we need to specify how to measure the “goodness” of the parameter recovery. In statistics, a common measure for the quality of a finite-sample point estimator θ̂ for a parameter θ is the mean-squared error (MSE), defined by E[(θ̂ − θ)²]. It is well known (Casella & Berger, 2002, p. 330) that

MSE(θ̂) = Var[θ̂] + (E[θ̂] − θ)²,

where (E[θ̂] − θ)² is the squared bias, (Bias θ̂)². Thus, the MSE incorporates two components: Var[θ̂], measuring the variability of the estimator (precision), and the squared bias, measuring its accuracy. In analogy to this definition, we evaluated the precision and accuracy of the parameter recovery resulting from the simulations by considering the variability and central tendency of the parameter estimates across different simulated data sets. Moreover, we tested the robustness of the parameter estimation method, that is, its ability to recover the original parameter values when participants committed certain errors, such as integrating although the event Iτ,ω was not realized (or vice versa).
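As a quick numerical illustration of this decomposition (our own toy example, not part of the original study), the following sketch verifies that the empirical MSE of a deliberately biased estimator equals its variance plus its squared bias:

```python
import numpy as np

rng = np.random.default_rng(0)
theta = 50.0                                   # true parameter value (ms)
# deliberately biased estimator: sample mean of n = 20 draws, shrunk by 5%
est = 0.95 * rng.normal(theta, 25.0, size=(100_000, 20)).mean(axis=1)

mse = np.mean((est - theta) ** 2)
var = np.var(est)                              # precision component
bias = np.mean(est) - theta                    # accuracy component
print(mse, var + bias ** 2)                    # agree up to floating-point error
```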
The second issue concerns the generation of random variables in TWIN. Variability in the first stage is determined by the exponential parameters λA and λV, which will be estimated by model fitting at the level of means. However, the variance of the normally distributed processing duration in the second stage, σ², is not included in this model fitting procedure and, therefore, cannot be “recovered.” On the other hand, in order to generate random data sets from the model, this parameter needs to be specified as well. We chose to fix the value at σ = 25 ms, a value that seems plausible given some experimental results obtained earlier (Diederich, Colonius, & Schomburg, 2008). Moreover, some pilot simulations suggested that the outcome of our main simulation study did not depend on the exact value of σ.
The main part of this evaluation study assessed the accuracy and precision of the parameter estimation process in seven steps depicted in Figure 3: (a) For each of the five parameters of the basic TWIN model, we defined a set of parameter values. (b) We systematically combined them into 576 parameter vectors and (c) conducted a virtual experiment with 40 virtual subjects, all of them having that parameter vector as their intrinsic parameter values. The experiment comprised Steps 4–6, in which reaction times were generated for each subject for a number of conditions of a multisensory integration task. The means of these RTs were then used to estimate the parameters for each individual subject (subject level). Estimates across all subjects in an experiment were evaluated in comparison to the original parameter values for each experiment individually (experiment level). Finally, these evaluations were gathered across the 576 experiments to assess how well the parameter estimation process works when confronted with all sorts of parameter vectors: plausible, odd, and impossible ones (see below).
Figure 3

Outline of the parameter recovery evaluation. The evaluation comprises seven steps: (1) defining a set of values for each of the five parameters; (2) generating 576 parameter vectors by complete combination of these values; (3) conducting a virtual experiment with 40 “subjects” (simulations) for each of the vectors, thereby (4) generating 200 reaction times in each of nine experimental conditions (see text); (5) feeding the means of these RTs into the parameter estimation procedure; (6) gathering and evaluating the parameter estimates, first for all subjects of one experiment, then averaged using the median; and (7) analyzing them across all experiments with respect to both accuracy (unbiasedness), in terms of the constant error, to assess the validity, and precision (variance and standard deviation), in terms of the variable error, to assess the reliability of the parameter estimation process.
In additional experiments, we tested the consistency of the parameter estimation process, assessed the correlation between the parameters, and determined the degree of robustness of the estimation against contamination of the integration process. Finally, we investigated whether replacing the mean by the median in order to average the generated data (Step 4) would make the recovery more accurate or precise. 
Material and methods
The seven steps (Figure 3) will now be described in greater depth. 
Step 1: Determining the parameter values
For each of the five parameters, three to four parameter values were used for the evaluation process (Table 1). They were selected to cover the entire range of (neurophysiologically) plausible values. 
Table 1

Low-level results. For each parameter, the first line gives the lower and upper boundaries limiting the recovery search process. Below that, the percentage of valid recoveries (boldface) is given next to the percentages of recoveries that terminated in or close to these boundaries.
Parameter Value % lower B % valid % upper B
1/λA Bounds: [5, 250]
20 20.3 79.7 0.0
50 10.1 89.8 0.1
100 4.3 94.7 1.0
150 1.1 92.5 6.4
1/λV Bounds: [5, 250]
20 14.0 86.0 0.0
50 8.4 91.4 0.2
100 4.7 92.8 2.5
150 2.4 87.4 10.2
μ Bounds: [0, ]
50 16.1 83.9 0.0
100 6.8 93.2 0.0
150 1.2 98.8 0.0
200 0.0 100.0 0.0
ω Bounds: [5, 1000]
100 1.0 96.2 2.9
200 0.3 93.6 6.1
300 0.1 80.1 19.8
Δ Bounds: [0, 175]
20 7.0 85.4 7.6
50 1.5 90.8 7.7
100 0.2 88.7 11.1
For both 1/λA and 1/λV, i.e., the parameters for the durations of the auditory and visual sensory stages, we assumed 20, 50, 100, and 150 ms to cover the range of possible values. If one assumes that integration takes place in the cortex and that the responses are triggered there as well (Molholm et al., 2002), then integration should not occur before the primary auditory and visual cortices (in macaque monkeys) are activated, i.e., some 20 ms after the onset of an auditory noise stimulus (Camalier, D'Angelo, Sterbing-D'Angelo, de la Mothe, & Hackett, 2012) and some 30 ms after visual onset (Schmolesky et al., 1998). On the other hand, responses to visual stimuli reach the frontal eye field, i.e., the secondary motor cortex for eye movements, between 60 and 100 ms after stimulus onset. We thus assumed twice that time as a safe candidate for the highest value.
The parameter values for μ, the duration of the secondary (i.e., postintegration or common) stage, were simply obtained by subtracting the 1/λ values mentioned above from a typical simple-reaction time of 200 to 250 ms: 50, 100, 150, and 200 ms. The parameter Δ stands for the temporal gain, i.e., the reaction time reduction, resulting from the integration of the multimodal information. Here, we took 20 and 50 ms to be reasonable values and added 100 ms as a safe maximum gain. Note that, for simplicity, we do not consider the case of inhibition, i.e., negative values of Δ. 
Parameter ω represents the width of the time window, i.e., the time interval during which integration may occur. In neurophysiological terms, this may be the duration for which the membrane potential of the integrating cell remains excited. Duration estimates obtained with modern whole-cell in vivo recording methods vary greatly, between very short, e.g., 2 ms for cells in the avian cochlea (Reyes, Rubel, & Spain, 1994), and long, e.g., 50–140 ms in the hippocampus (Andersen, Raastad, & Storm, 1990). In contrast, cells in area MT (V5) integrate motion information across more than 200 ms (Bair & Movshon, 2004). To cover the whole range, we utilized values of 100, 200, and 300 ms.
Step 2: Setting up the parameter vectors
The parameter values defined in the first step were systematically combined into 4 × 4 × 4 × 3 × 3 = 576 different vectors. Among them were plausible vectors such as (20, 50, 150, 200, 50), a vector that was identified in an earlier study for reaction time data using the same FAP paradigm (Diederich et al., 2008); rather odd ones, which represent gross imbalances between the values and the model parts they stand for, e.g., (20, 100, 50, 300, 150); and finally clearly unrealistic vectors such as (20, 20, 50, 200, 100), which would on average result in negative RTs if integration occurred (mean RT = 1/λV + μ − Δ = 20 + 50 − 100 = −30 ms). All of these vectors were tested and taken into the evaluation, irrespective of their plausibility.5
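In code, this step amounts to a full factorial crossing of the value sets from Table 1; a sketch (variable names are ours):

```python
from itertools import product

inv_lam_a  = [20, 50, 100, 150]   # 1/lambda_A (ms)
inv_lam_v  = [20, 50, 100, 150]   # 1/lambda_V (ms)
mu_vals    = [50, 100, 150, 200]  # second-stage mean (ms)
omega_vals = [100, 200, 300]      # window width (ms)
delta_vals = [20, 50, 100]        # integration effect (ms)

vectors = list(product(inv_lam_a, inv_lam_v, mu_vals, omega_vals, delta_vals))
assert len(vectors) == 4 * 4 * 4 * 3 * 3 == 576  # every vector enters the evaluation
```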
Step 3: Conducting 576 single experiments
For each of these 576 vectors, a separate virtual experiment was conducted using 40 virtual subjects (simulations). 
Step 4: Generating reaction times for the subjects
For each subject in a given experiment, RTs were generated in nine conditions with 200 trials each. The nine conditions comprised the unimodal visual condition, in which only the visual stimulus was presented (uniV), and seven bimodal conditions, in which the visual stimulus was presented at t = 0 and the acoustic stimulus either preceded it, was presented in synchrony with it, or lagged behind it, at SOAs spanning −200 to +200 ms. In all these conditions, a real subject would be instructed to react to the onset of the visual stimulus only (focused attention paradigm). As an additional ninth condition, we implemented a unimodal auditory condition (uniA), in which the subject was presented only with an acoustic stimulus and had to react to it.
For each of the trials, pseudorandom numbers were generated for the first and second stage based on the distributions and the specific values of the respective parameters. As described in the Introduction, A and V are random variables with exponential distributions (cf. Equation 1), while M is a random variable with a normal distribution N(μ, σ²). As discussed above, the parameter σ was not recovered during the process and is therefore not part of this evaluation study.
The random realizations A, V, and M were then combined into an observable reaction time following the logic of the model:

RTuniV = V + M, RTuniA = A + M, and RTVA = V + M − Δ if A + τ < V < A + τ + ω (and RTVA = V + M otherwise),

with τ being the SOA value in the given condition. Reaction times were averaged across all 200 trials in each condition.
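A minimal sketch of this trial-generation logic under the stated distributional assumptions (a Python stand-in for the authors' MATLAB code; the seven-point SOA grid shown is illustrative, chosen to span the −200 to +200 ms range described above):

```python
import numpy as np

rng = np.random.default_rng(7)

def simulate_condition(inv_lam_a, inv_lam_v, mu, omega, delta,
                       tau=None, sigma=25.0, n_trials=200):
    """RTs for one condition; tau=None gives the unimodal visual condition."""
    V = rng.exponential(inv_lam_v, n_trials)            # first-stage visual times
    M = rng.normal(mu, sigma, n_trials)                 # second-stage times, N(mu, sigma^2)
    if tau is None:
        return V + M                                    # uniV: no integration possible
    A = rng.exponential(inv_lam_a, n_trials)            # first-stage auditory times
    integrated = (A + tau < V) & (V < A + tau + omega)  # event I_tau,omega per trial
    return V + M - delta * integrated                   # RT reduced by delta when I occurs

# one virtual subject with a plausible vector; illustrative seven-point SOA grid
params = (20, 50, 100, 200, 50)
soas = [-200, -100, -50, 0, 50, 100, 200]
cond_means = {"uniV": simulate_condition(*params).mean()}
cond_means.update({tau: simulate_condition(*params, tau=tau).mean() for tau in soas})
```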
Step 5: Estimating the parameter values
Mean reaction times for each of the nine conditions were handed over to the parameter estimation procedure, which was programmed to find the best-fitting estimates for the five parameter values. The objective was to minimize the objective function (Equation 6), i.e., the sum of the squared z-standardized differences between generated and predicted mean reaction times across all uni- and bimodal conditions,

F = Σi [(RT̄i,generated − RT̄i,predicted) / si]²,   (Equation 6)

where i runs over the nine conditions and si denotes the standard deviation used for the standardization.
The range of possible fits was limited by the individual boundaries for the five parameters listed in the top line for each parameter in Table 1. The search process started at random positions, set individually anywhere between the lower and the upper boundary (for μ, between 5 and 300 ms).
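The following sketch illustrates one plausible implementation of this estimation step. We use SciPy's Nelder-Mead routine as the analogue of MATLAB's fminsearch; because Nelder-Mead is unconstrained, the Table 1 bounds are enforced here with a simple penalty term. The penalty-based bound handling, the exact form of the z-standardization, and the upper bound of 400 ms for μ (truncated in Table 1) are our assumptions:

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(42)
A0 = rng.exponential(1.0, 100_000)   # unit-mean draws, fixed across evaluations
V0 = rng.exponential(1.0, 100_000)   # (common random numbers keep the objective deterministic)

SOAS = [-200, -100, -50, 0, 50, 100, 200]                     # illustrative SOA grid
BOUNDS = [(5, 250), (5, 250), (0, 400), (5, 1000), (0, 175)]  # per Table 1; mu upper bound assumed

def predicted_means(theta):
    inv_la, inv_lv, mu, omega, delta = theta
    A, V = inv_la * A0, inv_lv * V0                     # rescale to candidate 1/lambda values
    pred = {"uniV": inv_lv + mu, "uniA": inv_la + mu}
    for tau in SOAS:
        p = np.mean((A + tau < V) & (V < A + tau + omega))  # P(I_tau,omega)
        pred[tau] = inv_lv + mu - delta * p             # Equation 2
    return pred

def objective(theta, observed, sd):
    """Sum of squared z-standardized deviations (Equation 6) plus a bound penalty."""
    pen = sum(max(lo - t, 0.0) + max(t - hi, 0.0) for t, (lo, hi) in zip(theta, BOUNDS))
    pred = predicted_means(theta)
    return sum(((observed[k] - pred[k]) / sd[k]) ** 2 for k in observed) + 1e3 * pen

# usage, given dicts of observed condition means and standard deviations (same keys):
# fit = minimize(objective, x0=[50, 50, 150, 300, 60], args=(obs_means, obs_sds),
#                method="Nelder-Mead")
```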
Step 6: Evaluating the results on the experiment level
In all experiments and in all subjects, the average difference between generated and predicted mean RTs was about 6 ms, indicating that the process could find some vector that explained the data sufficiently well (cf. Figure 4). 
Figure 4

Evaluation on the subject level (A & B) and group level (C & D). (A) Parameters were estimated by minimizing the objective function defined in Equation 6. For this sample subject in the experiment with the plausible vector, the difference between generated and predicted mean RTs is less than 2 ms per condition. (B) Across all experiments and subjects, the mean difference was 5.95 ms (±3.64) per condition. (C) Recovered parameters compared to the original values (here shown for 500 pretest subjects in the experiment with the plausible vector). Most individual parameter estimates lie within a small range of ±15 ms around the original value, indicating that the process recovers the parameter values with very high accuracy and good precision. The number of statistical outliers (dots outside the whiskers) is within the normal range (<3%). (D) Plots indicate the relative proportions of subjects whose parameter recoveries resulted in veridical estimates (black bars) and those for whom the process terminated with estimates at or near the lower or upper boundary (gray bars to the left and right of the black bar, respectively). The analysis is given separately for each of the parameters and their values. Slim black lines indicate the respective boundaries.
Step 7: Assessing accuracy and precision on the study level
These estimate differences were gathered across the 40 subjects and 576 experiments and evaluated in terms of accuracy and precision. This evaluation was performed separately for each parameter (cf. Figure 5), and within each parameter both across all parameter values (leftmost insets) and for each parameter value individually (right insets). Accuracy refers here to the absolute distance between the original parameter value and the average parameter estimate. High accuracy is indicated by a small distance, and thus a small constant error, and in turn indicates high criterion validity. Precision, on the other hand, measures the narrowness of the estimates' distribution. High precision comes with a small variable error (typically the standard deviation or variance) and in turn indicates high reliability. High reliability is the prerequisite for high validity, i.e., accuracy can only be evaluated if the variable error is sufficiently small.
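In code, the two error measures might be computed as follows (a sketch; estimates stands for the per-subject estimates of one parameter in one experiment):

```python
import numpy as np

def constant_error(estimates, true_value):
    """Accuracy: median deviation of the estimates from the true parameter value."""
    return np.median(estimates) - true_value

def variable_error(estimates):
    """Precision: median absolute deviation (MAD) around the median estimate."""
    return np.median(np.abs(estimates - np.median(estimates)))
```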
Figure 5

Main results. Histograms show accuracy and precision for the parameter estimates. For each parameter (A)–(E), the panel on the far left shows the histogram for the parameter aggregated across all values, whereas the panels to the right show histograms for each parameter value individually. X axes show deviations from the original value (in milliseconds), y axes the percentage. Accuracy refers to a (small) shift of the distribution mean away from zero, while precision corresponds to a (small) width of the distribution. Histograms are computed across the median estimates of each experiment and thus comprise 576 data points (left panels) or the respective fraction of them, 144 or 192 (right panels).
All simulations and model fittings were run in MATLAB using the function fminsearch (The MathWorks, Inc., Natick, MA, USA). All statistics were computed using Mathematica (Wolfram Research, Inc., Champaign, IL, USA).
Results
Recovery results on subject, experiment, and study level
As mentioned above, results in this study are described on three levels: the subject level (n = 1 subject), the experiment level (n = 40 subjects forming a group), and the study level (across all 576 experiments and thus n = 23,040 subjects). While the main questions of this evaluation concern the study level, good and reliable results on the subject and experiment levels are a necessary precondition for the analysis of the main results presented further below.
Results on the subject level are shown in Figure 4 (Panels A and B). Mean RTs generated for each subject were used to estimate the parameters for the subject. Panel A shows the means of the generated reaction times for one person for the seven bimodal and the two unimodal conditions (gray symbols, n = 200 trials per condition) along with the RTs predicted by the model based on the estimated parameters. The differences between these reaction times were fairly small, with approximately 1.9 ms per condition for the plausible parameter vector. As depicted in Panel B, across all n = 40 subjects in all n = 576 experiments, this prediction error had a mean of 5.95 ms (SD = 3.64) per condition. 
Panel C shows the results on the experiment level. Evaluation on this level comprised analyzing the results of all subjects participating in a single experiment (note that, for the sake of clarity, this figure shows a result from a pretest in which 500 subjects were used rather than the usual 40). The panel shows the estimate statistics of the five parameters for one sample experiment (plausible parameters). About one half of the estimates met the original parameter value with an error of less than about 15 ms (box in the boxplots), whereas the other half deviated from the original value by more than ≈15 ms and up to some 50 ms (whiskers in the boxplots). Only a minor fraction (1%–2%) of the estimates deviated by more than 50 ms.
However, outliers are not the only possible issue. The recovery could also yield invalid estimates if the process terminated in or near the limiting boundaries. This can best be seen in the leftmost boxplot in Figure 4C, in which the lower whisker ends exactly at 5 ms, i.e., the lower boundary. This means that a significant proportion of the estimates in these experiments is invalid. Cumulative results across the recoveries for all subjects in all experiments are given in the bar plots of Figure 4D and in Table 1. Black bars indicate the percentage of valid estimates for the given parameter and parameter value, while the gray bars to their left or right, where present, give the percentage of recoveries that terminated at the lower or upper boundary, respectively. (Boundaries are indicated in that figure by the small black stick-like bars at the ends of the scale.) Results show that while more than 90% of the estimates are valid, some 10% are invalid, with a higher incidence of the latter for very low parameter values (for 1/λA, 1/λV, μ, and Δ), which are by themselves very close to the boundary and thus endangered from the outset, or for very high values (for ω).
For the study level, individual parameter estimates were averaged, using the median, across the 40 subjects of an experiment into group estimates. All of these group estimates stood clear of the boundaries, and thus all further assessments (and the graphs shown in Figure 5) rely on the full set of 576 estimates. Results for the evaluation of the recovery are shown in Figure 5 and Table 2. Graphs in Panels A to E show histograms for the five parameters; while the leftmost graphs show histograms aggregated across all parameter values, the others (to the right) show them for each parameter value individually. To ease comparison, histograms are plotted against the distance of the recovered parameter from the original value.
Table 2

Main results of the study. Accuracy and precision of the recovered parameters are given for each parameter separately: first across all parameter values (All), then for each parameter value individually. Accuracy (unbiasedness) refers to a preferably small constant error, i.e., the difference from the original parameter value (median or mean), whereas high precision is indicated by a small variable error (median absolute deviation [MAD] or standard deviation [SD]).
Parameter Value Median Diffs MAD Mean Diffs SD
1/λA All −0.25 4.50 −1.94 12.10
20 0.75 2.85 1.85 6.87
50 −0.52 3.40 −1.19 10.5
100 −2.25 5.65 −4.81 13.35
150 −0.75 5.85 −3.59 15.07
1/λV All −0.73 4.03 −2.77 12.61
20 −1.03 2.03 −1.34 4.33
50 −1.95 4.20 −4.21 10.27
100 1.00 5.43 −1.02 15.04
150 0.00 6.45 −4.55 16.70
μ All 1.35 3.58 3.20 11.97
50 1.20 3.68 3.49 14.26
100 1.48 3.30 3.18 12.13
150 1.05 3.78 2.79 9.69
200 1.70 3.93 3.33 11.49
ω All −2.82 8.05 2.26 51.54
100 −1.50 2.95 −4.32 9.08
200 −4.05 5.65 −9.71 23.59
300 −6.75 34.68 20.73 82.48
Δ All 3.65 3.60 6.71 9.25
20 4.70 3.20 7.62 8.07
50 2.78 3.18 6.41 10.05
100 2.75 3.65 6.09 9.48
For the parameters 1/λA, 1/λV, and μ, the histograms are ubiquitously rather narrowly distributed around the expected value (zero on the x axis). Medians (constant errors) of the group estimates for these first three parameters (1/λA, 1/λV, μ) range between −2.5 and +2.5 ms for individual parameter values (means are between −5 and +5 ms) and level out across all parameter values to about half that size, exhibiting a very high accuracy of the estimates. Median absolute deviations (MAD) are lower than 7 ms (SD ≤ 16 ms), speaking to a high precision. Given the small absolute distances and the three-times-higher deviation measures, the estimates have to be classified as unbiased.
For parameter ω, the same assessment (narrow distribution, unbiasedness) holds true for the first two parameter values of 100 and 200 ms. Although the medians are negative, they are close to zero (medians > −5 ms; means > −10 ms), while the median absolute deviation is again lower than 7 ms (SD < 24 ms). In contrast, for ω = 300 ms, the constant error is higher (median = −13 ms, mean = −18 ms) and the distribution is much wider (MAD = 35 ms, SD = 82 ms) and shallower than in all other cases. 
Estimates for the parameter Δ (Figure 5, Panel E) show, at first glance, a stable overestimation (85% of the estimates are positive), which might suggest a bias. However, the constant (median) error is less than 5 ms (mean < 8 ms), and the variable error is comparable to (or even smaller than) the ones found for the first three parameters, with MADs and standard deviations of approximately 3–4 and 8–10 ms, respectively.
Further evaluation results
Consistency
As sample size per experimental condition increases, one would expect that the parameter estimate based on the sample will improve. In statistical theory, an estimator is called consistent if it converges to the true (population) value when sample size becomes infinitely large. Note that a consistent estimator may be biased and an unbiased estimator may not be consistent. For many estimators, like maximum likelihood estimators, consistency holds under rather general conditions. However, the parameter estimation method utilized here does not guarantee consistency of the estimators. On the other hand, consistency follows if both the bias of the estimator and its variance converge to zero for sample size approaching infinity (Lehmann, 1983). Therefore, we tested the accuracy and precision behavior of our estimates with increasing sample size. 
We used the standard plausible parameter vector (1/λA, 1/λV, μ, ω, Δ) = (20, 50, 100, 200, 50), with the standard number of 40 subjects and with the number of trials per condition (sample size) increasing from 40 to 200 in steps of 20 and from 200 to 1,000 in steps of 100 trials. Results in Figure 6 show that the bias of the median decreases with the square root of the sample size (black symbols) and that the same holds true for the median absolute deviation (gray symbols) for all five parameters. This strongly suggests that the parameter estimation procedure produces consistent estimators.
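The expected 1/√n behavior can be illustrated on a single model component, e.g., the sample-mean estimator of a first-stage duration 1/λ (a toy check of the scaling argument, not the full recovery pipeline):

```python
import numpy as np

rng = np.random.default_rng(3)
true_mean = 50.0                      # a first-stage duration 1/lambda in ms

for n in (40, 100, 200, 400, 1000):
    est = rng.exponential(true_mean, size=(10_000, n)).mean(axis=1)
    mad = np.median(np.abs(est - np.median(est)))
    print(f"n = {n:5d}   MAD = {mad:5.2f} ms   MAD * sqrt(n) = {mad * np.sqrt(n):5.1f}")
# MAD * sqrt(n) stays roughly constant -- the signature of 1/sqrt(n) convergence
```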
Figure 6

Consistency of the parameter estimation process. Consistency was tested using the plausible parameter vector (20, 50, 150, 200, 50), 40 virtual subjects, and an increasing number of trials (sample size) per condition. The expectation was that both the constant error (mean and median) and the variable error (standard deviation and median absolute deviation) decrease with the sample size. Results for both error measures show such a pattern, suggesting consistency of the estimators. Median differences are shown in black and median absolute deviations in gray.
Correlation between parameter estimates
A potential problem in recovering parameters for a model is that the value of the estimate for one parameter may be significantly correlated with the value of another (Ratcliff & Tuerlinckx, 2002). In trying to optimize the fit, the fitting procedure may trade off the value of one parameter against the value of another. In the TWIN model, one would expect this to happen to some degree between the λ parameters of the first stage and the mean μ of the second-stage processing. This issue becomes especially critical when one tries to interpret differences among parameter values across experimental conditions. For example, increasing stimulus intensity should be reflected in differences in the λ parameters but not in Δ because, according to TWIN assumptions, intensity affects first-stage processing and integration probability, but not the amount of facilitation or inhibition in the second stage.
Parameter correlation was tested again using the standard plausible parameter vector (20, 50, 100, 200, 50) with 40 subjects and 200 trials per condition. Parameter estimates were scaled relative to the original parameter value. The resulting differences, thus representing over- and underestimates, were then correlated pairwise for all parameters. Scatterplots in Figure 7 show the correlations between pairs of parameters along with the correlation coefficients. There is a very high correlation of r > 0.98 between 1/λA and 1/λV, and comparably high but negative correlations between 1/λA and μ and between 1/λV and μ; further, a strong positive correlation between the estimates for the first two parameters and Δ, with r > 0.88, and a comparably high but negative correlation between the estimates for μ and Δ, with r = −0.87; and finally a moderate but significant negative correlation between ω and Δ (r = −0.38). Correlations between ω and the first three parameters were not significant. While some of these correlations reflect expected trade-off effects of the estimation procedure, they also point to some caveats for interpreting parameter differences between experimental conditions.
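Given a matrix of per-subject estimate deviations (one row per subject, one column per parameter), the pairwise correlations reported above can be computed directly; a sketch:

```python
import numpy as np

def estimate_correlations(deviations):
    """Pairwise Pearson correlations between parameter-estimate deviations.

    deviations: array of shape (n_subjects, 5), estimate minus true value,
    for (1/lambda_A, 1/lambda_V, mu, omega, delta)."""
    return np.corrcoef(deviations, rowvar=False)   # 5 x 5 correlation matrix
```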
Figure 7
 
Correlations between the recovered parameter values of the experiment with the plausible parameter vector (20, 50, 100, 200, 50). Graphs show pairwise scatter plots along with the correlation coefficient. Axes show absolute deviations of the individual estimates from the original parameter value.
Robustness against contamination
Contaminated reaction times come from some process other than the one being studied. For example, the subject may occasionally anticipate the stimulus rather than processing it, or there may be some distraction from the task by various sources not under the control of the experimenter. A number of general methods, e.g., the truncation of data sets, have been considered to deal with such spurious RTs (Ulrich & Miller, 1994). Here we limit the discussion to the case where, in a certain percentage of trials, subjects violate the integration mechanism postulated in TWIN by exhibiting the RT reduction of Δ milliseconds when the condition for integration was not satisfied, or vice versa. Robustness of the parameter estimation procedure, that is, the accuracy and precision of the parameter recovery, was tested by systematically increasing the proportion of such faulty integration behavior. In general, i.e., in the unbiased case (question a below), data were generated such that integration and the associated reduction in reaction time took place in only (100 − p)% of the due cases, whereas RT was not reduced in the other p%. At the same time, reaction times were generated incorporating the reduction in p% of the cases for which integration should not have taken place. In six experiments, the percentage p was systematically increased from 0% to 25% in steps of 5%, using the same plausible parameter vector (20, 50, 150, 200, 50) as before. Generated RTs were averaged and presented to the parameter estimation procedure.
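A sketch of how such unbiased contamination can be generated (names are ours): with probability p the TWIN integration rule is simply inverted on a trial, so that due integrations are omitted and undue ones are committed.

```python
import numpy as np

rng = np.random.default_rng(11)

def contaminated_rts(inv_lam_a, inv_lam_v, mu, omega, delta, tau,
                     p=0.10, sigma=25.0, n_trials=200):
    """Bimodal RTs in which the integration rule is violated with probability p."""
    V = rng.exponential(inv_lam_v, n_trials)
    A = rng.exponential(inv_lam_a, n_trials)
    M = rng.normal(mu, sigma, n_trials)
    should = (A + tau < V) & (V < A + tau + omega)  # TWIN condition for integration
    flip = rng.random(n_trials) < p                 # trials with faulty behavior
    integrated = should ^ flip                      # XOR inverts the rule on flipped trials
    return V + M - delta * integrated
```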
We assessed the robustness of the parameter estimation with respect to four questions: How robust is the process against (a) unbiased and (b) biased contamination of the integration? (c) Is the robustness additionally affected by the width of the temporal window (ω)? And (d) can the robustness of the parameter estimation be increased by using the median instead of the mean when averaging the RT data?
Results are shown in the panels of Figure 8. Black lines represent the constant error (accuracy), in terms of the difference between the median of the parameter estimates across the 40 subjects and the original value, whereas gray lines represent the variable error (precision), i.e., the median absolute deviation. Generally, parameter estimates become less accurate and less precise with an increasing percentage of contaminated trials. We defined a difference of 20 ms from the original parameter value as a threshold for all parameters (i.e., one standard deviation in the unbiased case in Figure 8A at 0% contamination) in order to compare the multiple conditions tested here with one another.
Figure 8

Robustness of the parameter recovery against contamination. For the plausible parameter vector (20, 50, 150, 200, 50), graphs show the dependence of the parameter estimates on the proportion p of contaminated response generations, i.e., trials in which subjects integrate although they should not, or fail to integrate although they should. Graphs show the constant error (median difference from the original parameter value; black lines) and the variable error (MAD; gray lines). (A) Robustness is higher (smaller errors) if subjects tend to omit integration (left panel) and lower (larger errors) if they tend to commit integration (right panel), as compared with unbiased contamination (central panel). (B) The value of ω neither substantially supports nor undermines robustness. (C) Parameter estimation becomes much more robust if one uses the median rather than the mean for averaging single RT data before presenting them to the estimation procedure.
Data for question (a) show (central panel in Figure 8A) that the process is generally very finely tuned and highly sensitive to even low contamination of the RT data. Approximately 10%–15% contamination sufficed for the mean group estimates to reach the threshold. The pattern of estimation error, i.e., whether a parameter is over- or underestimated, is clear and consistent even for small amounts of contamination: 1/λA, 1/λV, and Δ become overestimated, whereas μ and ω become underestimated. However, while accuracy worsens with contamination, the variable error (gray lines), and thus the precision, remains stable across a wide range of contamination, meaning that the error in parameter estimation is quite systematic.
With respect to question (b), results show that contamination can have opposing effects, depending on the kind of bias the subjects have (left and right panels in Figure 8A). If only integration trials are affected by the contamination, then the subject exhibits a bias toward nonintegration. As a consequence, contamination does not seem to have any early effect on the estimates (left panel). Only when contamination exceeds 15% does Δ become underestimated, which, however, is a consequence of the decreasing average RT gain. In contrast, if only the nonintegration part is contaminated, subjects have a bias toward integration. In this case the errors seen in the central panel increase by a factor of two, and consequently the thresholds are reduced by a factor of two for all parameters.
In question (c) we investigated whether the ω value had any effect on the robustness, since we had observed some gross under- and overestimates for ω in the main analyses. We therefore assessed robustness for three values of ω. While we fixed 1/λA, 1/λV, μ, and Δ to the values used before (standard vector), we increased ω from 100 via 200 (the standard value) to 300 ms. Results (Figure 8B) for the first three parameters are rather constant across the three panels, and the thresholds do not differ. The only effect the increase seems to have is a systematic decrease of the threshold for ω, associated with an earlier underestimation of ω. The effect resembles the pattern observed for the ω estimates in Figure 5D. However, while this is true for the deviations in absolute measures, the relative deviation from the original parameter value becomes smaller and smaller with increasing ω. These effects are paralleled by a general overestimation of Δ, which is likewise reduced with increasing ω.
Question (d) investigated the possibility of making the parameter estimation more robust against contamination. Because subjects tend to be inattentive to the task or to anticipate the response in a certain proportion of trials, this contamination, and especially biased contamination, constitutes a real rather than just a theoretical problem. One way of diminishing the impact of these spurious (too long or too short) reaction times is to use the median rather than the mean to average the 200 single RTs into the condition averages (Step 4 in the overview shown in Figure 3). This modification diminishes the constant error by a factor of four and increases the threshold by a factor of two, and thus makes the process much more robust against contamination.
Discussion
Recovery results
This study evaluated the parameter estimation method in the time-window-of-integration (TWIN) model of multisensory integration (Colonius & Diederich, 2004) for its ability to recover the underlying true parameter values. To this end, different values for the five model parameters (1/λA, 1/λV, μ, ω, Δ) were systematically combined into a total of 576 different parameter sets (vectors of length five), for each of which a virtual experiment was conducted involving 40 subjects producing 200 responses per experimental condition. Data were generated by sampling from the probability distributions postulated in the model, best-fitting parameter values were obtained by minimizing an objective function defined for the model, and the resulting distributions of parameter estimates were evaluated with respect to accuracy (constant error) and precision (variable error).
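For concreteness, the generation step can be sketched as follows (Python; the function and its interface are ours, and the normally distributed second stage with SD sigma is an illustrative assumption, since the mean-level equations used here constrain only the second-stage mean μ):

```python
import numpy as np

def simulate_twin(inv_lam_a, inv_lam_v, mu, omega, delta,
                  tau, n_trials=200, sigma=20.0, seed=0):
    """Generate reaction times for one bimodal FAP condition under TWIN.

    inv_lam_a, inv_lam_v : mean peripheral times 1/lambda_A, 1/lambda_V (ms)
    mu    : mean duration of the second, central stage (ms)
    omega : width of the integration window (ms)
    delta : second-stage facilitation when integration occurs (ms)
    tau   : SOA of the acoustic nontarget relative to the visual target (ms)
    sigma : SD of the second stage (an assumption made for this sketch)
    """
    rng = np.random.default_rng(seed)
    A = rng.exponential(inv_lam_a, n_trials)   # acoustic first stage
    V = rng.exponential(inv_lam_v, n_trials)   # visual first stage
    # TWIN rule: integrate iff the visual target finishes its first stage
    # while the window opened by the acoustic nontarget is still open
    integrate = (tau + A < V) & (V < tau + A + omega)
    second = rng.normal(mu, sigma, n_trials) - delta * integrate
    return V + second                          # RT = first + second stage

# plausible vector (1/lambda_A, 1/lambda_V, mu, omega, Delta) at tau = -100 ms
rts = simulate_twin(20, 50, 150, 200, 50, tau=-100)
```

The estimation step then minimizes the objective function (Equation 6) over the five parameters; that code is not reproduced here.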
Our data reveal that the model parameters are recovered with very high accuracy, i.e., with low constant errors (median differences from the original value) of less than 5 ms, and with very high precision, i.e., low variable errors (median absolute deviations) of less than 7–8 ms, for practically all parameters and their values. It should be stressed that the range of parameter vectors was very large, comprising neurophysiologically plausible vectors, such as (20, 50, 100, 200, 50), as well as odd ones like (20, 100, 50, 300, 150), which represent gross imbalances among the values and thus among the model parts they stand for, and even unrealistic (forbidden) vectors, such as (20, 20, 50, 200, 100), which would, if integration occurred, lead to negative reaction times.
The only exception to these unambiguously good results was observed for the highest value of parameter ω. Further investigation revealed that the distribution of estimates for ω = 300 ms can be divided into two subpopulations (Figure 9A). For 1/λA < 1/λV, the distribution of ω estimates is rather narrow (MAD = 11 ms). In contrast, for the complementary case, 1/λA ≥ 1/λV (black bars), the estimates have a wide distribution (MAD = 43 ms).
Figure 9
 
Further analysis for ω = 300 ms. (A) Further analysis of the main results revealed that the relationship between 1/λA and 1/λV divides the estimates for ω into two subpopulations: for 1/λA < 1/λV (gray) a narrow distribution results, whereas the spread is much larger for 1/λA ≥ 1/λV (black). (B) This can be explained by the relationship between ω and the probability of integration. Graphs show the probability of integration for 1/λA < 1/λV (left) and 1/λA ≥ 1/λV (right), and for ω values of 100 (light gray), 200, 300, … , 700 ms (black). While in the first case (left panel) the integration probability curves differ clearly across all ω values, in the second case (right panel) the curves more or less collapse for ω values of 300 ms and higher. This relative insensitivity of P(I) with respect to the exact ω value at larger window widths makes the determination of optimal ω values less precise for the subpopulation defined by 1/λA ≥ 1/λV.
This odd but systematic pattern can be understood by inspecting the dependence of the integration probability P(I) on the intensity parameters λA and λV across the SOA range (cf. the equations in the Appendix). The corresponding graphs in Figure 9B depict the two opposite cases relating to the two subpopulations defined by 1/λA < 1/λV (left-hand graph) and 1/λA ≥ 1/λV (right-hand graph). Each graph presents a family of ordered probability-of-integration curves corresponding to seven different values of ω, from ω = 100 ms (lightest gray) to ω = 700 ms (darkest gray). Obviously, for ω values of 200 ms and above the curves on the right (where 1/λA ≥ 1/λV) collapse for most SOA values, whereas for the other subpopulation the probability-of-integration curves still differ for different values of ω.
As a consequence, because the objective function controlling the estimation procedure directly depends on the probability of integration (more precisely, on the term P(I) × Δ), the relative insensitivity of P(I) with respect to the exact ω value at larger window widths makes the determination of optimal ω values less precise for the subpopulation defined by 1/λA ≥ 1/λV.
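This insensitivity is easy to reproduce numerically. In the following sketch (Python; illustrative parameter values, and a Monte Carlo estimate rather than the closed-form expression from the Appendix), P(I) saturates for large ω when 1/λA ≥ 1/λV but keeps growing with ω when 1/λA < 1/λV:

```python
import numpy as np

def p_integrate(inv_lam_a, inv_lam_v, omega, tau, n=200_000, seed=2):
    """Monte Carlo estimate of P(I) = P(A + tau < V < A + tau + omega)."""
    rng = np.random.default_rng(seed)
    A = rng.exponential(inv_lam_a, n)   # acoustic peripheral time
    V = rng.exponential(inv_lam_v, n)   # visual peripheral time
    return np.mean((tau + A < V) & (V < tau + A + omega))

# 1/lambda_A >= 1/lambda_V: P(I) saturates, so the curves collapse
for omega in (100, 200, 300, 500, 700):
    print(omega, round(p_integrate(100, 20, omega, tau=0), 3))

# 1/lambda_A < 1/lambda_V: the window's upper edge still matters
for omega in (100, 200, 300, 500, 700):
    print(omega, round(p_integrate(20, 100, omega, tau=0), 3))
```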
Consistency, correlation, and robustness of estimates
In additional experiments, we investigated the consistency of the estimates, the correlations between recovered estimates, and the robustness/sensitivity of the recovery against contamination; we also compared the use of the mean versus the median for data averaging at the different levels of the parameter estimation process. 
Consistency
The parameter estimates provided by the estimation process become more accurate and precise with an increasing number of trials per condition, and thus with an increasingly reliable average reaction time for each condition. Both the constant error (difference of the mean from the original parameter) and the variable error (standard deviation) decrease with the number of trials, suggesting consistency of our estimation procedure. 
Correlation
The high negative correlation between the first-stage intensity parameters λV and λA and the second-stage parameter μ is not surprising, given the additivity of the two stages at the mean RT level. A similar argument holds for the negative correlation between ω and the (positive) Δ values. These relationships must be kept in mind when interpreting parameter values across different experimental conditions. On the other hand, the absence of correlation between window width ω and parameters λV, λA, and μ is encouraging for testing certain hypotheses. For example, one may probe the inverse-effectiveness hypothesis by comparing experimental conditions with high and low intensity values. If a larger window width is associated with low-intensity λ parameters, and vice versa, a conclusion in favor of that hypothesis is less prone to being an artifact of correlations among the parameter estimates. 
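The additivity argument can be written out. In model notation (cf. footnote 3 and the Appendix), the mean-level equations behind it read

\[
E[\mathrm{RT}_{\mathrm{uni}}] = \frac{1}{\lambda_V} + \mu,
\qquad
E[\mathrm{RT}_{\mathrm{bi}}(\tau)] = \frac{1}{\lambda_V} + \mu - P(I_{\tau,\omega})\,\Delta .
\]

Because the unimodal condition constrains only the sum 1/λV + μ, an overestimate of 1/λV can be traded against an underestimate of μ; likewise, the crossmodal gain constrains the product P(Iτ,ω) × Δ, so a window estimate that inflates P(I) is compensated by a smaller Δ, producing the negative correlation between ω and Δ.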
Robustness
Robustness of the parameter estimation procedure was tested assuming that, in a certain percentage of trials, the rule for integration was violated, i.e., that stimuli not registered within the time window were nevertheless integrated (perhaps despite a strong temporal offset of the stimuli), or that subjects failed to integrate acoustic and visual information that should have been integrated. 
Parameter estimation turned out to be robust as long as subjects do not integrate faultily in more than 10%–15% of the trials (cf. Figure 8A, central panel). However, this holds true only for unbiased subjects who err in both directions with the same probability, i.e., who integrate when they should not and fail to integrate when they should. In contrast, if subjects have a bias towards nonintegration, contamination does not seem to affect the good quality of the parameter estimation (left panel), whereas the effects of contamination are markedly increased for subjects with a bias towards integration; for these subjects, the threshold drops to 5%–9% (right panel). 
Importantly, this sensitivity to contamination can be countered effectively by a minor change in the data pooling process, that is, before the parameter estimation begins. If one averages the single reaction times of a subject using the median rather than the mean, and replaces the standard deviation (SD) by the median absolute deviation (MAD) in the objective function, contamination contributes to a much smaller extent (Figure 8C). For future research, we therefore recommend using the objective function (Equation 6) in this modified version. 
Estimating the point of integration in the processing stream?
Despite the constraints regarding ω, the parameter estimates can be used to calculate the point of integration in the (cortical) processing stream and the gain with a confidence interval of ±33 ms; and, if only one experiment is run, the width of the time window of integration can be estimated with a confidence interval of ±70 ms. Still, as the time from the first activation of a cell in V1 (latency of 45 ms after stimulus onset) to that of the last cell in FEF (latency of 100 ms) is only 55 ms (Schmolesky et al., 1998), our parameter estimates cannot be used to decide upon the localization within the visuo-saccadic processing stream. 
Even if one used the estimates from all 576 experiments to probe localization, with the high accuracy (median of 2.5 ms) and the high precision (median deviation of 7 ms) discussed above, good localization would not be possible. As Figure 10 shows, the median firing latencies of cells in V1, V3, V5 (MT), and FEF, which all lie along the presumed cortical processing stream of a visually induced saccade, differ by a mere 10 ms, while their distributions have a wide standard deviation of another 10 ms and thus overlap to a great extent. Hence, even for an ideal point estimate with 0 ms constant error and 0 ms variable error, no reliable distinction could be made between these stages in the processing stream. However, the estimates obtained here may help decide whether multisensory integration takes place in the brain stem, the thalamus, occipital/parietal cortex, or rather late, in motor cortex. 
Figure 10
 
Precision of the simulation process in comparison to data obtained from single-cell recordings. Gray and light gray lines represent (cumulative) distributions of visually evoked single-cell latencies in the magnocellular LGN, striate area V1, extrastriate areas V3 and MT, and the frontal eye field (FEF), which presumably mark the cortical processing stream of visually induced saccades. Superimposed is the distribution (sim) of the recovered estimates for the parameter value 1/λV = 50 ms. Note that the precision of the estimation results is as high as, or higher than (corresponding to a steeper curve), the precision of the single-cell recording data obtained from awake monkeys.
Conclusion
The time window of integration (TWIN) model, as developed over the last 10 years by Colonius, Diederich, and colleagues, constitutes, on the one hand, a general framework describing how multisensory integration unfolds over time; on the other hand, by adding specific statistical assumptions about the distributions of the various postulated subprocesses, it allows us to predict quantitative changes of multisensory integration effects in reaction time as a function of the specific experimental conditions, such as the spatiotemporal stimulus configuration, the instruction of the participants, or personal variables like age. While the general framework does not involve the issue of parameter estimation, the specific distributional instantiations of the model will only be useful for hypothesis testing if accuracy and precision of the parameter estimates can be assured. Importantly, a satisfactory fit of a model to the data alone does not guarantee that the parameters can be interpreted meaningfully. 
Consider a situation in which one wishes to determine the actual durations of the primary auditory/visual stage and of the second, central stage. Assume, for example, that the estimates for the first-stage parameters (1/λA and 1/λV) are systematically overestimated by 50 ms, while the average second-stage processing time (μ) is systematically underestimated by the same amount. As a consequence, one might falsely localize the point of integration away from early processing areas towards motor areas in the brain. While we did not set a fixed threshold, given a total reaction time of 200 to 250 ms in experiments with real subjects, we considered errors of up to 10 ms acceptable. Precision of the parameter estimates may be even more important. Assume, for example, that one investigates whether the time window is larger (wider) in one condition than in another, or whether the window width measured for a subject differs from the optimal window, with optimality derived from a Bayesian argument (Colonius & Diederich, 2010; Diederich et al., 2008). If the window estimates (ω) for the two conditions are, say, 100 and 150 ms, one needs to know the precision (the reciprocal of the variance) to infer whether this difference is real or just a product of chance. 
To conclude, we have demonstrated that the parameter estimation procedure recovers the parameters of the TWIN model with very high accuracy (unbiasedness) and precision, speaking to a high reliability and criterion validity of the process. The goodness-of-fit procedure provided consistent parameter estimates, without problematic dependencies (correlations) among them. Furthermore, the choice between the mean and the median for averaging (a) single reaction time data in the objective function and (b) parameter estimates of individual subjects into a (grand) group estimate constitutes an adjustment wheel with which the parameter estimation process can be made sensitive, balanced, or robust against faulty integration (contamination), depending on what a given experimental study requires. 
Acknowledgments
We thank Achim Gelessus for help with using Jacobs University's computer cluster (CLAMV). Work on this study was supported by the Deutsche Forschungsgemeinschaft (DFG) grant DI 506-12 (AD and HC) and grant SFB-TRR-31 “Active Listening” (HC). 
Commercial relationships: none. 
Corresponding author: Farid I. Kandil. 
Address: School of Humanities and Social Sciences, Jacobs University Bremen, Bremen, Germany. 
References
Amlôt, R., Walker, R., & Spence, C. (2003). Multimodal visual-somatosensory integration in saccade generation. Neuropsychologia, 41, 1–15.
Andersen, P., Raastad, M., & Storm, J. (1990). Excitatory synaptic integration in hippocampal pyramids and dentate granule cells. Cold Spring Harbor Symposia on Quantitative Biology, 55, 81–86.
Bair, W., & Movshon, J. (2004). Adaptive temporal integration of motion in direction-selective neurons in macaque visual cortex. Journal of Neuroscience, 24(33), 7305–7323.
Bamber, D., & van Santen, J. (2000). How to assess a model's testability and identifiability. Journal of Mathematical Psychology, 44(1), 20–40.
Camalier, C. R., D'Angelo, W. R., Sterbing-D'Angelo, S. J., de la Mothe, L., & Hackett, T. R. (2012). Neural latencies across auditory cortex of macaque support a dorsal stream supramodal timing advantage in primates. Proceedings of the National Academy of Sciences, USA, 109(44), 1–10.
Casella, G., & Berger, R. L. (2002). Statistical inference (Int. student ed.). Pacific Grove, CA: Duxbury Press.
Colonius, H., & Arndt, P. A. (2001). A two stage model for visual-auditory interaction in saccadic latencies. Perception & Psychophysics, 63, 126–147.
Colonius, H., & Diederich, A. (2004). Multisensory interaction in saccadic reaction time: A time-window-of-integration model. Journal of Cognitive Neuroscience, 16(6), 1000–1009.
Colonius, H., & Diederich, A. (2010). The optimal time window of visual-auditory integration: A reaction time analysis. Frontiers in Integrative Neuroscience, 4(11), 1–8. doi:10.3389/fnint.2010.00011
Colonius, H., & Diederich, A. (2011). Computing an optimal time window of audiovisual integration in focused attention tasks: Illustrated by studies on effect of age and prior knowledge. Experimental Brain Research, 212(3), 327–337.
Colonius, H., & Diederich, A. (2012). Focused attention vs. crossmodal signals paradigm: Deriving predictions from the time-window-of-integration model. Frontiers in Integrative Neuroscience, 6(62), 1–10. doi:10.3389/fnint.2012.00062
Colonius, H., & Diederich, A. (2013, August). Response time variability in the time window of integration model. Unpublished manuscript.
Colonius, H., Diederich, A., & Steenken, R. (2009). Time-window-of-integration (TWIN) model for saccadic reaction time: Effect of auditory masker level on visual-auditory spatial interaction in elevation. Brain Topography, 21, 177–184.
Diederich, A., & Colonius, H. (2007a). Modeling spatial effects in visual-tactile reaction time. Perception & Psychophysics, 69(1), 56–67.
Diederich, A., & Colonius, H. (2007b). Why two "distractors" are better than one: Modeling the effect of non-target auditory and tactile stimuli on visual saccadic reaction time. Experimental Brain Research, 179, 43–54.
Diederich, A., & Colonius, H. (2008a). Crossmodal interaction in saccadic reaction time: Separating multisensory from warning effects in the time window of integration model. Experimental Brain Research, 186(1).
Diederich, A., & Colonius, H. (2008b). When a high-intensity "distractor" is better than a low-intensity one: Modeling the effect of an auditory or tactile nontarget stimulus on visual saccadic reaction time. Brain Research, 1242, 219–230.
Diederich, A., & Colonius, H. (2012). Modeling multisensory processes in saccadic responses: Time-window-of-integration model. In M. M. Murray & M. T. Wallace (Eds.), The neural bases of multisensory processes (pp. 253–276). Boca Raton, FL: CRC Press.
Diederich, A., Colonius, H., & Schomburg, A. (2008). Assessing age-related multisensory enhancement with the time-window-of-integration model. Neuropsychologia, 46(10), 2556–2562.
Frens, M. A., Van Opstal, A. J., & Van der Willigen, R. F. (1995). Spatial and temporal factors determine auditory-visual interactions in human saccadic eye movements. Perception & Psychophysics, 57(6), 802–816.
Lehmann, E. L. (1983). Theory of point estimation. New York: John Wiley & Sons.
Luce, R. D. (1986). Response times: Their role in inferring elementary mental organization. New York: Oxford University Press.
Meredith, M. A. (2002). On the neural basis for multisensory convergence: A brief overview. Cognitive Brain Research, 14, 31–40.
Molholm, S., Ritter, W., Murray, M., Javitt, D., Schroeder, C., & Foxe, J. (2002). Multisensory auditory-visual interactions during early sensory processing in humans: A high-density electrical mapping study. Cognitive Brain Research, 14, 115–128.
Ratcliff, R., & Tuerlinckx, F. (2002). Estimating parameters of the diffusion model: Approaches to dealing with contaminant reaction times and parameter variability. Psychonomic Bulletin & Review, 9(3), 438–481.
Reyes, A., Rubel, E., & Spain, W. (1994). Membrane properties underlying the firing of neurons in the avian cochlear nucleus. Journal of Neuroscience, 14(9), 5352–5364.
Schmolesky, M., Wang, Y., Hanes, D., Thompson, K., Leutgeb, S., Schall, J. D., & Leventhal, A. G. (1998). Signal timing across the macaque visual system. Journal of Neurophysiology, 79(6), 3272–3278.
Sumby, W., & Pollack, I. (1954). Visual contributions to speech intelligibility in noise. The Journal of the Acoustical Society of America, 26(2).
Ulrich, R., & Miller, J. (1994). Effects of truncation on reaction time analysis. Journal of Experimental Psychology: General, 123(1), 34–80.
Wallace, M., Wilkinson, L., & Stein, B. (1996). Representation and integration of multiple sensory inputs in primate superior colliculus. Journal of Neurophysiology, 76(2), 1246–1266.
Footnotes
1. For saccadic eye movements, this may correspond to a gradual inhibition of fixation neurons (in superior colliculus) and/or omnipause neurons (in midline pontine brainstem).
2. Some distribution-free model tests have been performed as well; see Diederich and Colonius (2008b).
3. Note that the parameter of the auditory peripheral processing time only enters this equation via P(Iτ,ω). One might argue that termination of first-stage processing should be determined by the minimum of V and A. This is a plausible model version, but experimental tests did not find strong support for it (Colonius & Arndt, 2001).
4. However, Colonius and Diederich (2013) developed a first analysis of RT variability in TWIN.
5. Including implausible and unrealistic parameter values presents an additional stress test of the model. While a high degree of flexibility of a model to accommodate even implausible parameter combinations may be considered a drawback, it should be realized that the model does not know about their implausibility and, as long as plausible parameters are recoverable, there is no reason to be worried.
Appendix
Probability of integration as function of SOA and window width
The peripheral processing times V for the visual and A for the acoustic stimulus have an exponential distribution with parameters λV and λA, respectively. Thus,

fV(t) = λV exp(−λV t)  and  fA(t) = λA exp(−λA t)

for t ≥ 0, and fV(t) = fA(t) ≡ 0 for t < 0. The corresponding distribution functions are referred to as FV(t) and FA(t), respectively. The visual stimulus is the target and the acoustic stimulus is the nontarget. By the model,

P(Iτ,ω) = P(A + τ < V < A + τ + ω) = ∫₀^∞ fA(a) [FV(a + τ + ω) − FV(a + τ)] da,

where τ denotes the SOA value and ω is the width of the integration window. Computing the integral expression requires that we distinguish between three cases for the sign of τ + ω. 
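As a numerical cross-check of the expression above (a sketch, not the fitting code used in the study; the function name is ours), the integral can be evaluated directly. The case distinction on the sign of τ + ω is handled automatically here, because the exponential distribution function vanishes for negative arguments:

```python
import numpy as np
from scipy import integrate, stats

def p_int(lam_a, lam_v, tau, omega):
    """P(I) = P(A + tau < V < A + tau + omega) for exponential A and V."""
    f_a = stats.expon(scale=1.0 / lam_a).pdf   # density of A
    F_v = stats.expon(scale=1.0 / lam_v).cdf   # distribution function of V
    val, _ = integrate.quad(
        lambda a: f_a(a) * (F_v(a + tau + omega) - F_v(a + tau)),
        0.0, np.inf)
    return val

# e.g., 1/lambda_A = 20 ms, 1/lambda_V = 50 ms, tau = -100 ms, omega = 200 ms
print(p_int(lam_a=1/20, lam_v=1/50, tau=-100, omega=200))
```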
Table 1
Low-level results. For each parameter, the first line gives the lower and upper boundary limiting the recovery search process. Below that, the percentage of valid recoveries is given next to the percentages of recoveries that terminated in or close to these boundaries.
Parameter   Value   % lower B   % valid   % upper B
1/λA    Bounds: [5, 250]
             20      20.3        79.7       0.0
             50      10.1        89.8       0.1
            100       4.3        94.7       1.0
            150       1.1        92.5       6.4
1/λV    Bounds: [5, 250]
             20      14.0        86.0       0.0
             50       8.4        91.4       0.2
            100       4.7        92.8       2.5
            150       2.4        87.4      10.2
μ       Bounds: [0, ]
             50      16.1        83.9       0.0
            100       6.8        93.2       0.0
            150       1.2        98.8       0.0
            200       0.0       100.0       0.0
ω       Bounds: [5, 1000]
            100       1.0        96.2       2.9
            200       0.3        93.6       6.1
            300       0.1        80.1      19.8
Δ       Bounds: [0, 175]
             20       7.0        85.4       7.6
             50       1.5        90.8       7.7
            100       0.2        88.7      11.1
Table 2
Main results of the study. Accuracy and precision of the recovered parameters are given for each parameter separately: first across all parameter values (All), then for each parameter value individually. Accuracy (unbiasedness) refers to a preferably low constant error, i.e., difference from the original parameter value (mean or median), whereas a high precision is indicated by a low variable error (standard deviation [SD] or median absolute deviation [MAD]). All values are in milliseconds.
Parameter   Value   Mean diff.   SD      Median diff.   MAD
1/λA        All     −0.25         4.50   −1.94          12.10
             20      0.75         2.85    1.85           6.87
             50     −0.52         3.40   −1.19          10.50
            100     −2.25         5.65   −4.81          13.35
            150     −0.75         5.85   −3.59          15.07
1/λV        All     −0.73         4.03   −2.77          12.61
             20     −1.03         2.03   −1.34           4.33
             50     −1.95         4.20   −4.21          10.27
            100      1.00         5.43   −1.02          15.04
            150      0.00         6.45   −4.55          16.70
μ           All      1.35         3.58    3.20          11.97
             50      1.20         3.68    3.49          14.26
            100      1.48         3.30    3.18          12.13
            150      1.05         3.78    2.79           9.69
            200      1.70         3.93    3.33          11.49
ω           All     −2.82         8.05    2.26          51.54
            100     −1.50         2.95   −4.32           9.08
            200     −4.05         5.65   −9.71          23.59
            300     −6.75        34.68   20.73          82.48
Δ           All      3.65         3.60    6.71           9.25
             20      4.70         3.20    7.62           8.07
             50      2.78         3.18    6.41          10.05
            100      2.75         3.65    6.09           9.48