One way to describe how crosstalk interferes with selection is to imagine that some proportion of the output from the distractor channel is leaked into the output of the target channel. Poor selection can be exemplified by the extent to which the probability of a yes response is greater when: (a) a distractor alone occurred compared to no transients at all, and/or (b) both a target and distractor occurred compared to a target alone.
We formalized this crosstalk concept into a model called
the limited-capacity sharing model with crosstalk. The term
limited capacity refers to the fact that we allowed sensitivity to vary freely across conditions, in contrast to the specific limited-capacity model that assumes a fixed rate of information processing (Shaw,
1980). The term
sharing refers to the assumption that both tasks are performed independently but with limited capacity. The model begins with an encoding stage: each feature is encoded by an independent sensory channel, the output of which is a normally distributed random variable. We assumed that on a transient-absent trial the output of the channel was drawn from a “noise” distribution: a normal distribution with a mean of zero and a standard deviation of one. On transient-present trials the output of the channel was drawn from a “signal” distribution: a normal distribution with a mean greater than or equal to zero and a standard deviation of one. Finally, we assumed that the sensitivity of the two motion/luminance channels were the same (e.g., the sensitivity to upward and downward motion is equivalent).
To illustrate the model, consider the motion-task for a given trial in which upward motion is cued (
Figure 7).
Figure 7A depicts the probability density functions (PDF) for the outputs of the upward and downward motion channels (above and below respectively). The random variable
x1 denotes the internal evidence for an upward motion transient (target), and the random variable
x2 denotes the internal evidence for a downward motion transient (distractor). In the absence of a transient, the output of either channel is drawn from the noise distribution (with a mean equal to zero). On trials containing a target or distractor transient, the output is drawn from the signal distribution (with a mean shifted from zero). Based on the assumption that attention has no effect on the stimulus encoding stage, the mean of the signal distribution for target and distractor transients are the same. The mean of the signal distribution is equal to the sensitivity (
d′). As sensitivity decreases, the overlap between the two distributions increases, making the perceptual discrimination between stimuli present/absent more difficult.
Following stimulus encoding, a decision is based on the output of the cued channel (
x1), plus some amount of leak, or crosstalk, from the uncued channel (
x2). The amount of crosstalk is controlled by a gain term. If selection were perfect then the value of the gain parameter would equal zero and the distractor would have no effect on the decision. If the subject were unable to differentiate the target from the distractor—a complete failure of selective attention—then the gain parameter would equal one. A yes response is made if the pooled output of the two channels is greater than a criterion value (
Figure 7B). The proportion of yes responses increases as a function of crosstalk (
Figure 7C). If selection were perfect (a crosstalk gain parameter equal to zero), then distractors should have no effect on responses (
Figure 7C, top). Given a moderate level of crosstalk (gain = 0.5), distractors will increase the proportion of yes responses (
Figure 7C, middle). Given a complete failure of selective attention (gain = 1.0), the proportion of yes responses given a distractor alone will equal the proportion of yes responses given a target alone, and will increase to the combined probability of a yes response when both a target and distractor occur (
Figure 7C, bottom).
The model contains three parameters: sensitivity, which defines the mean channel output corresponding to a transient-present trial (signal distribution); a gain term, which controls the amount of leak from the uncued channel; and a decision criterion, which determines how large the pooled output from the two channels must be in order to produce a yes response. We used a maximum likelihood procedure to estimate the parameter values that yielded the greatest probability of generating our observed data set. In order to take full advantage of the information in our data set, we divided trials into four categories based on the pairwise combination of target and distractor transients and tallied the number of yes responses in each category. We then fit the model to these four yes response probabilities. The motion and luminance tasks for each of the three cue conditions were fit separately.
To visualize the model predictions to the data, we replotted in
Figure 8 the values from each pair of ROC points from
Figure 6A and
B on a common axis—the proportion of yes response. Three general patterns are immediately apparent when inspecting the proportion of yes response for each task across the three cue conditions. (a) The distribution of yes responses was nearly identical between the single-task condition and the dual-task, within-surface condition. (b) Crosstalk was more evident for the luminance task than for the motion task across all three cue conditions (compare the proportion of yes responses with and without distractors: cyan vs. blue and red vs. yellow). (c) Performance dropped, and selection errors became more prevalent when attention was divided between-surfaces.
The difference in the observed probability distributions between the single-task and the dual-task, within-surface conditions (
Figure 8) was statistically indistinguishable, χ
2(3, 4) = 51.81 and 28.87 for the luminance and motion task respectively;
p > 0.05. Fitting the model separately to these two conditions improved the fit by less than 3% (increase in maximum likelihood) for the motion condition and less than 1% for the color condition. Thus, we reduced our parameters by fitting the combined data for the two conditions (single-task and dual-task within), hitherto referred to as the baseline condition. None of the residual differences between the model predictions and the observed proportion of yes responses were not statistically different from zero (
p > 0.05 for all 24
t-tests) (
Figure 9).
The average parameter values for the baseline and the dual-task, between-surfaces conditions are displayed in
Table 2. Considering the baseline condition alone (
Table 2, first column), the model describes behavioral performance as follows. First, detection sensitivity was high for both tasks (
d′ of 3.4 ± 0.2 for the motion task and 3.8 ± 0.2 for luminance task). Second, the crosstalk gain parameter determines how well the subjects were able to select the cued feature and ignore distractor transients within the same feature dimension. The crosstalk gain parameter was significantly greater than zero in both cases,
t(4) = 6.95 and 17.75;
p < 0.01, suggesting that even in the baseline condition, subjects were not able to completely filter out distractors. There was more crosstalk in the luminance task than in the motion task (0.50 ± 0.03 vs. 0.17 ± 0.02, for the luminance and motion task, respectively). This difference is reflected in the data by the increase in the false alarm rate when a distractor transient occurred (0.21 ± 0.05 vs. 0.04 ± 0.02,
Figure 8 difference between cyan and dark blue bars), and an increase in the hit rate when a distractor transient co-occurred with a target transient (0.26 ± 0.06 vs. 0.08 ± 0.02,
Figure 8 difference between orange and yellow bars). Finally, the response criterion determines the trade-off between false alarms and misses. A response criterion equal to half an observer's sensitivity—zero bias—predicts an equivalent false alarm and miss rate. The higher the criterion—a conservative bias greater than zero—the more sensory evidence the observer requires to make a yes response. A conservative observer with a high criterion will commit more misses in order to avoid false alarms. Subjects tended to be conservative in both tasks (bias of 0.45 ± 0.04 and 1.02 ± 0.20, for the motion and luminance tasks, respectively), committing fewer false alarms than misses. This parameter is reflected in the data by a very low false alarm rate when no distractor occurred (0.037 ± 0.012 and 0.009 ± 0.004,
Figure 8, dark blue bars).
Dividing attention between surfaces is captured in the model by the ratio of the parameter values for the dual-task, between surface and baseline conditions. The log
10 of this ratio is tabulated in the third column of
Table 2. Dividing attention between surfaces results in: (a) a significant decrease in sensitivity shown by a log sensitivity ratio of −0.21 ± 0.04 for the motion task, and −0.37 ± 0.03 for the luminance task,
t(4) = 4.96 and 12.97;
p < 0.01, plotted on the left in
Figure 10; (b) a significant increase in crosstalk shown by a log of the crosstalk gain ratio of 0.35 ± 0.09 for the motion task and 0.21 ± 0.04 for the luminance task,
t(4) = 4.09 and 5.30;
p < 0.05, plotted on the right in
Figure 10. In addition, there was also a conservative shift in bias (corresponding to a change in
d′ units of 0.26 ± 0.04 and 0.05 ± 0.21) shown by a log criterion ratio of 0.20 ± 0.03 for the motion task and 0.03 ± 0.10 for the luminance task. However, this effect was only significant for the motion task,
t(4) = 5.8;
p < 0.05, and not for the luminance task,
t(4) = 0.29;
p > 0.05. For the luminance task, 2 of the 5 subjects showed a liberal shift in bias.
An observer may choose to implement a transient detection strategy by ignoring the cue and responding to transients in either surface (e.g., upward or downward speed changes for the motion task; or luminance changes across the red or green dots for the luminance task). When asked to divide attention between surfaces, did the subjects choose to pursue a transient detection strategy or were they unable to simultaneously select features from competing surfaces? A transient detection strategy would result in poor performance since observers' false alarm rate would equal their hit rate. Such a strategy would place an observer's performance on the negative diagonal in the AOC plots (
Figure 3), equivalent to ignoring one of the two cues (as predicted by the all-or-none switching model). Only one subject (S4) demonstrated that level of dual-task deficit. The other four subjects outperformed the theoretical low limit of dual-task performance. In addition, a pure transient strategy would result in a crosstalk gain parameter value of one. The maximum likelihood estimate for the crosstalk gain parameter was below one for both tasks (0.84 ± 0.09 and 0.39 ± 0.07). This suggests that even though selection was poor, subjects were, at the very least, attempting to ignore the distractor transients.
To summarize, the proportion of yes responses conditionalized on target and distractor transients (within the same feature dimension) were statistically identical between the single-task and the dual-task, within surface conditions. Our limited-capacity model with crosstalk adequately fit our data set (
Figure 8) with no consistent residual error (
Figure 9). Dividing attention between surfaces resulted in a dual-task deficit described by the model as a decrease in detection sensitivity paired with an increase in crosstalk (
Figure 10). The increase in crosstalk in the luminance task approached a complete failure of selective attention.
Overall, performance was statistically indistinguishable between the single-task conditions (motion or luminance task) and the dual-task within-surface condition (motion and luminance task). In contrast, dividing attention between surfaces to perform the motion and luminance tasks resulted in a significant dual-task deficit. Performance across the two tasks was statistically independent (uncorrelated), contrary to the predictions of an all-or-none switching model. In addition, distractors within the same feature dimension on the other surface increased the proportion of yes responses, indicative of crosstalk. Although crosstalk was observed in all conditions, it was greatest when attention was divided between surfaces (
Figure 6). Although distractors in the other feature on the other surface were successfully filtered (
Figure 8), luminance distractors masked motion transients within the same surface (
Figure 7). We constructed a limited-capacity sharing model that includes a crosstalk gain parameter to account for crosstalk within the feature dimension. Our model successfully fit the observed proportion of yes responses conditionalized on targets and distractors.