The first experiment measured thresholds for set sizes M = 1, 2, 4, and 8 at short, medium, and long standard temporal durations (0.5, 2.0, and 8.0 s). The spatial standards were 1.25, 5.0, and 20.0 cm. In the temporal task, the spatial length was always 5.0 cm, and in the spatial task, the temporal duration was 2.0 s. The observers were one of the authors (EG) and a psychophysically naive young male observer FG, who was not informed of the purpose of the experiment. Additional observations were carried out using the medium standard duration by MM and by another psychophysically naive young male (SG).
Maximum-likelihood estimates of threshold Weber fractions (see
Methods) appear in
Figure 2. It appears from
Figure 2 that the temporal task is more difficult than the spatial task. Before concluding that the temporal task is harder than the spatial, a possible problem to be considered is that there is no natural metric for comparing the spatial and the temporal standards. A “short” length might correspond in its internal representation to a “long” duration. If Weber fractions were constant, as Weber's Law claims, this would not matter. To test for significant differences between Weber fractions, we used only the
M = 1 data and fit the short, medium, and long conditions separately, with two-parameter psychometric functions of Weber fraction. The joint likelihood,
L U, of these unconstrained fits was compared with the joint likelihood,
L C, of fits in which the threshold parameter was constrained to be identical in all three conditions. The “generalized” ratio of these likelihoods −2ln(
L C/
L U), can then be compared to the chi-square distribution with 2 degrees of freedom (because the constrained fit has 2 fewer free parameters; Mood, Graybill, & Boes,
1974).
Applying this chi-square test to our spatial data, we found the generalized likelihood ratios to be 0.1 for EG and 4.3 for FG, whereas the critical value χ 0.95 2(2) = 6.0. Thus, the separate fits were not significantly different, and we have no evidence against Weber's Law. However, the same test of our temporal data tells a different story. The generalized likelihood ratios were 58 for EG and 14 for FG. Thus, the separate fits were significantly different, and Weber's Law did not hold for duration. Therefore, some temporal standards may be remembered with relatively greater precision than others. The Weber fraction for comparison with these “easy” durations may be comparable to that for size comparisons.
In some conditions (e.g., “medium”), the relative difficulty of the temporal task appeared to increase with set size. However, there was a set size effect for both tasks. A small set-size effect in spatial tasks has been reported before (Palmer et al.,
1993; Treisman,
1988) and is not necessarily inconsistent with unlimited capacity once the effects of spatial uncertainty have been taken into account. If an “early” source of perceptual noise perturbed each estimate of spatial and temporal length, then the total amount of noise would increase with set size. One possible strategy would be to select the (noisy) estimate that has the greatest absolute difference from the standard and to report the sign of that difference. This “Max rule” of signal detection theory has provided a successful description of many set-size effects (Morgan & Solomon,
2005).
We fit the data in
Figure 2 with the Max model and found that the fit was poor. (Fitted values of internal noise and log-likelihoods are documented in
Table 1.) Thresholds rose more rapidly with set size than predicted by the Max rule. To quantify how much bigger than the Max model's prediction our set-size effects were, we considered a modification of the Max model, in which there was a linear relationship between the number of search items and the standard deviation of the perceptual noise. Maximum-likelihood fits of this model are shown in
Figure 2 and in
Table 1. The addition of the second parameter improved the fit over the Max model significantly in every case, except for FG in the “short” temporal condition. (Specifically, with the one exception, the generalized likelihood ratio of the two fits exceeded the critical value
χ 0.9999 2(1) = 15.1; see
Table 1.) In general, the precision of temporal estimates fell more rapidly with set size than the precision of spatial estimates. Spatial noise increased less than 1% with each additional item, but—with the exception of FG in the “short” condition—temporal noise increased more than 1% with each additional search item.
We also considered an averaging model of performance, in which the observer computes on each trial the mean value over all the stimuli (Morgan, Ward, & Castet,
1998; Parkes, Lund, Angelucci, Solomon, & Morgan,
2001) and compares it to the standard. The averaging model was a poorer fit than the two-parameter increasing-noise version of the Max model in all cases except FG in the short/temporal condition. The fit of the unmodified Max model was better than that of the averaging model in all cases (
Table 1).
Several panels in
Figure 2 show a large increase in duration thresholds when the number of items increases from 1 to 2. This suggests that we cannot accurately assess the duration of two temporally overlapping events. Of course, having a single stopwatch does not preclude multiple duration estimates in a single trial; it merely precludes parallel estimates. Very long targets can always be at least partially monitored because they will be the last to disappear. Even a single stopwatch can tell when the duration between the last two disappearances is much longer than the standard. Therefore, psychometric functions should always have ceilings near 100%.
Observer EG reported that he based several decisions on the first search item to be presented. To investigate this point, further data were collected. These new data (shown in
Figure 3) confirmed that accuracy was indeed highest when the target was presented first. EG also reported using the temporal order of onset and offset as a cue. Suppose four items numbered 1–4 appear in the temporal order [4 1 3 2] and disappear in the order [1 3 2 4]. It is clear that item 4 is longer than the other. Similarly, the onset pattern [1 4 2 3] followed by [4 1 2 3] means that item 4 is shorter than the others. This is not a high precision strategy since it is unavailable in the case [4 1 3 2] followed by offsets [4 1 3 2].
The highly inefficient search for duration is compatible with two explanations. The first is that there is only a single master stopwatch, which can estimate only one duration at a time. The second is that there are multiple stopwatches distributed around the visual field, but—unlike real stopwatches—they have to be read soon after they have been stopped. According to the latter interpretation, the reason for the inefficiency with overlapping durations is that while the observer is reading one stopwatch, another may terminate and lose its information before the observer can read it. These two interpretations would be difficult to distinguish experimentally.
The greater efficiency of search for size may suggest that there are multiple “rulers” distributed over space, but it is not inconsistent with the notion of a single ruler. Whereas duration information only becomes available at the end of an event, size information remains available throughout. This reasoning suggests that it might be possible to devise a size task with a capacity limit similar to that for the duration task. This could be done by making the size information available for only a brief moment at the end of the stimulus. This was the aim of our second experiment.