Vernier offset discrimination is compromised by closely neighboring flanks. Interference is strongest when a single flank is placed on each side at a distance of 2–3 arcmin and is less for smaller or larger separations (see
Figure 1; Westheimer & Hauske,
1975). We have here demonstrated how both the length and the pattern of the flanks influence this masking. Except for the equal-length condition in
Figure 3, in almost all configurations we tried, the threshold elevation due to a single pair of flanks is reduced when the flanks become part of a more extended configuration.
Originally, vernier interference was explained in terms of local interactions between the vernier stimulus and the flanks (Levi, Klein, & Aitsebaomo,
1985; Westheimer & Hauske,
1975; Westheimer et al.,
1976). The simplest form of interaction, light summation within the optical image on the retina can be quickly discounted. If optical overlap between target lines and flanks were the cause of the rise in threshold, then the effect ought to diminish monotonically with increasing target–flank separation. This is not what the data in
Figure 1 show (also, see Westheimer & Hauske,
1975).
Once optical factors are ruled out, one turns to the properties of the neural circuits utilized in fine spatial localization tasks. Those involved in vernier acuity remain to be fully elucidated, and the first structure to be examined in this connection is the retina. At the outset, it is clear that the magnitude of the thresholds precludes reliance solely on the local signs of individual elements of the retinal mosaic. Hence, processing from ensembles of neurons is required for a signal to emerge with the few arc second precision of alignment acuity. The evidence is overwhelming that the earliest location in the visual stream of the phenomena under study here is the visual cortex. First, the change with eccentricity in the visual field is different from that of retinal structure and functional organization of ganglion cells, and second, masking is dichoptic (Westheimer & Hauske,
1975).
Once it is accepted that the primary visual cortex is involved, one can examine the properties of the neurons there, a subject that has been thoroughly investigated even in the preparation most germane to human performance, the alert primate. Here, a great deal of information has been accumulated recently about neural interactions, specifically about the influence of contextual stimuli on the firing of individual orientation-selective neurons. Exploration has been extended from the receptive field, that is, the spatial region in which explicit stimuli induce firing of the neuron, to the so-called “nonclassical receptive fields,” a surrounding zone from which stimulation, which, by itself, does not affect the neuron, nevertheless will influence the response to stimulation within the receptive field (Allman, Miezin, & McGuinness,
1985; Kapadia, Ito, Gilbert, & Westheimer,
1995; Knierim & van Essen,
1992; Levitt & Lund,
1997; Li et al.,
2000; Sillito, Grieve, Jones, Cudeiro, & Davis,
1995). Such findings invite comparison with interaction phenomena in the realm of flank masking of visual thresholds in the spatial hyperacuity range.
A class of explanations argues that there is signal averaging within small spatiotemporal windows and a subsequent pooling of information (Badcock & Westheimer,
1985; Baldassi & Burr,
2000; Parkes, Lund, Angelucci, Solomon, & Morgan,
2001; Pelli, Palomares, & Majaj,
2004). There are also long-standing proposals of contour interactions either in the context of metacontrast masking (e.g., Werner,
1935) or in a more general manner (e.g., Blachowsky,
1912). None of these concepts, however, can accommodate our findings that extending the number of flanking lines improves performance dramatically. Nor can the improvement of performance be explained by any kind of energy mechanisms because longer and shorter flanks yield an improvement of performance, which is more pronounced in the longer line conditions (see
Figure 3). Even more surprisingly, worst performance is reached if the flanks have the same length as the vernier. This nonmonotonic dependence of performance on flank length imposes serious restrictions on many models of spatial processing.
Other explanations invoke lateral inhibition between elements to explain vernier threshold elevation postulating that the flanks fall into the inhibitory region of the vernier offset detector mechanism, introducing a desensitization. Additional flanks might then reduce such inhibition. The gap experiments of
Figure 5 would then be interpreted as examples of breaking a hypothetical chain of inhibition/disinhibition. A credible model with the nuanced set of parameters to cover the range of effects revealed in this study with reasonable verisimilitude is beyond the scope of this article.
Whereas our results tend to support a role of target conspicuity in crowding research, other researchers have denied such a role (Chung, Levi, & Legge,
2001; Felisberti, Solomon, & Morgan,
2005). Their results were obtained in peripheral vision; we presented stimuli foveally. Because spatial visual processing in the retinal periphery may differ in significant respects from that for foveal stimuli, comparison between such results may be difficult. For example, improved performance with increases in the number contextual elements was reported previously in foveal vision (e.g., Li et al.,
2000; Wehrhahn et al.,
1996), whereas in peripheral vision, performance does not seem to improve with an increasing number of flanks (e.g., Felisberti et al.,
2005; Parkes et al.,
2001; Pelli et al.,
2004; Strasburger, Harvey, & Rentschler,
1991; Wilkinson, Wilson, & Ellemberg,
1997).
Here, we are taking the first steps to examine the proposition that the results of the experiments described above warrant interpretation in terms of grouping processes in which the flanks, if their number is small or if they share the target lines' length (e.g.,
Figure 3, “equal length”), join cluster with the target and mask it, whereas when their number is large and they are of different length, they form their own configuration and allow the target to stand out by itself (e.g.,
Figures 2,
3, and
6).
We see reinforcement of this view in the observation that on casual inspection of the configurations, it appears that the more conspicuous the vernier pattern, the lower is the threshold. The impression that the more the actual test pattern stands out from its laterally flanking surround, the better its vernier acuity, can be given numerical expression by comparing the thresholds with the subjective ranking of conspicuity: The lower the score is, the more conspicuous the vernier pattern is. Five observers were asked to rank the eight configurations in
Figure 7A in the order of how conspicuous the central test vernier pattern appeared to them. The averaged rank order was plotted as a function of the average vernier threshold.
Figure 7B shows that there is indeed a strong tendency for the threshold to be higher the less the test vernier stands out from the whole display.
The data presented here open up the vista of formulating rules that grouping of feature elements obey in masking vernier acuity and, by extension, in acting as modules in perception.