Table 1 lists nine studies in which contrast thresholds were collected for identification of alphanumeric symbols as a function of size (Alexander, Xie, & Derlacki,
1994; Aparicio et al.,
2010; Blommaert & Timmers,
1987; Ginsburg,
1978; Legge, Rubin, & Luebker,
1987; McAnany & Alexander,
2006; Pelli, Burns, Farell, & Moore-Page,
2006; Strasburger, Harvey, & Rentschler,
1991). The final study (Watson) refers to data collected for this report. The studies varied in their methods, including exposure duration, symbol color (black or white), the luminance of the background L
0, the font, the number of symbols, the percentage correct that defined threshold, monocular or binocular viewing, and the type of display. The values of these parameters are provided for each study in
Table 1. In all studies, trials were blocked by size, except in the study by Ginsburg (
1978), in which an eye chart ordered by size was used.
The data from these studies are summarized in
Figure 1a, plotted as contrast sensitivities (inverse contrast thresholds). To make the summary data more legible, we have averaged over observers within each study, using the method described in
Appendix 1. There is substantial variability between studies, in part because of the variation in conditions. For example, the lowest sensitivities are for the briefest durations (McAnany & Alexander,
2006) whereas the highest are for the longest durations (Aparicio et al.,
2010; Legge et al.,
1987). But despite the variations, a general pattern is evident. The dashed gray line in the figure illustrates the behavior of a simple ideal observer: Sensitivity increases in proportion to size (the vertical position of the curve would be determined by the power spectral density of the noise, but here it is arbitrary). The data, on the other hand, show an initial slope that is much greater than 1, followed by a flattening and a decline beyond about 1 deg.
An alternative representation of these data is in terms of contrast difference energy. An ideal observer classifying a set of M images c
m will base its decision on the vector distance between the test image and each candidate image in the set. Performance for the set will depend on the squared distances between members of the set (Watson & Ahumada,
2008). A summary metric representing these differences is the average squared distance between members of the set (Ahumada & Watson,
2013; Dalimier & Dainty,
2008). This is equivalent to the average squared distance between each member and their average (Ahumada & Watson,
2013). Thus, a useful metric for the set is the contrast difference energy
where
c̄ is the average image,
dx and
dy are the width and height of a pixel, and
dt is the duration, or the integral of the square of the temporal waveform. In this expression, we have omitted the spatial image coordinates
x and
y.
Expressing identification thresholds in terms of contrast difference energy has two advantages: It automatically takes into account differences in duration and font weight and styling, and it displays performance variations relative to an ideal observer. To transform the data, we first computed the “unit” contrast difference energy of the set of symbols used in each study, at a nominal size of 1 deg and a duration appropriate to the study (see
Appendix 2 for details and
Table 1 for values). The contrast difference energy of each point in
Figure 1a can then be determined by multiplying the unit contrast difference energy by the square of the contrast and by the square of the letter size in deg.
A convenient unit for contrast energy or contrast difference energy is dBB (Watson & Ahumada,
2005; Watson & Solomon,
1997). This is given by
This is a decibel measure, adjusted so that 0 dBB approximates the minimum visible contrast energy for a sensitive human observer (Watson, Barlow, & Robson,
1983). Thresholds can be expressed as dBB, whereas sensitivity can be expressed as −dBB. The data in
Figure 1a are presented again in
Figure 1b as contrast difference energy sensitivities in −dBB. Note that the scatter of the data has been reduced considerably. With respect to the remaining variation, we note that even after compensating for duration and contrast energy, the studies differed in specific observers, font complexity, percentage correct, number of symbols, background luminance, pupil diameter, and contrast polarity. All of these are likely to have some effect on contrast sensitivity for identification. The largest variations are for the smallest symbols, where variations in acuity will have their effect.
Rather than attempting to fit the data for one study or one observer, we will fit the ensemble of data. In the case of data collected in our own lab, where all relevant experimental parameters are known, we will fit results for individual observers.
For an ideal observer, limited only by the inherent noise in the signal, contrast energy sensitivity would be constant when expressed in dBB. Considering the data in
Figure 1b, we note that the sensitivities first rise with increasing size, reaching a peak at a size around 1/4°, and then fall continuously at larger sizes. We will argue below that the initial rise is due to the escape from optical and neural blur, whereas the later decline is due to the decline in resolution of the periphery.