The third type of explanation for the shape of the speed vs. size curve deals with the visual processing of letter strings rather than individual letters. The
visual span is the number of adjacent letters that can be recognized reliably without moving the eyes (O'Regan,
1990). It has been proposed that the size of the visual span imposes a bottom-up sensory limitation on reading speed (Legge et al.,
2007; Legge, Mansfield, & Chung,
2001). Legge et al. (
2001) introduced a psychophysical method for measuring the size of the visual span. Subjects are shown briefly presented strings of three unrelated letters (trigrams) at several positions left and right of the point of fixation (
Figure 4, top). Clusters of letters are used rather than single letters because of their closer approximation to text and because interfering effects of adjacent letters (crowding) may be an important determinant of reading speed. In a trigram trial, the subject reports the three letters aloud in left-to-right order and is scored right or wrong for each letter. After a block of several hundred trials, the accumulated results are plotted as letter identification accuracy (percent correct) as a function of distance left and right of fixation. These plots, termed visual span profiles, usually peak at fixation where letter recognition accuracy is very high and decrease monotonically left and right of fixation. An example of a visual span profile is shown in
Figure 4 (bottom).
The size of the visual span can be quantified as the area under the profile or as the distance left and right of fixation for which letter recognition accuracy exceeds some criterion level of performance such as 80% correct. For instance, if letters in trigrams can be recognized with 80% accuracy or more for four letter positions left and right of fixation, but not at greater distances from fixation, as in
Figure 4, we could say that the visual span is nine characters wide (the fixated letter plus four letters on either side of fixation.)
Several studies have examined how both reading speed and the size of the visual span depend on stimulus attributes of text. High correlations have been found between reading speed and the size of the visual span for variations in character size and contrast (Legge et al.,
2007), character spacing (Yu et al.,
2007), text orientation, i.e., vertical text compared with conventional horizontal text (Yu, Park, Gerold, & Legge,
2010), and the retinal eccentricity of text presentation (Legge et al.,
2001). These high correlations mean that when the visual span gets small, reading speed slows down. In the case of print size, for example, the size of the visual span remains constant at its maximum value over the same range of print sizes for which reading speed is maximum (the “fluent range”). For letters smaller or larger than the fluent range, the visual span gets smaller and reading speed slows down. Intuitively, the idea is that as the visual span gets smaller, fewer letters can be recognized on each fixation, or in a given unit of time, and reading slows down because more “looks” at text are required to read. A theoretical framework for the role of the visual span in reading was presented as part of an ideal observer model of reading called Mr. Chips (Legge, Hooven, Klitz, Mansfield, & Tjan,
2002; Legge, Klitz, & Tjan,
1997). Simulation results showed that the model's mean saccade length decreased as the model's visual span size decreased. Given that a reduction in mean saccade length would normally correspond to a reduced reading speed, the model shows how a smaller visual span size would result in a slower reading speed.
Reading speed may be linked to the size of the visual span, but what determines the size of the visual span? Why does the visual span (measured as the number of highly recognizable letters around fixation) suddenly begin to decrease in size for letters smaller than the critical print size? Pelli et al. (
2007) have made a persuasive case that crowding imposes the major limitation on the size of the visual span. An important and deep insight underlying their contention is that the limiting factor is not the size of the letters per se but the spacing between letters (assuming that the letters do not physically overlap).
Crowding refers to the observation that recognition of letters flanked by other letters (such as “g” in the trigram “tgu”) is much harder to recognize in peripheral vision than single letters with no flankers (Bouma,
1970; Woodworth,
1938). Bouma (
1970) measured percent correct letter recognition as a function of distance from the fovea for letters of a fixed size (the x-height was 14 arcmin). Target letters were presented alone or flanked on both sides with an “x” as in “xax.” Recognition accuracy was severely reduced by the flankers. The interfering effect extended over a large distance from the target letter, roughly equivalent to half the distance from the target to the point of fixation. Pelli et al. (
2007) described the spacing over which crowding effects extended from a given letter to a neighboring letter and defined the critical spacing
S within which crowding would result in a reduction to 80% or less letter recognition. They referred to the relationship between this critical spacing
S and retinal eccentricity
ϕ as the
Bouma law:
where
ϕ is the distance of a target letter from fixation in degrees,
S is the critical spacing in degrees,
S 0 is the critical spacing near fixation with a value roughly 0.1° to 0.2°, and
b is a constant named for Bouma. When adjacent letters are closer than the critical spacing, there will be crowding, and letter recognition will suffer.
In reading, letters in the text extend leftward and rightward away from the currently fixated letter. Letters farther from fixation are at greater eccentricity
ϕ with larger critical spacing
S. When the distance from fixation is large enough, adjacent letters fall within the critical spacing for the retinal eccentricity in question, and letter recognition performance suffers from crowding. It is this crowding that limits the size of the visual span. When print size gets small, the spacing between letters also gets small and the constant
S 0 in Bouma's law begins to dominate. Below a critical print size, the spacing between all letters in the text string falls within the critical spacing for crowding. When this occurs, the visual span shrinks rapidly and reading slows down. In short, according to Pelli et al. (
2007), the critical print size for reading is not a consequence of the size of the letters per se, but a result of the center-to-center spacing between adjacent letters falling within the critical spacing for crowding.
This discussion of crowding and spacing might seem to imply that extra-wide spacing between letters in text would reduce crowding, increase the size of the visual span, and result in faster reading. However, empirical study has shown that extra-wide spacing does not enlarge the visual span nor increase reading speed (Yu et al.,
2007).
To briefly summarize the findings reviewed in this section, oculomotor factors do not determine the boundary print sizes for the fluent range. It is likely that the spatial frequency dependence of contrast sensitivity plays a role in the reduction of reading speed for very small and very large letters, but it remains uncertain why peak frequencies (cycles per letter) for letter recognition are size dependent. It appears likely that the size of the visual span is an important determinant of reading speed, and crowding is a primary determinant of the size of the visual span.