Abstract
A single-field contrast asynchrony refers to a stimulus configuration in which there is a single temporally modulated field and multiple sources of contrast information; the sources of contrast information modulate at different temporal phases or at different temporal frequencies. In this paper we show how single-field contrast asynchronies can lead to a wide variety of visual illusions. We investigate, in depth, the window shade/rocking disk configuration, in which a temporally modulated disk is surrounded by a split annulus (i.e., the top half is dark, and the bottom half is light). When the annulus is thick, the disk appears spatially inhomogeneous (shading); when the annulus is thin, the disk appears to rock back and forth (shifting). We measure the proportion of trials that a disk appears to shade or, on separate trials, appears to shift as a function of modulation amplitude, surround thickness, temporal frequency, and disk size. We account for the shading effects by postulating a combination of separate first- and second-order responses and/or a multi-scale spatial filtering process. We account for the shifting effects by examining four elemental motion conditions. For luminance modulation, the direction of the shift follows the same pattern as that produced by the rectified output of an array of spatial center-surround filters applied to the X, t plot. For equiluminant modulation, the direction of the shifts is consistent with a sequence-tracking (or third-order) motion response.
The information in any visual scene can be expressed in terms of a variety of stimulus characteristics (spatial frequency, luminance, contrast, temporal frequency, chromaticity, etc.). The visual system encodes this information through neural channels, each of which responds to only a small range within a few of these dimensions. One goal of psychophysics is to separate and characterize the response properties of these channels, but this goal is challenging because the signals relayed to the brain from these channels are integrated prior to perception.
This paper examines visual illusions that stem from attempts to construct stimuli that separate the visual response to color/luminance information (first-order response) from the visual response to contrast information (second-order response). The impetus for creating such stimuli came from experiments that showed that chromatic contrast adaptation occurs at rates much faster than the temporal response of chromatic systems (Zaidi, Spehar, & Debonet,
1998; Webster & Wilson,
2000; Shapiro, Hood, & Mollon,
2003). Shapiro et al. (
2003) concluded that the system that controls contrast adaptation is not the same as the color pathways that carry the signal that was ultimately detected.
To separate and characterize the first- and second-order color responses, Shapiro & D'Antona (
2003); Shapiro et al. (
2004a); and Shapiro, D'Antona, Smith, Belano, & Charles (
2004b) developed a class of stimuli (termed “contrast asynchronies”) defined by the asynchronous modulation of multiple sources of contrast information. The initial contrast asynchrony consisted of two physically identical disks, one surrounded by a dark annulus and the other by a light annulus. When the luminance levels of both disks are modulated simultaneously, the contrast (relative to the surround) modulates in antiphase, but the luminance levels remain in phase (an interactive version of this stimulus can be seen by clicking on
Figure 1).
The two-disk asynchrony indicates that at the level of perception, the visual system is able to represent both luminance and contrast information. For instance, a curious phenomenon associated with the two-disk asynchrony is that at 1 Hz, observers report the paradoxical perception that the disks modulate in antiphase (i.e., with the contrast signal), yet also appear to get light and dark at the same time (i.e., in phase, with the luminance signal). At low temporal frequencies, therefore, the perceptual system can separate the response to contrast information from the response to luminance information. At 3 Hz, only the contrast signal could be seen, indicating that the visual response to the contrast information is faster than the visual response to the luminance information.
In addition to presenting the two-disk contrast asynchrony, Shapiro et al. (
2004a,
2004b) documented variants in which the antiphase contrast information was integrated within a single disk (see
Figure 1, and click on
Figure 1 to see interactive demonstration). One variant (which Shapiro et al.
2004a, referred to as a window shade/rocking disk illusion) consists of a disk whose luminance is modulated sinusoidally in time, surrounded by a split annulus, half light and half dark. The contrast at the dark edge is in phase with the luminance modulation, and the contrast at the light edge is in antiphase with the luminance modulation. The appearance of the disk depends upon the thickness of the surrounding annulus. If the surround is thick, a veil of lightness appears to slide back and forth across the disk (like a window shade). If the surround is thin, the disk appears to shift location.
The window shade illusion shows a connection between contrast asynchronies and the effects of the spatial context on lightness/brightness perception. The effects of surrounds on the appearance of a center disk have been studied by generations of artists and scientists (for a historical review, see Kingdom,
1997). There are currently a wide range of theories concerning these types of illusions; for a review of some of these theories, see Adelson (
2000); Blakeslee & McCourt (
2004); Gilchrist et al. (
1999); Kingdom (
2003a,
2003b); Purves, Williams, Nundy, & Lotto (
2004); Shapley, Caelli, Grossberg, Morgan, & Rentshler (
1990). Most (but not all) of these theories recognize the importance of contrast on the appearance of a center field.
The rocking disk illusion (shifting effect) shows a connection between contrast asynchronies and the appearance of motion that arises when shifting between two different contrast levels (see, for instance, reverse phi and four-stroke motion, Anstis & Rogers,
1975,
1986; and the phenomenal phenomena of Gregory & Heard,
1983). The shifting effect may also be related to static tilt illusions such as the café wall illusion, twisted chords, and other recent variations (Kitaoka, Pinna, & Brelstaff,
2004; Pinna & Brelstaff,
2000).
In this paper, we investigate the general principle that a single-field contrast asynchrony with thick borders produces induced shading, whereas one with thin borders produces shifting. We show that the shifting and shading effects depend upon the modulation amplitude and the thickness of the surround. The results are consistent with Shapiro et al.'s (
2004a) interpretation that the shading effect produced by a contrast asynchrony results from the interaction of separate responses to the center light (first-order response) and to the contrast between the center and the surround (second-order response), and that the stimulus parameters change the relative weightings of these responses. The first- and second-order responses may result from a multi-scale spatial filtering of the visual image (such as in Blakeslee & McCourt,
2003). We also show that the shading/shifting principle can be used to generate numerous new illusions. The shifting effects produced in these illusions can be explained by examining the output of an array of center/surround filters. Lastly, we note that a sequence-tracking type of motion system can also see motion in this type of stimuli (Derrington,
1985); however, for some contrast-asynchrony configurations, the direction of motion predicted by sequence tracking is the opposite of the direction predicted by the local energy models. Motion for some equiluminant contrast asynchronies is perceived to follow the prediction of the sequence-tracking models.
In the center of the monitor, observers viewed a disk with a split annulus. One half of the annulus had a luminance level of 20 cd/m2 (dark) and the other half had a luminance level of 80 cd/m2 (light). The center disk had a mean luminance level of 50 cd/m2. We examined the effect of modulation amplitude as a function of temporal frequency (1 or 3 Hz), disk area (1° or 8°), and surround thickness (0.22°, 0.89°, and 3.5° for shading; 0.22°, 0.48°, and 1.7° for shifting). The amplitude of modulation varied between 0 and 30 cd/m2. We express amplitude on a scale from 0.0 (no modulation) to 1.0 (modulation amplitude of 30 cd/m2).
Observers were asked one of two questions: Was the disk shading in appearance (yes/no)? Or (on separate trials) did the disk appear to move (yes/no)? The relevant features of modulation always consist of both luminance and contrast components. It is impossible to change one of these values without changing the other. The stimulus, therefore, does not lend itself to a Type A experiment (Brindley,
1960) or to a forced-choice type of experiment. We therefore settled upon measuring observers' reports of the appearance of the stimulus, bearing in mind that this leaves the interpretation of the results open to questions of observer bias.
Observers ran the experiment in sessions with a fixed stimulus configuration (i.e., fixed temporal frequency, disk area, and surround size). The order of the stimulus presentation was determined prior to the start of the experiment. The observer viewed 10 presentations of the stimulus at each of the eight amplitude values. After each presentation, the observer pressed arrows on the keyboard to indicate whether the disk appeared to be shading or (on a separate session) whether the disk appeared to be shifting. The observer ran two sessions for each stimulus configuration (i.e., twenty presentations of each stimulus configuration for shading, and twenty presentations for shifting).
Figure 2 shows the proportion of trials perceived as shading as a function of modulation amplitude. The data represent the average for the three observers. The data from individual observers have been posted with this manuscript and can be seen from the following
link. The rows present the data measured at different temporal frequencies (1 and 3 Hz), and the columns present different disk diameters. In each panel, the symbols indicate the thickness of the surround (black circles, thin; green triangles, thick; and blue squares, thickest). For a 1°/1 Hz disk (upper left), the two thicker-surround conditions produce a standard monotonically increasing psychometric function that reaches a value of 1 at a magnitude of 0.3. The thin surround condition (black circles) produces a nonmonotonic function. At a modulation amplitude of 0.2, the disk was most frequently perceived as shading; at higher amplitudes, the disk was less frequently perceived as shading. Similar patterns are shown for an 8°/1 Hz disk (upper right panel) and a 1°/3 Hz disk (lower left panel).
In the 8°/3-Hz condition (lower right panel), the shading effect is seen less frequently—indeed, two of the observers seldom reported seeing shading in this condition. The upper temporal limit for shading depends upon the diameter of the disk. This suggests that shading depends upon a neural “filling-in” process: The large area and faster central modulation do not give enough time for the contrast induction to spread across the field.
The nonmonotonic relationship between the appearance of shading and the amplitude of modulation (
Figure 2) parallels visual-evoked potential (VEP) results of Crognale, Switkes, & Adams (
1997). In their study, Crognale et al. measured VEP response to gratings modulating in counter phase at 4 Hz and found a nonmonotonic relationship between the amplitude of the VEP 2
nd harmonic and the contrast of the gratings. They stated that the nonmonotonic relationship was believed to be the result of multiple generators that had different temporal frequencies but “had little or no consequence for the perception of contrast per se.” Here we suggest the possibility that a counter-phase modulating grating produces a visual response to both first- and second-order information. It is therefore conceivable that the nonmonotonic relationship found by Crognale et al. is related to the phenomena shown here.
Lastly, it is worth noting that although all three observers showed the nonmonotonic functions, this result depends on a subjective task. When shown a window shade illusion with large modulation and thin surrounds, half of a class of twelve students reported seeing shading, and the other half reported seeing the disk as uniform (with thick surrounds, they all reported seeing shading). This suggests that in a larger population there will be variability either due to response biases or to differences in the relative strength of the processes that respond to the first- and second-order information. Individual differences on such a task would be consistent with Shapiro et al. (
2004b) and with other studies that show variability in contrast response (e.g., Fraser & Wilcox,
1979; see also Backus & Oruç,
2004).
Figure 3 shows results for the apparent shifting of position as a function of modulation amplitude. The symbols and panels are the same as in
Figure 2. The apparent shifting consistently increased with the modulation amplitude. Shifting occurred most frequently with the thinnest surrounds and least frequently with the thicker surrounds.
The curves were steeper for 8° disks than for 1° disks (i.e., the amplitude at which the observer saw the rocking 50% of the time is smaller for the 8° disks than for the 1° disks). The probable reason for this is that the edges for the 8° disks are in the visual periphery. Other illusions that show motion with contrast (e.g., Pinna & Brelstaff,
2000) are often more noticeable in the periphery. It is likely that such effects have to do with the larger receptive fields in the periphery. Lastly, the results for the 1- and 3-Hz conditions are similar even for 8° disks. This suggests that a neural filling-in process like the one used to explain the results for induced shading is not necessary for shifting.
The results are consistent with the interpretation that the relative strength of the processes that respond to first- and second-order information changes with the stimulus configuration. This interpretation assumes that (1) the response to first-order information creates the appearance of a uniform disk, and the response to second-order information leads to the appearance of shading and to the appearance of shifting; (2) the strength of the response to the second-order information that leads to shading increases as the thickness of the surround increases; and (3) the first-order response as a function of modulation amplitude increases faster than the second-order response in this stimulus. Thus, when the surrounds are thin and amplitude is large, the response to first-order information is greater than the response to second-order information. As the surrounds become larger, the magnitude of the second-order response is more on par with the magnitude of the first-order response.
The basic task was the same as the previous section. Observers were asked whether the disk appeared to shade or shift position. The method of constant stimuli was used to measure psychometric curves (proportion of “yes” responses versus annulus thickness). These curves were measured for six different disk diameters (0.25°, 0.5°, 1°, 2°, 4°, and 8°) and for two modulation amplitudes (0.3 and 1.0).
In each session, observers saw 10 presentations of six disk diameters and eight surround thicknesses. There were four sessions: In two sessions, the observer was asked whether the disk appeared to shade, and in the other two, the observer was asked if the disk appeared to shift (thus, there were twenty presentations of each diameter and surround combination). The observers in the 1.0 modulation condition ran four sessions for both horizontal-split and vertical-split surrounds. The 0.3 modulation condition was run as a follow-up to the 1.0 modulation experiment; observers in the 0.3 modulation condition saw only the horizontal-split surrounds.
There were 16 naïve observers, each of whom ran in individual sessions. We used a physical disk (a CD) to illustrate what we meant by “rock” (i.e., perceived motion that originated from the intersection of the light and dark segments of the annular surround) and by “pivot” (i.e., perceived motion that was tied to a single focal point). The set of stimuli contained disks of 0.25°, 0.5°, 1°, 2°, 4°, or 8° diameter; each of these had a thin annular surround (0.05°) that was split vertically (i.e., the intersections of the surrounds occurred at 90° and 270°) or horizontally (the intersections occurred at 0° and 180°). The order of presentation was generated by the MatLab randomization routine. Observers viewed each presentation for 5 s. The observers were asked whether the disk appeared to move; if they responded yes, they were asked whether the disk appeared to pivot or rock.
Contrast asynchronies arise whenever a stimulus configuration contains contrast signals that differ in temporal phase (Shapiro et al.,
2004a,
2004b). This paper examined contrast asynchronies that arise within a single visual patch. As a rule, thin borders create the appearance of shifting, and thick borders create the appearance of shading (or brightness spreading). We have shown that this rule can be applied to a variety of stimulus conditions to generate a wide range of new visual illusions.
At low modulation levels, the threshold thicknesses for shading are approximately the same as the threshold thicknesses for the elimination of shifting. The similar thresholds might suggest a common etiology for the two perceptual responses—for instance, when the borders are smaller than a ganglion cell receptive-field center, the edge appears to move, but when the borders are larger than the center, the edge appears to shade. However, for high modulation levels there is a range of disk areas and surround thicknesses that most observers perceive to neither shift nor shade. We have also demonstrated that shifting and shading can be perceived at the same time; therefore, the mechanisms underlying these effects do not appear to be yoked.
In the sections below, we discuss the mechanisms underlying shifting and shading as if these two aspects of contrast asynchronies are independent of each other.
Contextual effects on the center modulation have been the source of numerous theories concerning the nature of context on visual appearance. The results here and in Shapiro et al. (
2004a) show that observers can perceive both a first-order response originating in the center modulation and a second-order response resulting from the contrast modulation. The two responses remain separate late enough to be distinguished from each other perceptually (as in the two-disk contrast asynchrony). It is likely that the separation results from some form of multi-scale spatial filtering of the visual scene, such as that described in the model presented by Blakeslee & McCourt (
1999,
2003,
2004). Blakeslee and McCourt's model involves multiple levels of the oriented differences of Gaussians (ODOG) that combine through a normalized output channel. Their account has been successful at describing a wide range of static lightness illusions.
However, a dynamic multi-scale model has a number of additional factors with which it must contend. For instance, to account for the appearance of the two-field asynchrony, a dynamic model would have to produce multiple output channels that could be identified perceptually. Some of these channels would presumably respond in phase with center modulation; others would respond in antiphase with the contrast, depending on the size of the surround.
Furthermore, although most of the examples shown so far seem amenable to the suggestion that shading is simply a low spatial frequency response, there are other examples that indicate that the second-order effects are quite sensitive to thin edges.
Figure 11a shows the two-field asynchrony as rectangular bars. In the interactive demonstration, the levers control thin bars that can extend from the top and bottom of the rectangles. The fields initially appear to modulate in antiphase. When very thin edges are added at the top and bottom of the modulating fields, the fields appear to modulate in phase. Conversely, if the modulating fields are placed in a White's effect configuration (i.e., as if the modulating fields were placed on light and dark bars), the fields appear to modulate in phase, and the addition of thin edges produces the appearance of antiphase modulation (Shapiro et al.,
2004c). Thus, any model that describes the asynchrony as the response to a low spatial frequency filter also has to account for the apparent effect of high-spatial frequency edges.
The effect of thin edges can also be shown in a single-field contrast asynchrony (see
Figure 11b, the barbell illusion). This illusion shows a modulating bar placed between two static squares, one light and one dark, separated by a gray field. If the bar extends into the squares, the bar appears to shade. The effect may at first appear to be the result of a low-frequency filter because the shading extends over a large range of the modulating field. However, if there is a thin gap between the modulating bar and the squares, the shading effects stops, again indicating the possible effect of high-spatial frequency edges.
These effects should not be considered an argument against a dynamic multi-scale model, but rather a cautionary note concerning the range of effects that a complete model will need to describe. It is possible, for instance, that a small gap will decrease the response of the low spatial frequency filter in such a way as to decrease the appearance of shading. Or perhaps there is an active interaction between the response from high spatial frequency filters and response from low spatial frequency filters that would allow for the sharpening of edges. We are currently constructing a model that will allow for the testing of such hypotheses.
It is also possible that there really are separate processes for first- and second-order information. Lights, after all, can be described by their contrast relative to their surround (a second-order response) and by their relative brightness levels (a first-order response). For example, consider the four disks in
Figure 12, which represent two frames from the two-disk contrast asynchrony. The disks can be described by their relative brightness (the disks on the bottom are both dark, and the disks on the top are both light) or by the relative apparent contrast (the top-left and bottom-right are high contrast disk/surround pairs, and the top-right and bottom-left are low contrast disk/surround pairs). Indeed, if one considers this division seriously, then the paradoxical perception produced by the two-field asynchrony (i.e., that they modulate in antiphase but get light and dark at the same time) should not be at all surprising.
It is possible that the division between high/low contrast and high/low lightness (such as that shown in
Figure 12) is the end product of some multi-scale spatial filtering of the stimulus. One way that this could arise is that the first-order appearance could be constructed at later stages of visual processing. Such a system would be efficient (Why send two signals down the optic nerve when it may be possible to send only one? See Barlow,
1965) and consistent with the slow response to the information in the center relative to the contrast information (Shapiro et al.,
2004a). This type of system can explain why there is a reduction in shading for large disks (8°) with 3 Hz modulation (
Figure 2) and other large shading effects such as the “watercolor” illusion (Pinna, Brelstaff, & Spillman,
2001). Another possibility is that the separation of contrast and lightness arises from the mechanisms like the plenoptic structures of Adelson & Bergan (
1991). Viewed from this perspective, contrast asynchronies would represent processes that respond to light changes over time versus processes that respond to the spatial derivative over time. Such mechanisms have been suggested previously in the literature (see Brown,
1965).
A late combination of separate first- and second-order responses could be a process for creating a signed contrast signal, such as that suggested by Whittle's brightness contrast (
1994a,
1994b) and by Gilroy & Hock (
2004) who used a two-field contrast asynchrony to investigate apparent motion across disks. The physiological evidence for such a division is sparse; however, there does appear to be a subset of cortical cells that may respond directly to first-order information (Kinoshita & Komatsu,
2001). In addition, a division between first- and second-order responses may be related to different divisions between contributory elements proposed by other studies: for example, boundary contour versus feature contour (Grossberg & Todoroviâc,
1988; Rudd & Zemach,
2004); retinal contrast versus cyclopean mechanism (Shevell, Holliday, & Whittle,
1992); and even contrast versus assimilation (Hong & Shevell,
2004; Jameson & Hurvich,
1964; Reid & Shapley,
1988).
A separation between first- and second-order responses may prove useful for models of lightness constancy. Whittle & Challand (
1969; see Brown,
2003) have argued in favor of the two types of constancy: one for illumination changes that affect both the background and the foreground object (contrast remains constant) and another that arises when an object moves in front of two different backgrounds (luminance remains constant). The lack of attention to the second type of constancy has been harshly criticized (Gilchrist,
1994; Gilchrist et al.,
1999; Gilchrist & Economou,
2003). This type of constancy seems to directly correspond to a second-order response.
The visual illusions shown in this paper were all generated with the same underlying principle (i.e., stimuli in which the contrast signals modulate out of phase with each other). Thick-edge contrast asynchronies create the appearance of shading; thin-edge contrast asynchronies create the appearance of shifting. Explanations of shading effects seem to require either a combination of the visual response to contrast and luminance information or separate analysis by multi-scale spatial filters. Shifting effects can be accounted for by examining the response of an array of center-surround filters.
The threshold thickness for apparent shifting corresponds roughly to measurements of ganglion cell receptive field centers. The receptive field centers for M and P ganglion cells (physiologically defined) increase from a radius of about 10 min at 4° peripherally to about 20 min at 10° in the periphery (Derrington & Lennie,
1984; Sun & Lee,
2004). The results are therefore consistent with Sun, Ruttiger, & Lee's (
2004) suggestion that positional sensitivity is the result of a sophisticated cortical interpretation of the output of the ganglion cell array.
The information contained in an array of center-surround cells may correctly predict the presence and direction of shifting but cannot explain the interpretation of the stimulus.
Table 1 shows that observers reported that motion within a vertical-split surround appeared to pivot from a single focal point, and motion within a horizontal-split surround appeared to bounce up and down in the plane of the image or to flip out of the plane of the image (in one observer's words, “A movement like that of a butterfly valve”).
We have no firm explanation for why there should be differences between the appearances of the horizontal- and vertical-split conditions. However, the description of the stimulus as a “butterfly valve” does suggest that the horizontal-split stimulus can be interpreted in a manner analogous to a 3-D object illuminated with a single source originating from above. This aspect of the effect suggests explanations based on inferences about illumination and reflection (Helmholtz,
1866/
1962) or on theories that assume that two-dimensional lightness illusions are related to the observer's learned understanding of the three-dimensional world (for instance, Purves et al.,
2004).
Our proposal should not be taken as an argument that higher-order inferences are not involved in lightness/brightness perception, but rather that the input into higher-order processes should be considered in terms of a filtered visual image, not the image per se. (After all, why should early filtering have such a clear effect on motion but no effect on lightness/brightness?)
Our interpretations are, in principle, consistent with Kingdom's (
2003a,
2003b) proposal regarding levels of brightness perception and with Blakeslee & McCourt's (
2003) multi-scale filtering model. However, based on the evidence here and in Shapiro et al. (
2004a), we emphasize that the visual system can respond simultaneously to luminance and contrast information. The perceptual response must therefore represent more than the summed responses of the filters' output. We also would not be surprised if under some conditions the network of filters acted as if there were neural filling-in, and if the network turned out not to be passive, but worked to maximize the informational content in the display (Barlow & Foldiak,
1989; MacLeod,
2003; Zaidi & Shapiro,
1993).
The contrast-asynchrony principle(s) can be used to generate an infinite number of illusions. We have concentrated our investigations on luminance and luminance contrast; however, analogous illusions can be created on a variety of stimulus dimensions (orientation, color, spatial frequency, contrast-contrast induction). Indeed, contrast asynchronies are possible in principle for any stimulus dimension along which contrast effects can be observed; for example, faces (Webster, Kaping, Mizokami, & Duhamel,
2004) or blur (Webster, Georgeson, & Webster,
2002). Most of these variations have not yet been explored.
Two-disk asynchrony. The luminance levels of the two center disks are always identical as they modulate in time. When the surrounds are present, the contrast information modulates in antiphase. At 1 Hz, observers perceive the luminance information modulating in phase and the contrast information (relative to the surrounds) modulating in antiphase. At 3 Hz, observers perceive only the contrast information.
Window-shade illusion (thick annular surround). The luminance of the center disk modulates in time. The disk looks as if a shade is being pulled back and forth across its width. The direction of shading depends upon the orientation of the surround. The introduction of a gray gap produces the simultaneous appearance of shading and shifting.
Tilt/Ramp Asynchrony. The uniform field is sandwiched between two ramps, both oriented so that they are dark on bottom and light on top. When the ramps are thick, the uniform field appears inhomogeneous, with the point of zero contrast sliding up and down the surface. When the ramps are thin, the field appears to become thicker at one end and thinner at the other. The dark end becomes thinner when the modulating field is in the dark phase, and the light end becomes thinner when the field is in the light phase. When the ramps are thick, a thin gray gap interposed between the field and the ramps produces the appearance of both shading and contortion.
Similar to figure 6a, but the luminance of the left ramp goes from white on bottom to black on top. When the ramps are thin, the field tilts from left to right. When the ramps are thick, the induced shading slides up and down opposite sides of the ramp. The insertion of a gray gap when the surrounds are thick produces shifting and (to a lesser extent) shading.
The effect of a ramp placed on a single side of the modulating field. The edge appears to move only when the ramp is thin.
Equiluminant tilt illusion. The appearance of equiluminant tilt is qualitatively different from the achromatic effects. The color of the center field snaps toward the similar color of the surrounding ramp. Similar effects can be created for achromatic lights under some conditions.
Gauge asynchrony. The inverse of the ramp and tilt illusions shown in 6a. A fixed rectangular ramp is surrounded by a modulating field. As the background changes from dark to light, the ramp appears to be divided wherever the contrast is 0 (i.e., the point at which the luminance level of the background matches the luminance of the ramp). The perceptual effect is that of a gliding scale in which a divider slides up and down the ramp. The effect is referred to as a gauge asynchrony because the divider can be used as a gauge that indicates the luminance level of the background light. The divider does not rise as high on the ramp when the modulation is red-to-black or green-to-black.
The gauge asynchrony from 7a can also be created with a non-contiguous gauge.
Stationary edges next to modulating fields appear to shift position. When the light and dark edges are on contiguous sides of the polygons (“same sides”), the polygons appear to shift up and down; when the light and dark edges are on opposite sides (“opposite sides”), the polygons appear to contort. As with the rocking disk illusion, the effect disappears when the edges become thick enough (click on “Hexagon and square with long sides”). We note that the square with long sides produces the appearance of transparency, whereas the hexagon does not.
The effect in 8a is similar to the “phenomenal phenomena” of Gregory and Heard (1982). We reproduce a version of their effect for comparison.
Pulsing sides. Modulating edges next to stationary fields create the appearance of a shift in position (the inverse of figure 8a). Here, we show this effect with several variations of the luminance level of the center and the phase of modulation. In all conditions, the location of the sides remains constant.
Lucy in the sky. The basic principles in figures 8 and 9 can be combined to create a variety of new illusions. The diamonds are configured either as a chain or in a loop. The phase of the luminance modulation of each diamond is offset relative to the others. The buttons also control the direction of the polarity of the edges.
Honeycomb. The basic principles in figures 8 and 9 can be combined to create a variety of new illusions. In this example, many modulating hexagons are placed next to each other. The luminance levels of the borders between the hexagons, and the relative phase of the center modulations, are varied to create a variety of distortion effects.
House of Cards. The basic principles in figures 8 and 9 can be combined to create a variety of new illusions. In this example, different borders around many modulating squares produce various levels of distortion.
The two-field contrast asynchrony with edges. The levers control the thickness and luminance of two edges placed at the top and bottom of the modulating bars. In the standard configuration, the bars appear to modulate in antiphase; when edges are visible, the bars appear to modulate in phase. Conversely, in a White's effect configuration, the fields appear to modulate in phase, and the insertion of edges produces the appearance of antiphase modulation.
The Barbell Illusion. The lever controls the height of the modulating bar. The bar is perceived as shading when the bar extends into the light and dark squares. The buttons control the visibility of the squares or a background ramp.
Four elemental conditions. In the top two panels, the luminance of the edge modulates next to stationary black or white center fields. In the bottom two panels, the luminance of the center modulates against black or white stationary edges. The modulating edges appear to move away from the similar shadings of the center, whereas modulating centers appear to move toward similar shadings of the edge.