Since the masking differences in
Experiments 1–3 imply an early neural substrate for those effects, yet cannot be explained by contrast biases at the lowest or highest spatial frequencies, the question as to why
α = 1.0 noise masks produce the largest masking effects (i.e., the target-mask slope similarity effect) remains. Interestingly, the modulation of masking strength by mask
α in the current study is virtually identical to the simultaneous masking effects observed by Hansen and Hess (
2012) in which participants had to identify the orientation of Gabor patches or identify letters. There,
α = 1.0 noise masks were found to produce stronger masking than
α = 0.0 or 1.5 noise masks. Hansen and Hess (
2012) argued that the different
α masking strengths could be explained by modeled cortical interactions employing contrast gain control processes (e.g., Carandini & Heeger,
1994; Heeger,
1992a,
1992b; Wilson & Humanski,
1993), which take place exclusively within V1. Their contrast gain control account built on the work of David Field and Nuala Brady (Brady & Field,
1995; Field,
1987; Field & Brady,
1997) who proposed a modified multichannel model of V1. That model predicts overall larger spatial frequency channel responses for
α = 1.0 stimuli (compared to stimuli with smaller or larger αs). In their model, the spatial frequency bandwidth of different early visual channels is held constant in octaves on a log axis, with the peak sensitivity of each filter channel also held constant, based on the results of a large sample of striate neurons (R. L. De Valois, Albrecht, & Thorell,
1982). With such a configuration, any image possessing an amplitude spectrum slope
α ≈ 1.0 will produce equivalently large contrast energy responses across all channels in an inhibitory contrast gain pool, regardless of the peak spatial frequency to which each filter is tuned (see Brady & Field,
1995, for further detail). Thus, if
α = 1.0 noise largely and equally drives all spatial channels, a tuned gain pool for
α = 1.0 noise should be much more active than with smaller or larger
αs, thus producing the greatest inhibitory effect on the output signal to perceptual decision processes. Furthermore, contrast gain control processes were shown to strongly operate during the first 50 ms of processing time in a backward masking paradigm (Essock, Haun, & Kim,
2009). Thus, the masking effects produced by the amplitude masks in
Experiments 1–3 may simply reflect a variant of a contrast gain control modulated signal-to-noise ratio in early visual processes that depends much more on the distribution of contrast across spatial frequency than across orientation. Therefore, the target-mask slope similarity effect observed in
Experiments 1–3 may result from a modified contrast gain control process implemented entirely in V1. The implication here is that the majority of amplitude-only masking effects may have more to do with specific early visual mechanisms than later scene categorization processes (e.g., in the PPA, the LOC, etc., Walther, Caddigan, Fei-Fei, & Beck,
2009). If so, such interactions would apply equally well to a wide array of visual tasks (e.g., Gabor orientation discrimination or letter recognition), rather than being limited only to rapid scene categorization. Additionally, the lack of any category-specific effects in
Experiments 1–3 further supports the idea that those masking effects resulted from general neural operators, at least for amplitude-only-defined content derived from global spectral manipulations.