Full-color natural images were drawn from a set of 4,025 images of beaches, city streets, forests, highways, mountains, and offices. These six categories were selected in an attempt to capture a representative sample of natural and man-made environments. Each image was rated to indicate how representative it was of its category by workers via the Internet (Torralbo et al.,
2013). Briefly, workers using Amazon Mechanical Turk rated each image on a scale from 1 (
poor) to 5 (
good) or indicated that it did not belong to the specified category (see Torralbo et al.,
2013, for more details). If more than 25% of the workers indicated the image was not from the category, it was removed from the data set. For each of the six categories, we selected 40 “good,” 40 “medium,” and 40 “bad” images based on their mean ratings (mean scores were 4.70, 3.99, and 2.88, respectively). Scenes were phase scrambled by combining in the Fourier domain the amplitude of an intact scene with the phase from a random noise image and taking the inverse fast Fourier transform of this hybrid image. Examples of intact and scrambled good and bad exemplars are shown in
Figure 1. Perceptual masks were colored images of white noise at multiple spatial frequencies with naturalistic textures overlaid used in previous studies of rapid scene categorization (Torralbo et al.,
2013; Walther et al.,
2009). All images were presented at a resolution of 800 × 600 pixels and subtended approximately 30° of visual angle.