From the literature discussed above, it is clear that both structural (i.e., shape/geometric information) and surface-related (i.e., texture, color) visual cues are diagnostic to scene perception. This comes from human neuroimaging research demonstrating the importance of spatial layout in scene processing (for review, see
Epstein & Baker, 2019), and the joint contribution of both shape and texture to scene representation in PPA (
Lowe et al., 2016;
Lowe, Rajsic, Gallivan, Ferber, & Cant, 2017). Beyond neuroimaging research in neurologically intact individuals, neuropsychological research with patients 1, who has a profound visual form agnosia and cannot accurately perceive structual details of objects but can perceive their surface properties (
Humphrey, Goodale, Jakobson, & Servos, 1994), demonstrated that patients 1 exhibited higher activation in PPA for appropriately colored scenes compared with black-and-white versions of these scenes (the latter of which can only be recognized on the basis of structural cues;
Steeves, Humphrey, Culham, Menon, Milner, & Goodale, 2004). Moreover, recent neuroimaging and neurophysiological research has demonstrated the importance of both shape and texture to scene-selective regions of the macaque brain, with potentially greater representational weighting for the latter (
Kornblith, Cheng, Ohayon, & Tsao, 2013). Importantly, these neuroimaging, neuropsychological, and neurophysiological findings are consistent with behavioral scene research. For many years, much of this research focused on shape-related visual cues, owing in part to the findings that both rats and human infants reorient themselves in their environment solely on the basis of geometric cues (
Cheng, 1986;
Hermer & Spelke, 1994). However, subsequent research revealed the additional role of surface-based visual cues (such as color and texture) to scene perception and recognition. For example, natural scenes are recognized more quickly when presented in color, as opposed to in black-and-white (
Gegenfurtner & Rieger, 2000), a finding that mirrors the neuropsychological research discussed above (
Steeves et al., 2004). Moreover, surface-based cues can be used to categorize scenes (i.e., scene “gist”) without the need to identify particular objects in those scenes (
Biederman, Mezzanotte, & Rabinowitz, 1982;
Møller & Hurlbert, 1996;
Oliva & Schyns, 1997,
2000;
Oliva & Torralba, 2001;
Schyns & Oliva, 1994,
1997;
Vailaya, Jain, & Zhang, 1998). Finally, models that use textural information have successfully demonstrated how the visual system represents scenes, particularly beyond the fovea where visual information is degraded (
Balas, Nakano, & Rosenholtz, 2009;
Rosenholtz, 2011). Taken together, the number of lines of evidence discussed above demonstrate the importance of both shape and surface-properties to scene representation, but it remains unclear how the processing of these features are represented in scene-selective cortex (i.e., interactively or independently), and how their representation relates to the processing of global scene properties in PPA. Gaining a deeper understanding of these issues is important, because it will not only inform future computational and neural-network models of scene representation, but may also resolve ambiguities in the neuropsychological literature relating to the relative weighting of shape- versus surface-related cues in everyday scene perception and recognition (e.g.,
Robin, Lowe, Pishdadian, Rivest, Cant, & Moscovitch, 2017).