The peripheral advantage for scene recognition in our model supports the importance of PPA in scene recognition. The PPA is activated more for buildings and scenes than other categories, such as faces (Epstein et al.,
1999), and it is involved in scene memory (Brewer, Zhao, Desmond, Glover, & Gabrieli,
1998; Ranganath, DeGutis, & D'Esposito,
2004). The PPA has also been shown to respond to the spatial layout or geometry of the scene—that is, whether a scene is “open” versus “closed” (Oliva & Torralba,
2001), in tasks like navigation (Epstein & Kanwisher,
1998; Janzen & Van Turennout,
2004) and scene classification (Park, Brady, Greene, & Oliva,
2011). However, PPA activity is not modulated by the number of objects in the scene (Epstein & Kanwisher,
1998). Many studies using multivoxel pattern analysis (MVPA) have shown that the fMRI activity of PPA can be decoded and used to distinguish between different scene categories (Naselaris, Prenger, Kay, Oliver, & Gallant,
2009; Park et al.,
2011; Walther, Caddigan, Fei-Fei, & Beck,
2009). One thing to note is that although we showed peripheral vision contributes more to scene recognition than central vision, we did not ignore the fact that central vision (or LOC according to our mapping) may actually have a role in scene recognition. In our visualization experiment (
Figure 9), we showed that the central pathway has more features characteristic of manmade scene categories, and this may be because the particular objects in those categories (such as a desk in a home, and vegetables and fruit in the market category) are important to distinguish them from other categories. In fact, since LOC is specialized in representing object shapes and object categories (Grill-Spector, Kushnir, Edelman, Itzchak, & Malach,
1998), it stands to reason that LOC should be encoding the content of a scene when there are objects presented in the scene. In fact, the pattern of neural responses in the LOC has also been shown to differentiate among scene categories (Park et al.,
2011; Walther et al.,
2009) and decode whether certain objects were presented within the scenes (Peelen, Fei-Fei, & Kastner,
2009).