Our finding that coarse object structure accounts for categorization is consistent with four lines of evidence. First, coarse structure is sufficient for categorization: removing other details such as color (Delorme et al.,
2000), removing high spatial frequencies (Nandakumar & Malik,
2009), and even reducing objects to silhouettes (Quinn et al.,
2001) all have modest effects on categorization. However, the level of coarseness required may be task-dependent (Collin & McMullen,
2005; Harel & Bentin,
2009; Morrison & Schyns,
2001). Indeed, our results suggest that at least for animal and vehicle categorization, the relevant spatial frequencies are similar but that a finer spatial scale is required for tool categorization. Second, visual processing proceeds in a coarse-to-fine manner (Bar,
2003; Bar et al.,
2006; Bullier,
2001; Frazor, Albrecht, Geisler, & Crane,
2004; Kveraga, Boshyan, & Bar,
2007; Macé, Delorme, Richard, & Fabre-Thorpe,
2010; Macé, Thorpe, & Fabre-Thorpe,
2005; Morrison & Schyns,
2001; Navon,
1977; Sripati & Olson,
2009). This early availability of coarse information is consistent with evidence that animal categorization is extremely fast (Rousselet et al.,
2002; Thorpe, Fize, & Marlot,
1996) and that it involves feedforward processing (Serre, Olivia, & Poggio,
2007; Thorpe et al.,
1996). Third, our results are consistent with evidence that coarse structure influences object representations in visual cortex (Bermudez, Vicente, Romero, Perez, & Gonzalez,
2009; Frazor et al.,
2004; Sripati & Olson,
2009,
2010) and with evidence that neuronal representations in monkey inferotemporal cortex (Kiani, Esteky, Mirpour, & Tanaka,
2007) as well as voxel-based representations in human object-selective cortex (Kriegeskorte et al.,
2008) contain category information. Finally, our results accord with suggestions that categorization may be based on features with intermediate complexity since these features are likely to occur at coarse spatial scales (Delorme, Richard, & Fabre-Thorpe,
2010; Fabre-Thorpe,
2011; Ullman, Vidal-Naquet, & Sali,
2002).