Our decision to use faces as the image category in this frequency-tagging approach applied to high-level vision was motivated by several factors. Faces are visually homogenous and almost symmetrical, making them ideal to study the interaction between their left and right halves. They are highly salient and familiar stimuli of our visual environment, and their perception is associated with large and at least partly specific neural responses (e.g., Desimone et al.,
1984; Sergent et al.,
1992; Allison, Puce, Spencer, & McCarthy,
1999; Haxby, Hoffman, & Gobbini,
2000; Tsao et al.,
2006; Weiner & Grill-Spector,
2013 for a review). As explained in the
Introduction, because their parts are meaningful and can activate other parts through completion, faces also pose particularly difficult challenges to the study of part/whole relationships. Given this, one may wonder whether our approach and findings are specific to faces or could be extended to the perceptual integration of other classes of natural images. On the one hand, there are no reasons why the approach could not be extended to study perceptual integration of other classes of images. However, since we did not test other object stimuli, we cannot and do not claim that our findings are specific to faces. On the other hand, there are at least two aspects of our data that point to face-specific mechanisms. First, our IM responses were strongly right lateralized and observed specifically over the same electrode sites where face-sensitive electrophysiological responses are recorded (the “N170” component, Bentin et al.,
1996; Rossion & Jacques,
2011 for a review). This right occipito-temporal activation has been specifically associated with holistic perception of faces (e.g., Sergent,
1988; Hillger & Koenig,
1991; Rossion, Dricot, Goebel, & Busigny,
2011; Caharel, Leleu, Bernard, Lolande, & Rebai,
in press). Second, if the stimulus was merely a 2-D circle, introducing a gap between its left and right halves might not reduce IM responses significantly, because the visual system could “fill the gap,” thanks to contour closure (Wertheimer,
1923; Wagemans et al.,
2012). However, the introduction of a gap between face halves in our study does only break a face in two halves; it also changes substantially the width/height ratio of the face, so that it may not fit a holistic face template anymore. Moreover, the introduction of a vertical gap breaks the face-specific local configuration of internal diagnostic parts (nose, mouth, combination of the two eyes and eyebrows, etc.), possibly contributing to the reduction of the IM responses in this condition. Finally, inversion of the whole face reduced significantly the IM components while leaving the part-based responses intact, an observation offering direct support to the view that inversion reduces holistic/configural face perception (Sergent,
1984; Young et al.,
1987; Rhodes, Brake, & Atkinson,
1993; Tanaka & Farah,
1993; Rossion,
2008; Sekunova & Barton,
2008; Van Belle, de Graef, Verfaillie, Rossion, & Lefevre,
2010; Rossion,
2013). Given that the decrease of behavioral performance for inverted relative to upright stimuli is a well-known marker of face-specific processes (Yin,
1969), future studies may also find that this electrophysiological effect is specific to faces.