At the cortical level two areas seem particularly interesting: pSTS and IPS. Both have been implicated in the analysis of biological motion and show interesting multisensory properties (Bonda, Petrides, Ostry, & Evans,
1996; Brooks et al.,
2007; Bruce, Desimone, & Gross,
1981; Grossman & Blake,
2002; Howard et al.,
1996; Servos, Osu, Santi, & Kawato,
2002; Vaina, Solomon, Chowdhury, Sinha, & Belliveau,
2001). STS shows particularly interesting complex audiovisual interactions: for example it responds to a stimulus such as an object striking a surface, but neither the image nor the sound of the event alone will elicit a response; nor will the simultaneous presentation of simple stimuli like a flash and click (Bruce et al.,
1981). These kinds of patterns—purposeful audiovisual stimuli—seem to have similar characteristics to the tap dance routines. However, the cortical structures most likely to be implicated in the recognition of tap dance patterns are those population of cross-modal neurons specifically tuned for perception (and/or production) of specific action patterns: audiovisual mirror neurons. As mentioned in the
Introduction section, mirror neurons sensitive to visual as well as auditory patterns of a specific action have been localized in the rostral ventral premotor cortex of monkeys (Keysers et al.,
2003; Kohler et al.,
2002). These neurons seem to be implicated in the recognition and interpretation of actions by matching the auditory and visual patterns of external actions onto subject's internal motor coordinates. The response profiles of these neurons are so strictly tuned to specific action patterns that from their firing rate it is possible to discriminate among different actions with an error rate lower than 5% (Keysers et al.,
2003). Together, all these evidences support the hypothesis that audiovisual mirror neurons play a central role in recognition of actions implicating visual and auditory patterns, such as tap dance. As evidence about the existence of cross-modal mirror neurons have been collected also in humans (Gazzola, Aziz-Zadeh, & Keysers,
2006) these cortical mechanisms could underlie the integration of auditory and visual information in the recognition of human actions, and thus play a key role in the perception of audiovisual motion profiles as tap dance routines. However, whatever the neural mechanisms involved, these results provide further demonstration of the flexibility and versatility of the human perceptual system in optimizing its performance.