These studies, along with others investigating sensorimotor prediction of voluntary actions and their consequences, have mostly focused on one sensory modality at a time. In general, sensory suppression has been observed for somatosensory, auditory, and visual domains, both on a behavioral (Bays, Flanagan, & Wolpert,
2006; Cardoso-Leite, Mamassian, Schutz-Bosbach, & Waszak,
2010; Roussel et al.,
2014; Sato,
2008) as well as at a neural level (Blakemore, Frith, & Wolpert,
2001; Blakemore, Wolpert, et al.,
1998; Desantis, Roussel, & Waszak,
2014; Lange,
2011; Leube et al.,
2003; Shergill et al.,
2013; Waszak, Cardoso-Leite, & Hughes,
2012). However, our actions in the real world often lead to multiple sensory consequences; i.e., when we knock on a door, we would get visual, auditory, and somatosensory information simultaneously. Moreover, each of these modalities might be affected by the presence of the other. Research on cross-modal processing points to the facilitatory effect of a stimulus modality on another modality when they are spatially and/or temporally in synchrony (Arabzadeh, Clifford, & Harris,
2008; Bresciani, Dammeier, & Ernst,
2008; Diederich & Colonius,
2004; McDonald, Teder-Sälejärvi, & Hillyard,
2000; Teder-Sälejärvi, Di Russo, McDonald, & Hillyard,
2005). On the other hand, there is evidence that a specific modality can have an inhibitory effect on another modality (Colavita,
1974; Kawashima, O'Sullivan, & Roland,
1995; Meredith,
2002; Morein-Zamir, Soto-Faraco, & Kingstone,
2003; Sinnett, Soto-Faraco, & Spence,
2008; Wang et al.,
2012). These context-dependent cross-modal interactions might begin at relatively early stages of information processing in regions known as modality-specific (Calvert, Campbell, & Brammer,
2000; Laurienti et al.,
2002; Macaluso,
2006; Macaluso, Frith, & Driver,
2000; Shimojo & Shams,
2001). In the case of action-related effects, studies focusing on the role of cross-modal processing of self-generated movement consequences suggest facilitation of behavioral performance with the inclusion of an additional modality (Desantis & Haggard,
2016a; Desantis, Mamassian, Lisi, & Waszak,
2014; Farrer, Valentin, & Hup,
2013; Kawabe, Roseboom, & Nishida,
2013; van Kemenade, Arikan, Kircher, & Straube,
2016). With respect to neural correlates, our group has previously shown that perceiving single auditory or visual (unimodal) and audiovisual (bimodal) consequences of voluntary button presses lead to reduced BOLD activity in somatosensory, visual, and auditory cortices for self-generated stimuli compared with passive viewing of these stimuli. In addition, participants were more sensitive to delays between their button press and the sensory consequence of the button press in the bimodal condition, suggesting that the additional modality enhanced delay detection for self-generated movements and their consequences (Straube et al.,
2017).