Human perception and behavior are often driven by the integration of information from multiple sensory inputs. This tendency for integration is so strong that conflicting signals arriving from different sensory modalities often lead to perceptual illusions in the observer (Bertelson & Radeau,
1981; McGurk & MacDonald,
1976; Shams, Kamitani, & Shimojo,
2000). Recent neuroimaging studies have provided mounting evidence that some of these multisensory interactions express in brain areas classically considered unimodal (Driver & Noesselt,
2008; Kayser, Petkov, Augath, & Logothetis,
2005; Lakatos, Chen, O'Connell, Mills, & Schroeder,
2007), in addition to the well-known convergence of sensory information in heteromodal brain areas (subcortical: Meredith & Stein,
1983; Nagy, Eördegh, Paróczy, Márkus, & Benedek,
2006; Wallace, Meredith, & Stein,
1998; or cortical: Barbas et al.,
2005; Beauchamp, Lee, Argall, & Martin,
2004; Cohen & Andersen,
2004; Helbig et al.,
2012; Werner & Noppeney,
2010). Indeed, the cross-modal activation of sensory cortices associated with the perception of cross-modal illusions (Mishra, Martinez, Sejnowski, & Hillyard,
2007; Watkins, Shams, Tanaka, Haynes, & Rees,
2006; for a review, Driver & Noesselt,
2008) suggests that illusory and nonillusory percepts experienced in a given sensory modality might be supported by similar neural underpinnings. In the present study we addressed whether sound-induced visual illusions respond to physical stimulus manipulations in a manner similar to real (i.e., nonillusory) visual percepts. For this purpose we utilized one well-known case of auditory driving, named the double flash illusion (DFI; Shams,
2002; Shams et al.,
2000; see also Shipley,
1964). In this illusion, a single visual flash accompanied by two auditory beeps presented in rapid sequence is perceived as two flashes. That is, due to the auditory stimulus, two flashes are seen when physically only one is present. According to neuroimaging studies, the DFI involves the activity of subcortical convergence structures like the superior colliculus (SC), cortical association areas like the superior temporal sulcus (STS; Watkins et al.,
2006), and, critically, sensory cortical areas like the primary visual cortex (V1; Mishra et al.,
2007). Thus, one might expect that if primary visual cortex is causally involved in cross-modal induced visual illusions like the DFI, then the DFI should be sensitive to the stimulus manipulations that are known to modulate primary visual cortex. In the present study we focus on the temporal limitations of the perception of fast visual events, which strongly depend on processing in early sensory visual areas (Jiang, Zhou, & He,
2007; Mullen, Thompson, & Hess,
2010; Rager & Singer,
1998; Ress & Heeger,
2003; Simonson & Brožek,
1952). Specifically we assess whether the visual perceptual limitations induced by luminance contrast and chromaticity manipulations also apply to the perception of illusory events arising from cross-modal interactions. It is well known that luminance contrast affects visual temporal resolution, that is, the ability to perceptually resolve two flashes presented in a rapid sequence. For instance, increasing contrast leads to an improvement in the temporal resolution for flicker detection (De Lange,
1958; Tyler & Hamer,
1990). Similarly, additional chromatic information also leads to improvements in visual temporal resolution compared to same luminance contrast achromatic stimuli (Pokorny & Smith,
1997; Sun, Pokorny, & Smith,
2001; Swanson, Ueno, Smith, & Pokorny,
1987). Thus, we capitalize on these prior findings to evaluate if the ability to resolve two real flashes corresponds with the perception of two illusory flashes across manipulations in luminance and chromaticity.