An alternative explanation for the asymmetrical audiovisual integration window may be related to the competition between unimodal and crossmodal processing. Here, unimodal processing corresponds to perceptual grouping based on the law of smooth continuation, and crossmodal processing corresponds to the auditory signal modulating the percept of the continuous motion displays. Previous studies have reported that unimodal processing, such as perceptual grouping in a single modality, overrides crossmodal processing (Kawachi & Gyoba,
2006,
2013; Keetels, Stekelenburg, & Vroomen,
2007; Sanabria, Soto-Faraco, Chan, & Spence,
2005; Watanabe & Shimojo,
2001). The present results suggest that a tone synchronous with the preceding StdSBD coincidence overrides a possible contribution of perceptual grouping cues such as the law of smooth continuation (Grassi & Casco,
2010; Metzger,
1937) that could contribute to the resolution of the subsequent TstSBD. When the tone/StdSBD pairing follows the TstSBD, the perceptual ambiguity in the leading TstSBD could be resolved based on the law of smooth continuation. The presentation of the tone at coincidence in the subsequent StdSBD could trigger a re-examination of the resolution of perceptual ambiguity in the preceding TstSBD, but it would have to occur before the grouping processes are complete. Temporal offsets for which bouncing was promoted (30–60 ms before the subsequent StdSBD and a tone) might be soon enough to override unimodal grouping processes related to the preceding TstSBD before they are complete. This analysis opens up another possible line of investigation in which transients such as tones or visual flashes at or near the point of coincidence, which would suggest bouncing, are pitted against perceptual grouping cues, such as proximity, size, shape, color, and smooth continuation, which could be rendered consistent with streaming. A variety of cue combinations could be investigated to determine the relative strength of organizing cues in a similar approach to Grove and Sakurai (
2009), who showed that auditory-induced bouncing persisted even when the trajectories of the individual disks were spatially displaced, tipping the probability of the resolution towards streaming (see also Grove, Ashton, Kawachi, & Sakurai,
2012 for a similar effect when the targets were rendered distinguishable via texture density differences).