June 2006
Volume 6, Issue 6
Vision Sciences Society Annual Meeting Abstract  |   June 2006
Audio-visual statistical learning
Author Affiliations
  • Ladan Shams
    Department of Psychology, University of California, Los Angeles
  • Aaron Seitz
    Department of Psychology, Boston University
  • Virginie van Wassenhove
    Department of Psychology, University of California, Los Angeles, and Division of Biology, California Institute of Technology
Journal of Vision June 2006, Vol.6, 152. doi:https://doi.org/10.1167/6.6.152
© ARVO (1962-2015); The Authors (2016-present)

Statistical learning theories suggest that people implicitly learn arbitrary stimulus-stimulus associations based solely on the statistics of inter-stimulus contingencies (Fiser and Aslin, 2001). Various studies have supported such learning in the visual, auditory and somatosensory modalities. To date, cross-modal statistical learning has not been investigated. Here, we present results of a novel audio-visual statistical learning procedure in which participants are passively exposed to arbitrary audio-visual pairings (composed of artificial, synthetic auditory and visual stimuli). Following this exposure period, participants' familiarity with the experienced audio-visual pairings is evaluated against novel audio-visual combinations drawn from the same stimulus set. The results of this comparison demonstrate the existence of audio-visual statistical learning.
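The exposure-then-test design described above can be illustrated with a minimal sketch. It is not the authors' procedure: the number of stimuli, the number of repetitions, the labels `A0`, `V0`, and all function names are hypothetical, chosen only to show how familiar pairings and novel combinations can be drawn from the same stimulus set.

```python
import random


def make_pairings(n_pairs, rng):
    """Arbitrarily pair each auditory stimulus with one visual stimulus.

    The labels A0..An and V0..Vn are placeholders (assumption), standing in
    for synthetic sounds and shapes.
    """
    sounds = [f"A{i}" for i in range(n_pairs)]
    shapes = [f"V{i}" for i in range(n_pairs)]
    rng.shuffle(shapes)
    return list(zip(sounds, shapes))


def exposure_stream(pairings, repetitions, rng):
    """Passive exposure: every pairing shown `repetitions` times, in random order."""
    stream = pairings * repetitions
    rng.shuffle(stream)
    return stream


def novel_combinations(pairings):
    """Sound-shape combinations from the same stimulus set that were never exposed."""
    exposed = set(pairings)
    sounds = [a for a, _ in pairings]
    shapes = [v for _, v in pairings]
    return [(a, v) for a in sounds for v in shapes if (a, v) not in exposed]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sketch is reproducible
    pairings = make_pairings(6, rng)            # 6 pairs is an assumed value
    stream = exposure_stream(pairings, 10, rng)  # 10 repetitions is an assumed value
    foils = novel_combinations(pairings)
    print("familiar test items:", pairings)
    print("novel test items:", len(foils))
```

Because the novel combinations reuse the same sounds and shapes, each individual stimulus is equally familiar in both item types; only the pairing statistics differ, which is what a familiarity judgment would have to rely on.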

Additionally, we investigated whether audio-visual associations with an appropriate “gestalt” were learned more robustly than those with less appropriate relationships. We used a procedure in which visual objects disappeared into a ‘visual abyss’ and reappeared as new objects (Fiser and Aslin, 2002). During each disappearance-to-reappearance interval, an upward or downward frequency-modulated sound was played. The Gestalt+ condition consisted of downward frequency sweeps paired with disappearances and upward frequency sweeps paired with appearances. The opposite combinations were assigned to the Gestalt- condition. Subjects showed greater familiarity with Gestalt+ than with Gestalt- audio-visual pairs.

Our results suggest that audio-visual statistical learning occurs naturally for spatio-temporally coincident audio-visual stimuli, even in the absence of a task or explicit attentional engagement. More importantly, the degree of learning depends partially on appropriate gestalt relationships between the multisensory events.

Shams, L., Seitz, A., & van Wassenhove, V. (2006). Audio-visual statistical learning [Abstract]. Journal of Vision, 6(6):152, 152a, http://journalofvision.org/6/6/152/, doi:10.1167/6.6.152.
