December 2022
Volume 22, Issue 14
Open Access
Vision Sciences Society Annual Meeting Abstract
Mechanisms of human dynamic visual perception revealed by sequential deep neural networks
Author Affiliations & Notes
  • Lynn K. A. Sörensen
    University of Amsterdam
  • Sander M. Bohté
    Centrum Wiskunde & Informatica
  • Heleen A. Slagter
    Vrije Universiteit Amsterdam
  • H. Steven Scholte
    University of Amsterdam
  • Footnotes
    Acknowledgements: This work was funded by a Research Talent Grant (406.17.554) from the Dutch Research Council (NWO) awarded to all authors.
Journal of Vision December 2022, Vol.22, 3590. doi:https://doi.org/10.1167/jov.22.14.3590
Abstract

Our visual world and its perception are dynamic. Rapid serial visual presentation (RSVP), a task in which observers see rapid sequences of natural scenes, is an example of such dynamic sequential visual stimulation. Remarkably, humans can still recognise scenes when images are shown as briefly as 13 ms/image. This feat has been attributed to the computational power of the first feedforward sweep in sensory processing. In contrast, longer presentation durations (linked to better performance) have been suggested to increasingly engage recurrent processing. Yet the computational mechanisms governing human sequential object recognition remain poorly understood. Here, we developed a class of deep learning models capable of sequential object recognition. Using these models, we compared different computational mechanisms: feedforward and recurrent processing, single and sequential image processing, and different forms of rapid sensory adaptation. We evaluated how these mechanisms perform on an RSVP task and to what extent they explain human behavioural patterns (N=36) across varying presentation durations (13, 40, 80 ms/image). We found that only models that integrate images sequentially via lateral recurrence captured human performance levels across presentation durations. Such sequential models also displayed a temporal correspondence to single-trial performance: fewer model steps best explained human behaviour at the fastest presentation durations, and more model steps best explained it at the slower ones. Importantly, this temporal correspondence was achieved without reducing the model’s overall explanatory power. Finally, augmenting this sequential model with a power-law adaptation mechanism was essential to provide a plausible account of how neural processing obtains informative representations from the briefest visual stimulation.
Taken together, these results shed new light on how local recurrence and adaptation jointly enable object recognition to be as fast and effective as required by a dynamic visual world.
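
To make the two mechanisms named in the abstract more concrete, the following is a minimal toy sketch of a single layer that integrates a sequence of frames via lateral recurrence and is suppressed by a power-law weighted sum of its own past responses. It is not the authors' model: the function names (`power_law_kernel`, `run_sequence`), the parameters `alpha` and `beta`, the ReLU nonlinearity, and the subtractive form of adaptation are all assumptions chosen for illustration.

```python
import numpy as np


def power_law_kernel(n_steps, alpha=1.0):
    """Weights for past responses: weight at lag k (k = 1..n_steps) is k ** -alpha."""
    lags = np.arange(1, n_steps + 1, dtype=float)
    return lags ** -alpha


def run_sequence(frames, w_in, w_lat, beta=0.5, alpha=1.0):
    """Toy layer with lateral recurrence and power-law adaptation (illustrative only).

    frames: array (T, n_in), one row per presented frame or model step
    w_in:   array (n_in, n_units), feedforward weights
    w_lat:  array (n_units, n_units), lateral (within-layer) recurrent weights
    beta:   assumed strength of the adaptation term
    alpha:  assumed exponent of the power-law decay
    Returns an array (T, n_units) of responses over time.
    """
    n_steps = frames.shape[0]
    n_units = w_in.shape[1]
    kernel = power_law_kernel(n_steps, alpha)
    states = np.zeros((n_steps, n_units))
    prev = np.zeros(n_units)
    for t in range(n_steps):
        # Past responses ordered from most recent (lag 1) to oldest (lag t).
        past = states[:t][::-1]
        # Power-law weighted sum of past responses; zero at the first step.
        suppression = beta * (kernel[:t, None] * past).sum(axis=0)
        # Feedforward drive plus lateral recurrence, minus adaptation.
        drive = frames[t] @ w_in + prev @ w_lat - suppression
        prev = np.maximum(drive, 0.0)  # ReLU nonlinearity (an assumption)
        states[t] = prev
    return states


# Usage: a constant input repeated over three steps, no lateral weights.
frames = np.ones((3, 2))
responses = run_sequence(frames, np.eye(2), np.zeros((2, 2)))
print(responses)
```

In this toy setting the response to a repeated input drops after the first step, which is the qualitative signature of adaptation; how the models in the abstract implement adaptation and recurrence is not specified there.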
