September 2024
Volume 24, Issue 10
Open Access
Vision Sciences Society Annual Meeting Abstract  |   September 2024
Error consistency between humans and machines as a function of presentation duration
Author Affiliations & Notes
  • Thomas Klein
    Universität Tübingen
    Max Planck Institute for Intelligent Systems, Tübingen, Germany
  • Wieland Brendel
    Max Planck Institute for Intelligent Systems, Tübingen, Germany
  • Felix Wichmann
    Universität Tübingen
  • Footnotes
    Acknowledgements  Funded by the German Research Foundation (DFG) under Emmy Noether grant BR 6382/1-1. Supported by EXC grant 2064/1, project 390727645 and SFB 1233, project 276693517. TK would like to thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for support.
Journal of Vision September 2024, Vol.24, 1172. doi:https://doi.org/10.1167/jov.24.10.1172
Citation: Thomas Klein, Wieland Brendel, Felix Wichmann; Error consistency between humans and machines as a function of presentation duration. Journal of Vision 2024;24(10):1172. https://doi.org/10.1167/jov.24.10.1172.

© ARVO (1962-2015); The Authors (2016-present)
Abstract

Within the last decade, Artificial Neural Networks (ANNs) have emerged as powerful computer vision systems that match or exceed human performance on some benchmark tasks such as image classification. But whether current ANNs are suitable computational models of the human visual system remains an open question: While ANNs have proven capable of predicting neural activations in primate visual cortex, psychophysical experiments show behavioral differences between ANNs and human subjects, as quantified by error consistency. Error consistency is typically measured by briefly presenting natural or corrupted images to human subjects and asking them to perform an n-way classification task under time pressure. But for how long should stimuli ideally be presented to guarantee a fair comparison with ANNs? Here we investigate the role of presentation time and find that it strongly affects error consistency. We systematically vary presentation times from 8.3 ms to more than 1000 ms, followed by a noise mask, and measure human performance and reaction times on natural, lowpass-filtered and noisy images. Our experiment constitutes a fine-grained analysis of human image classification under both image corruption and time pressure, showing that even drastically time-constrained humans who are exposed to the stimuli for only a single frame, i.e., 8.3 ms, can still solve our 8-way classification task with success rates above chance. Importantly, the shift and slope of the psychometric function relating recognition accuracy to presentation time also depend on the type of corruption. In addition, we find that error consistency depends systematically on presentation time. These findings have two implications. First, they raise the question of how to properly set presentation time in human-machine comparisons. Second, the differential benefit of longer presentation times depending on image corruption is consistent with the notion that recurrent processing plays a role in human object recognition, at least for images that are difficult to recognize.
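
The abstract does not spell out how error consistency is computed. A common definition, assumed here rather than taken from the abstract, is Cohen's kappa applied to trial-by-trial correctness: the observed fraction of trials on which two observers are both right or both wrong, corrected for the overlap expected from their accuracies alone. The following minimal sketch illustrates that definition. The function name `error_consistency` and the toy response vectors are hypothetical and are not the authors' code or data.

```python
from typing import Sequence


def error_consistency(correct_a: Sequence[bool], correct_b: Sequence[bool]) -> float:
    """Cohen's kappa on per-trial correctness of two observers.

    Assumed definition (not stated in the abstract):
      c_obs = fraction of trials where both are correct or both are wrong
      c_exp = p_a * p_b + (1 - p_a) * (1 - p_b), with p_a, p_b the accuracies
      kappa = (c_obs - c_exp) / (1 - c_exp)
    """
    n = len(correct_a)
    if n == 0 or n != len(correct_b):
        raise ValueError("Both observers need the same non-zero number of trials.")

    # Accuracy of each observer on the shared set of stimuli.
    p_a = sum(bool(x) for x in correct_a) / n
    p_b = sum(bool(x) for x in correct_b) / n

    # Observed agreement: same outcome (both right or both wrong) on a trial.
    c_obs = sum(bool(a) == bool(b) for a, b in zip(correct_a, correct_b)) / n

    # Agreement expected if the two observers erred independently.
    c_exp = p_a * p_b + (1 - p_a) * (1 - p_b)
    if c_exp == 1.0:
        raise ValueError("Kappa is undefined when expected agreement equals 1.")

    return (c_obs - c_exp) / (1 - c_exp)


# Toy data (hypothetical): correctness of a human and a model on four images.
human = [True, True, False, False]
model = [True, False, True, False]
kappa_independent = error_consistency(human, model)
kappa_identical = error_consistency(human, human)
```

Under this definition, a value of 0 means the two observers overlap in their errors no more than their accuracies alone would predict, 1 means they err on exactly the same images, and negative values mean they systematically err on different images. Because the expected agreement depends on both accuracies, any manipulation that changes human accuracy, such as presentation time, also changes the baseline against which consistency is judged.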
