September 2024
Volume 24, Issue 10
Open Access
Vision Sciences Society Annual Meeting Abstract  |   September 2024
Spatial scrambling in human vision: investigating efficiency for discriminating scrambled letters using convolutional neural networks and confusion matrices
Author Affiliations & Notes
  • Xingqi Raffles Zhu
    McGill University
  • Robert F. Hess
    McGill University
  • Alex S. Baldwin
    McGill University
  • Footnotes
    Acknowledgements  This work has been supported by Réseau de Recherche en Santé de la Vision (RRSV), Fond de Recherche du Québec - Santé (FRQ-S), and Canadian Institutes of Health Research (CIHR).
Journal of Vision September 2024, Vol.24, 1078. doi:https://doi.org/10.1167/jov.24.10.1078
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Xingqi Raffles Zhu, Robert F. Hess, Alex S. Baldwin; Spatial scrambling in human vision: investigating efficiency for discriminating scrambled letters using convolutional neural networks and confusion matrices. Journal of Vision 2024;24(10):1078. https://doi.org/10.1167/jov.24.10.1078.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

One limitation in our ability to discriminate different letters would be any spatial disorganization in the projections between different visual areas. This “scrambling” could be a source of a positional noise limiting human performance. In this study, we explored different forms this scrambling could take. Based on the idea that letter identification is supported by an optimal spatial frequency, we used spatially-bandpass letters. We devised a physiologically-inspired decomposition and resynthesis scheme, to generate letters composed of log Gabor wavelets. The form of these wavelets is similar to that of an oriented “simple cell” receptive field. We then introduced two forms of scrambling. The first was scrambling at the input to the "oriented receptive field" stage (subcortical scrambling of the receptive field). The second was scrambling at the output from that stage (scrambling connections to the higher “cortical” stages). We also performed a bandpass noise control condition. To compare against human performance, we simulated the responses of both a template-matching observer (TMO) and three convolutional neural networks (CNNs). The three CNNs were trained on the letter stimuli to perform each of the three noise conditions. We computed human efficiency relative to CNN performance. We also characterized mistakes using confusion matrices and computed the population stability index (PSI) as a distance measure between mistakes made by human and model observers. We found the CNNs employed distinct strategies for each condition. Human relative efficiency was higher for subcortical than cortical scrambling. In bandpass noise, PSIs for both TMO and CNNs were comparable. For our scrambling conditions however, the PSI of TMO was significantly higher than that of CNNs in all but one comparison. Our results suggest that the human strategy for identifying scrambled letters is better captured by CNNs, which may share more similar strategies for identifying scrambled letters than a simple TMO.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×