August 2023
Volume 23, Issue 9
Open Access
Vision Sciences Society Annual Meeting Abstract  |   August 2023
Harmonizing the visual strategies of image-computable models with humans yields more performant and interpretable models of primate visual system function.
Author Affiliations & Notes
  • Ivan Felipe Rodriguez
    Brown University
  • Drew Linsley
    Brown University
    Carney instititue for Brain Science
  • Jay Gopal
    Brown University
  • Thomas Fel
    Brown University
    Artificial and Natural Intelligence Toulouse Institute
  • Michael J. Acaro
    Harvard University
  • Saloni Sharma
    Harvard University
  • Margaret Livingstone
    Harvard University
  • Thomas Serre
    Brown University
    Carney instititue for Brain Science
  • Footnotes
    Acknowledgements  ONR (N00014-19-1-2029), NSF (IIS-1912280 and EAR-1925481), DARPA (D19AC00015), NIH/NINDS (R21 NS 112743), and the ANR-3IA ANITI (ANR-19-PI3A-0004). Carney Institute for Brain Science and the Center for Computation and Visualization (CCV). Google TFRC program. NIH S10OD025181.
Journal of Vision August 2023, Vol.23, 5768. doi:https://doi.org/10.1167/jov.23.9.5768
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Ivan Felipe Rodriguez, Drew Linsley, Jay Gopal, Thomas Fel, Michael J. Acaro, Saloni Sharma, Margaret Livingstone, Thomas Serre; Harmonizing the visual strategies of image-computable models with humans yields more performant and interpretable models of primate visual system function.. Journal of Vision 2023;23(9):5768. https://doi.org/10.1167/jov.23.9.5768.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Over the past decade, deep neural networks (DNNs) have been the standard paradigm for modeling biological brains and behavior. While initial reports suggested that the ability of DNNs to model biology correlated with their object classification accuracy (Yamins et al., 2014), this no longer appears to be the case: image-evoked activity in a self-supervised ResNet50 — an architecture introduced seven years ago — has the highest correlation with IT recordings on Brain-Score.com. We recently discovered that DNNs are also becoming progressively less aligned with human perception as their object classification accuracy has increased. This problem however can be resolved through “neural harmonization”: a drop-in training module for DNNs that forces their learned visual strategies to be consistent with those of humans (Fel et al., 2022). DNNs that are trained for object classification and harmonized with behavioral data describing human visual strategies for the same task are more interpretable, performant, and accurate at predicting human behavior. Here, we investigated if harmonizing DNNs with human behavioral data could also yield better models of the primate visual system. To test this, we turned to recordings of primate IT while animals viewed complex natural images (Arcaro et al., 2020). These experiments produced spatially resolved activity maps, which illustrate how neurons respond to every part of an image, thus revealing which features drove neural responses. After fitting a variety of state-of-the-art DNNs trained for object classification to this data, ranging from convolutional neural networks to vision transformers, we discovered that harmonizing these models with human visual strategies significantly improved their predictions of IT neural activity and reproduced qualitative features of neurons’ spatial activity maps that unharmonized models did not. Our findings demonstrate the importance of large-scale human behavioral and psychophysics data for generating more accurate and interpretable models of brain and behavior.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×