December 2022
Volume 22, Issue 14
Open Access
Vision Sciences Society Annual Meeting Abstract  |   December 2022
Distributed population activity in the macaque inferior temporal cortex but not current deep neural networks predict the ponzo illusion
Author Affiliations & Notes
  • Vivian C. Paulun
    Massachusetts Institute of Technology
  • Kristine Zheng
    Massachusetts Institute of Technology
  • Kohitij Kar
    Massachusetts Institute of Technology
  • Footnotes
    Acknowledgements  This work was supported by the German Research Foundation (PA 3723/1-1), the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216 and Simons Foundation grant SCGB-542965 (James J DiCarlo). We thank the DiCarlo and Kanwisher Labs for helpful discussions.
Journal of Vision December 2022, Vol.22, 3354. doi:https://doi.org/10.1167/jov.22.14.3354
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Vivian C. Paulun, Kristine Zheng, Kohitij Kar; Distributed population activity in the macaque inferior temporal cortex but not current deep neural networks predict the ponzo illusion. Journal of Vision 2022;22(14):3354. https://doi.org/10.1167/jov.22.14.3354.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

Primates must accurately estimate the size of objects in their environment to interact with them efficiently. Hong, Yamins, Majaj, and DiCarlo (2016) reported that one could accurately approximate an object’s size within an image from the population activity across the macaque inferior temporal (IT) cortex upon brief (100 ms) image presentations. These neural predictions were consistent with human behavioral estimates of object size within the same images—suggesting a linear IT readout model as the leading neural decoding hypothesis for object size estimation in primates. However, perceived and image-based (i.e., retinal) object sizes were highly correlated in the Hong et al. (2016) study. Notably, two objects with identical retinal sizes may be perceived to differ in size when embedded at different locations along a linear perspective (Ponzo illusion). Therefore, this size illusion allows us to perform a stronger test and assess whether the IT-based linear readout model predicts the perceived or retinal size. We created a set of image pairs by placing objects "near" or "far" with respect to a linear perspective background. We performed large-scale neural recordings (2 Utah arrays; n=192 sites) across the macaque IT cortex while the monkey fixated the images for 100 ms. Extending the results of Hong et al. (2016), we observed that approximations of object sizes from the IT responses (~190-205 ms) showed a significant bias (“far”-”near”; Δ=22%; p<0.001) that is qualitatively similar to that measured behaviorally in humans. Interestingly, however, most deep convolutional neural network (DCNN) models that so far best approximate the primate IT responses failed to demonstrate such a bias. Together, our results provide further support for the linear IT readout model of object size perception while exposing a significant explanatory gap in current DCNNs as models of primate vision.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×