Journal of Vision, October 2020, Volume 20, Issue 11
Open Access
Vision Sciences Society Annual Meeting Abstract
Integrating Single-Unit and Pattern Codes in DCNNs Trained for Face Identification
Author Affiliations & Notes
  • Connor J. Parde
    The University of Texas at Dallas
  • Y. Ivette Colon
    The University of Texas at Dallas
  • Matthew Q. Hill
    The University of Texas at Dallas
  • Alice J. O'Toole
    The University of Texas at Dallas
  • Carlos Castillo
    University of Maryland Institute for Advanced Computer Studies
Footnotes
Acknowledgements: Research reported in this publication was supported by the National Eye Institute of the National Institutes of Health under award number R01EY029692.
Journal of Vision, October 2020, Vol. 20, 1462. https://doi.org/10.1167/jov.20.11.1462
Abstract

Historically, studies aimed at understanding neural codes have probed either individual neurons or patterns of neural activation. Here, we integrated these two levels of encoding by investigating individual simulated neurons (i.e., units) and high-level coding patterns in a deep convolutional neural network (DCNN) trained for face identification. These networks simultaneously encode identity, gender, and viewpoint (Parde et al., 2017) and allow for an investigation of representations at multiple scales. First, we measured individual units' capacity to distinguish identities, genders, and viewpoints ("attributes"). Second, we re-expressed face representations as directions in the network's high-dimensional feature space, quantified with principal component analysis (PCA), and measured the capacity of individual principal components (PCs) to distinguish face attributes. Coding capacity in individual units was measured by effect sizes in one-way ANOVAs for distinguishing identity (mean R^2 = 0.71, SD = 0.016), gender (mean R^2 = 0.004, SD = 0.007), and viewpoint (mean R^2 = 0.002, SD = 0.002). Although the effects for gender and viewpoint were small, they were of consistent magnitude across units, and predictions from the ensemble of units were accurate (gender-classification accuracy 92.3%; viewpoint estimated to within 7.8 degrees). All units provided significant identity information, 71% provided gender information, and 50% provided viewpoint information (all p < 0.05, Bonferroni corrected). To investigate how the three attributes are organized in the PCA space, we computed the cosine similarity between each PC and directions diagnostic of identity, gender, and viewpoint separation. This analysis shows that the attributes are separated into subspaces: identity information is encoded along the axes that explain the most variance, followed by gender, and then viewpoint. Combined, these results indicate that the ensemble code that emerges from the DCNN organizes attributes semantically, whereas individual units entangle this information. These units therefore cannot be interpreted as simple visual feature detectors in the traditional sense.
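To make the unit-level analysis concrete, here is a minimal sketch of how per-unit effect sizes of the kind reported above could be computed. It assumes unit activations in a NumPy array (one row per image) and one attribute label per image; the variable names and the synthetic data are illustrative, not taken from the study's code. For a one-way ANOVA, R^2 equals eta-squared, i.e., SS_between / SS_total.

```python
# Per-unit one-way ANOVA effect sizes (eta-squared = R^2), a sketch.
# `activations`: (n_images, n_units) DCNN unit responses (assumed name).
# `labels`: one attribute value per image, e.g., identity or gender.
import numpy as np

def unit_effect_sizes(activations: np.ndarray, labels: np.ndarray) -> np.ndarray:
    """R^2 of each unit for distinguishing the levels of one attribute."""
    grand_mean = activations.mean(axis=0)                    # (n_units,)
    ss_total = ((activations - grand_mean) ** 2).sum(axis=0)
    ss_between = np.zeros_like(grand_mean)
    for level in np.unique(labels):
        group = activations[labels == level]
        ss_between += len(group) * (group.mean(axis=0) - grand_mean) ** 2
    return ss_between / ss_total                             # R^2 per unit

# Example with synthetic data: 1000 images, 512 units, 50 identities.
rng = np.random.default_rng(0)
acts = rng.normal(size=(1000, 512))
ids = rng.integers(0, 50, size=1000)
r2 = unit_effect_sizes(acts, ids)
print(f"mean R^2 = {r2.mean():.3f}, SD = {r2.std():.3f}")
```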

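Similarly, a hedged sketch of the subspace analysis: re-express the face descriptors with PCA, then measure the cosine similarity between each principal axis and a direction diagnostic of one attribute. Using the difference of class means as the diagnostic direction is an assumption made here for illustration; the study may derive these directions differently.

```python
# PCA subspace analysis, a sketch: how strongly each principal axis
# aligns with an attribute-diagnostic direction (difference of class
# means is an illustrative choice, not confirmed by the abstract).
import numpy as np
from sklearn.decomposition import PCA

def pc_attribute_similarity(descriptors: np.ndarray,
                            binary_labels: np.ndarray) -> np.ndarray:
    """Cosine similarity between each PC and a class-mean-difference axis."""
    pca = PCA().fit(descriptors)
    # Diagnostic direction: unit vector from one class mean to the other.
    direction = (descriptors[binary_labels == 1].mean(axis=0)
                 - descriptors[binary_labels == 0].mean(axis=0))
    direction /= np.linalg.norm(direction)
    # sklearn's PCA components are unit-length rows, so the dot product
    # with a unit vector is the cosine similarity.
    return np.abs(pca.components_ @ direction)

# Example: 2000 synthetic 512-D descriptors with a binary gender label.
rng = np.random.default_rng(1)
desc = rng.normal(size=(2000, 512))
gender = rng.integers(0, 2, size=2000)
sims = pc_attribute_similarity(desc, gender)
print("PC most aligned with the gender axis:", int(sims.argmax()))
```

Under the ordering reported in the abstract, identity-diagnostic directions would align with low-index (high-variance) PCs, with gender and viewpoint aligning with progressively later axes.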