September 2018
Volume 18, Issue 10
Open Access
Vision Sciences Society Annual Meeting Abstract  |   September 2018
Nonlinear visual mechanisms for 2D shape discrimination with pose uncertainty
Author Affiliations
  • Ingo Fruend
    Centre for Vision Research, York University, Toronto, ON, Canada
  • John Wilder
    University of Toronto, Toronto, ON, Canada
  • James Elder
    Centre for Vision Research, York University, Toronto, ON, Canada
Journal of Vision September 2018, Vol.18, 420. doi:
      Ingo Fruend, John Wilder, James Elder; Nonlinear visual mechanisms for 2D shape discrimination with pose uncertainty. Journal of Vision 2018;18(10):420. doi:

      © ARVO (1962-2015); The Authors (2016-present)

Humans are very good at recognizing objects from just their 2D outlines. Previous work has modeled discrimination as correlation, using linear systems identification methods to identify internal shape templates (Kurki et al., 2014, JOV; Wilder et al., 2015, Perception). However, Wilder et al. also noted evidence for nonlinearities in human shape discrimination that are not accounted for by the linear correlation model. One function of these nonlinearities may be to extract high-frequency shape information despite pose uncertainty. To test this hypothesis, we reconsider the experiment conducted by Wilder et al. in which human observers discriminated between shapes corrupted with additive Gaussian coordinate noise. A linear model that assumes no internal pose uncertainty can be estimated from these data and explains 56% of the variance of human responses. If the linear model is forced to account for a large degree of internal pose uncertainty (up to 40% in-plane rotation), explained variance drops to 36% and is only marginally better than chance. However, a deep neural network (DNN) model trained on the human responses completely recovers this lost variance. By analyzing the gradient of the DNN output with respect to the input, we show that the DNN model achieves this by undoing these random internal pose variations to yield a shape representation that is roughly pose invariant. Most importantly, these gradients also show a sensitivity to higher shape frequencies that is not revealed by linear systems identification methods. A DNN model reveals nonlinearities in human shape discrimination. These nonlinearities allow higher shape frequencies to be used for shape discrimination despite substantial amounts of internal pose uncertainty.
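
The contrast between a linear template and a pose-tolerant nonlinear model, and the idea of reading a model's gradient with respect to its input as a locally effective template, can be illustrated with a toy sketch. This is not the DNN or the data from the study. Everything in it is an assumption made for illustration: the 8-point contour, the 40 degree test rotation, the use of complex numbers for 2D coordinates, the modulus nonlinearity, and the function names `linear_response`, `invariant_response`, `rotate`, and `input_gradient`.

```python
import cmath
import math


def linear_response(template, shape):
    """Linear correlation of a shape with a fixed template.
    Contour points are complex numbers x + iy (an illustrative encoding)."""
    return sum((t.conjugate() * z).real for t, z in zip(template, shape))


def invariant_response(template, shape):
    """Toy nonlinear response: modulus of the complex correlation.
    An in-plane rotation multiplies the correlation by a unit complex
    number, so the modulus does not change."""
    return abs(sum(t.conjugate() * z for t, z in zip(template, shape)))


def rotate(shape, degrees):
    """Rotate all contour points about the origin."""
    r = cmath.exp(1j * math.radians(degrees))
    return [r * z for z in shape]


def input_gradient(model, template, shape, eps=1e-6):
    """Central finite-difference gradient of the model output with respect
    to each input coordinate, ordered x0, y0, x1, y1, ..."""
    grad = []
    for k in range(len(shape)):
        for step in (eps, 1j * eps):  # perturb x, then y
            plus = list(shape)
            minus = list(shape)
            plus[k] += step
            minus[k] -= step
            grad.append((model(template, plus) - model(template, minus)) / (2 * eps))
    return grad


# Hypothetical 8-point contour: a circle with a three-lobed radial modulation.
n = 8
template = [
    cmath.exp(2j * math.pi * k / n) * (1 + 0.3 * math.cos(3 * 2 * math.pi * k / n))
    for k in range(n)
]
shape = list(template)       # noise-free instance of the template shape
rotated = rotate(shape, 40)  # same shape under an assumed 40 degree pose change

# The linear response shrinks with rotation; the nonlinear one does not.
print(linear_response(template, shape), linear_response(template, rotated))
print(invariant_response(template, shape), invariant_response(template, rotated))

# Input gradients: constant for the linear model, pose-dependent otherwise.
grad_linear = input_gradient(linear_response, template, rotated)
grad_invariant = input_gradient(invariant_response, template, rotated)
```

In this sketch the gradient of the linear model is the template itself for every input, whereas the gradient of the modulus model equals the template rotated into the pose of the input. That is one simple way in which a nonlinearity can absorb pose variation that a single fixed template cannot; the mechanisms learned by the DNN in the study may differ.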

Meeting abstract presented at VSS 2018

