June 2007
Volume 7, Issue 9
Free
Vision Sciences Society Annual Meeting Abstract  |   June 2007
Unsupervised learning of higher order statistics of visual features: evidence for relational encoding
Author Affiliations
  • Elan Barenholtz
    Brown University Dept. of Cognitive and Linguistic Sciences
  • Michael J. Tarr
    Brown University Dept. of Cognitive and Linguistic Sciences
Journal of Vision June 2007, Vol.7, 798. doi:https://doi.org/10.1167/7.9.798
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Elan Barenholtz, Michael J. Tarr; Unsupervised learning of higher order statistics of visual features: evidence for relational encoding. Journal of Vision 2007;7(9):798. https://doi.org/10.1167/7.9.798.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

A number of important theories of visual recognition assume that objects are represented on the basis of parts and their explicitly defined relations (e.g. Biederman, 1987). Much of the evidence pertaining to the encoding of relations between features (as opposed to encoding of the features themselves) comes from so-called ‘configural effects’, such as the advantage in recognizing one part of a face when other parts are present (Tanaka & Sengko, 1997). However, many of these findings might be explained by invoking ‘larger’ features that incorporate multiple smaller features, e.g. using a single ‘eye-and-nose’ feature, rather than separate ‘eye’ and ‘nose’ features in a particular spatial relationship. The current research aims to disassociate the roles of features and their relations: subjects viewed patterns composed of multiple, distinct polygonal shapes in which the spatial relations between spatially non-contiguous features were controlled, so that specific pairs of features (‘base-pairs) always appeared together, and in the same spatial relation to one another, across multiple patterns. Across three experiments, we tested whether subjects had learned the statistics of the patterns in terms of the joint and conditional probability of the positions of base-pair features. Our results showed that subjects could learn the statistical properties of non-contiguous features while discounting the properties of features located between them. These results are inconsistent with a plausible ‘larger feature’ hypothesis, which would necessarily include the spatially intermediate features, and provide direct support for the explicit encoding of relations between features in unsupervised learning.

References:

I. Biederman. (1987). Recognition by components: A theory of human image understanding. Psychological Review, 94, 115-147.

J. W. Tanaka and J. A. Sengco. (1997). Features and their configuration in face recognition. Memory & Cognition, 25, 583-592.

Barenholtz, E. Tarr, M. J. (2007). Unsupervised learning of higher order statistics of visual features: evidence for relational encoding [Abstract]. Journal of Vision, 7(9):798, 798a, http://journalofvision.org/7/9/798/, doi:10.1167/7.9.798. [CrossRef]
Footnotes
 Both authors funded by NGA Award #HM1582-04-C-0051.
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×