November 2002
Volume 2, Issue 7
Vision Sciences Society Annual Meeting Abstract  |   November 2002
Unsupervised learning of visual structure
Author Affiliations
  • Benjamin P. Hiles
    Cornell University, USA
  • Nathan Intrator
    Brown University, USA
  • Shimon Edelman
    Cornell University, USA
Journal of Vision November 2002, Vol.2, 74. doi:
  • Views
  • Share
  • Tools
    • Alerts
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Benjamin P. Hiles, Nathan Intrator, Shimon Edelman; Unsupervised learning of visual structure. Journal of Vision 2002;2(7):74.

      Download citation file:

      © ARVO (1962-2015); The Authors (2016-present)

  • Supplements

How object and scene structure is represented is an important problem in vision. Considerations of coding efficiency and of systematicity suggest that representations of structurally related objects should share components (which, as our earlier results show, need not be generic, categorical, or mutually exclusive). Can a useful set of object fragments be acquired in an unsupervised fashion? Statistical interdependence criteria such as pairwise conditional probabilities of the fragments, or Barlow's “suspicious coincidence” ratio (the joint probability of two fragments divided by the product of their marginal probabilities), can provide a basis for unsupervised learning. If humans use such criteria in learning structured objects, they would tend to lump together a pair of highly interdependent fragments, perceiving them as a single shape. We tested this hypothesis in two experiments involving a part verification task, in which subjects are known to detect a unitary probe embedded in a larger target faster than a composite one. Altogether, over 90 subjects were tested. As predicted, mere exposure to a set of statistically controlled, structured stimuli (80 and 100 objects in the first and second experiments, respectively) led to a larger speedup for fragment pairs with higher interdependence (conditional probability). This effect was modulated in a complicated manner by the fragment coincidence as measured by Barlow's ratio. Single-fragment probes generally exhibited a smaller speedup than composites. Our study complements and extends recent results by Aslin and others that demonstrate some of the learning strategies used by the brain in dealing with structured stimuli. To elucidate the mechanisms behind this kind of unsupervised learning, we developed a computational model of visual structure acquisition, which accepts the same stimuli seen by the human subjects, and exhibits similar patterns of behavior.

Hiles, B. P., Intrator, N., Edelman, S.(2002). Unsupervised learning of visual structure [Abstract]. Journal of Vision, 2( 7): 74, 74a,, doi:10.1167/2.7.74. [CrossRef]

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.