June 2007
Volume 7, Issue 9
Vision Sciences Society Annual Meeting Abstract  |   June 2007
Learning invariant and variant components of time-varying natural images
Author Affiliations
  • Bruno Olshausen
    Helen Wills Neuroscience Insitute and School of Optometry, UC, Berkeley
  • Charles Cadieu
    Helen Wills Neuroscience Insitute and School of Optometry, UC, Berkeley
Journal of Vision June 2007, Vol.7, 964. doi:10.1167/7.9.964
  • Views
  • Share
  • Tools
    • Alerts
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Bruno Olshausen, Charles Cadieu; Learning invariant and variant components of time-varying natural images. Journal of Vision 2007;7(9):964. doi: 10.1167/7.9.964.

      Download citation file:

      © ARVO (1962-2015); The Authors (2016-present)

  • Supplements

A remarkable property of biological visual systems is their ability to infer and represent invariances in the visual environment. This information is important for determining ‘what’ we are seeing- i.e. recognizing objects and interpreting scenes. However, such a representation only addresses half of the story: the variant part, such as the motion of an object, captures the ‘where’ or ‘how’ information which is equally important for interpreting and interacting with the environment. Therefore, a complete visual representation should capture both the invariant and variant parts of images. Here we present a model that learns to separate the variant from the invariant part of time varying natural images. First, we reformulate the sparse coding model [Olshausen and Field, 1996] so that images are explained in terms of a multiplicative interaction between two sets of causal variables. One set of variables is constrained to change slowly over time (the invariant representation), and another set of variables is allowed to change quickly over time and is encoded as a phase angle (the variant representation). After training on natural image sequences, the learned basis functions are similar to those produced by the original sparse coding model — i.e., a set of Gabor-like functions that are spatially localized, oriented and bandpass. In this case, though, the multiplicative decomposition produces both invariant components with slowly changing responses, representing aspects of visual shape, and variant components in the form of phase angles precessing over time, representing their transformations. The model predicts the existence of two classes of cells in primary visual cortex that form the beginnings of a ‘what’ and ‘where’ representation of images. Moreover, the decomposition provided by this model paves the way toward the construction of hierarchical models for capturing more global aspects of the ‘what’ and ‘where’ structure in natural images.

Olshausen, B. Cadieu, C. (2007). Learning invariant and variant components of time-varying natural images [Abstract]. Journal of Vision, 7(9):964, 964a, http://journalofvision.org/7/9/964/, doi:10.1167/7.9.964. [CrossRef]
 NGA grant MCA 015894-UCB, NSF grant IIS-06-25223

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.