June 2007
Volume 7, Issue 9
Free
Vision Sciences Society Annual Meeting Abstract  |   June 2007
Some tests of the standard model
Author Affiliations
  • Kenneth Hayworth
    University of Southern California
  • Xiaomin Yue
    University of Southern California
  • Irving Biederman
    University of Southern California
Journal of Vision June 2007, Vol.7, 924. doi:https://doi.org/10.1167/7.9.924
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Kenneth Hayworth, Xiaomin Yue, Irving Biederman; Some tests of the standard model. Journal of Vision 2007;7(9):924. https://doi.org/10.1167/7.9.924.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

HMAX (Serre et al. 2005), a model of processing in the primate visual cortex, has been referred to (by its authors) as the “standard model.” HMAX extends a classical Gabor-filter model of V1 by interleaving layers performing spatial pooling (to achieve invariance) with layers computing feature conjunctions, some learned, to achieve more complex features.

From object line drawings we produced local feature deleted (LFD) complementary pairs (A, B) by deleting every other vertex from one image and the alternating vertices from the other. We scrambled the contour fragments of A (by translation only) generating A_SCR, then conducted match-to-sample trials (Is A more similar to B or A_SCR?). Subjects invariably chose B. HMAX chose A_SCR in 95% of trials. With learned features, HMAX performed close to chance. In a separate test, we created match-to-sample trials where the target depicted Object1, the first test image also depicted Object1 but with complementary vertices, the second test image depicted Object2 but matched in local vertex content to the target. HMAX (with and without learned features) matched Object2 to Object1—exactly opposite to what humans did.

HMAX fails on these tests because it perceives an object only as a list of features. Parts-based structural descriptions (SD) can explain these results because LFD complements contain information sufficient for the same parts to be extracted. Recent single unit studies in IT are supportive of SDs. Yamane et al. (2006) reported IT neurons that were tuned to individual parts and relations between parts. We describe ways to revise feature hierarchy models (like HMAX) to achieve a model of human performance that more closely accommodates both our own psychophysical experiments as well as the neural data.

Hayworth, K. Yue, X. Biederman, I. (2007). Some tests of the standard model [Abstract]. Journal of Vision, 7(9):924, 924a, http://journalofvision.org/7/9/924/, doi:10.1167/7.9.924. [CrossRef]
×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×