September 2005
Volume 5, Issue 8
Vision Sciences Society Annual Meeting Abstract  |   September 2005
Components of bottom-up gaze allocation in natural scenes
Author Affiliations
  • Robert J. Peters
    University of Southern California, Computer Science, and California Institute of Technology, Computation and Neural Systems
  • Asha Iyer
    California Institute of Technology, Computation and Neural Systems
  • Christof Koch
    California Institute of Technology, Computation and Neural Systems
  • Laurent Itti
    University of Southern California, Computer Science
Journal of Vision September 2005, Vol.5, 692. doi:
  • Views
  • Share
  • Tools
    • Alerts
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Robert J. Peters, Asha Iyer, Christof Koch, Laurent Itti; Components of bottom-up gaze allocation in natural scenes. Journal of Vision 2005;5(8):692.

      Download citation file:

      © ARVO (1962-2015); The Authors (2016-present)

  • Supplements

A model of bottom-up visual attention (“baseline salience model”, based on local detectors with coarse global surround inhibition) has been shown (Parkhurst et al., 2002) to account in part for the spatial locations fixated by people while free-viewing complex natural and artificial scenes. Here, we tested the additional roles in bottom-up gaze allocation played by several visual cortical mechanisms. In each case, we added a component to the salience model: non-linear interactions among orientation-tuned units both at short spatial ranges (for clutter reduction) and long ranges (for contour facilitation), and a detailed model of eccentricity-dependent changes in visual processing. Subjects free-viewed naturalistic and artificial images while their eye movements were recorded, and we used a metric called the Normalized Scanpath Salience (NSS) to compare the resulting fixation locations with the different models' predicted salience maps. NSS values indicate, on average, how many standard deviations above or below the mean salience was the model-predicted salience at human-fixated locations. Thus the minimum NSS value (when the model and human behavior are unrelated) is 0; the theoretical maximum NSS value is given by the ability of one observer's fixations to be predicted by the remaining observers' fixations, which in practice fell in the range 1.1—1.3 for different image categories. The baseline salience model predicted fixations at 39—57% of the maximum NSS level. Adding short-range orientation interactions increased this range to 50—65%, contour facilitation further increased it to 53—74%, and eccentricity-dependent processing increased it to 84—95%. Thus the proposed cortical interactions indeed appear to play a significant role in the spatiotemporal deployment of attention in natural scenes. This suggests that bottom-up attentional guidance does not depend solely on local visual features, but must also include the effects of non-local interactions.

Peters, R. J. Iyer, A. Koch, C. Itti, L. (2005). Components of bottom-up gaze allocation in natural scenes [Abstract]. Journal of Vision, 5(8):692, 692a,, doi:10.1167/5.8.692. [CrossRef]

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.