May 2008
Volume 8, Issue 6
Vision Sciences Society Annual Meeting Abstract  |   May 2008
Search for arbitrary objects in natural scenes is remarkably efficient
Author Affiliations
  • Jeremy Wolfe
    Brigham and Women's Hospital, Boston, MA, and Harvard Medical School, Boston, MA
  • George Alvarez
    MIT, Cambridge, MA
  • Ruth Rosenholtz
    MIT, Cambridge, MA
  • Aude Oliva
    MIT, Cambridge, MA
  • Antonio Torralba
    MIT, Cambridge, MA
  • Yoana Kuzmova
    Brigham and Women's Hospital, Boston, MA
  • Max Uhlenhuth
    duPont Manual High School, Louisville, KY
Journal of Vision May 2008, Vol.8, 1103. doi:
  • Views
  • Share
  • Tools
    • Alerts
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Jeremy Wolfe, George Alvarez, Ruth Rosenholtz, Aude Oliva, Antonio Torralba, Yoana Kuzmova, Max Uhlenhuth; Search for arbitrary objects in natural scenes is remarkably efficient. Journal of Vision 2008;8(6):1103.

      Download citation file:

      © ARVO (1962-2015); The Authors (2016-present)

  • Supplements

In visual search tasks, the time required to find targets (reaction time - RT) is a function of the number of items in the display (set size). Targets can be found efficiently if they can be uniquely defined by the presence one of a limited set of features. Thus, for example, in search for red targets among blue distractors, the slope of the RT x set size function will be close to zero. Other tasks (e.g. search for a letter among various distracting letters) will be inefficient even if the items can be resolved and identified without eye movements. This holds for artificial tasks typically used in laboratory search experiments. What about searches in the real world where the target is not precisely specified (“Find a bottle.”) and where one's goal changes from search to search (Find the bottle, now the fork, now the bread)? A major obstacle to studying such searches in real scenes has been that it is very hard to specify set size (How many objects are in your field of view right now? Does the keyboard constitute one object or many?) We adopted a brute force method, hand-labeling every object in a set of 100 indoor scenes and using the number of labeled items as a conservative estimate of set size. By this method, we placed scenes into set size bins from 20–30 to 80–90 items. On each trial, twelve observers searched for different targets, drawn at random from the set of labeled items. Targets were present on 50% of trials. Slopes of RT x set size functions averaged 4.6 msec/item for target-present, 4.7 for target-absent trials. Search in these scenes seems to be guided very effectively by something other than the usual attributes like color, orientation, etc. We propose that scene-based properties efficiently guide attention.

Wolfe, J. Alvarez, G. Rosenholtz, R. Oliva, A. Torralba, A. Kuzmova, Y. Uhlenhuth, M. (2008). Search for arbitrary objects in natural scenes is remarkably efficient [Abstract]. Journal of Vision, 8(6):1103, 1103a,, doi:10.1167/8.6.1103. [CrossRef]

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.