August 2012
Volume 12, Issue 9
Vision Sciences Society Annual Meeting Abstract  |   August 2012
Statistics of three-dimensional natural scene structures
Author Affiliations
  • Weibing Wan
    Brain and Behavior Discovery Institute, Georgia Health Sciences University, Augusta, Georgia, USA
    Department of Automation, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
  • Zhiyong Yang
    Brain and Behavior Discovery Institute, Georgia Health Sciences University, Augusta, Georgia, USA
    Department of Ophthalmology, Georgia Health Sciences University, Augusta, Georgia, USA
Journal of Vision August 2012, Vol.12, 1203. doi:10.1167/12.9.1203
      Weibing Wan, Zhiyong Yang; Statistics of three-dimensional natural scene structures. Journal of Vision 2012;12(9):1203. doi: 10.1167/12.9.1203.

      © ARVO (1962-2015); The Authors (2016-present)

Abstract

Generating a detailed percept of the three-dimensional (3D) scene underlying any visual stimulus is the most important task of natural vision. It has long been suggested that the visual system uses a set of modules to derive 3D information via complex image-based processing. However, these approaches lack a full understanding of the extraordinarily complex statistics of 3D natural scenes. In this work, we acquired a large set of high-resolution 3D natural scenes and examined their statistics. We first sampled a large number of scene patches (~2 degrees of visual angle) from the database and fitted the 3D data in each patch to a concatenation of 8th-order polynomial functions. We then identified all the 3D natural scene patches that had distinctive distributions of ranges (referred to as 3D natural scene structures). Two 3D scene patches were deemed to have the same distribution of ranges if they could be transformed into each other by an affine transform (displacement, rotation, and scaling). The rationale is to remove the variations in ranges due to uniform changes in viewing angles and surface shapes. Finally, we examined the frequencies of occurrence of these structures and their compositional patterns in natural scenes and developed a probabilistic model for each of them. To demonstrate the utility of these 3D natural scene structures, we used them to estimate 3D scenes from 2D images and to categorize 3D natural scenes. Our results showed that accurate 3D vision from a single monocular view is achievable in many situations and that near-human performance can be achieved in categorizing 3D natural scenes with the obtained structures. We thus conclude that the 3D natural scene structures obtained here faithfully capture the extraordinarily complex 3D natural scene statistics in a way that supports a range of tasks of natural vision.
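
The equivalence criterion above (two patches count as the same structure if displacement, rotation, and scaling map one onto the other) can be illustrated with a minimal sketch. The abstract does not say how the comparison was computed, so everything here is an assumption: the function name `similarity_residual` is hypothetical, the patches are assumed to be sampled at corresponding points, and Umeyama's closed-form least-squares alignment is swapped in as one plausible way to solve for the three operations.

```python
import numpy as np


def similarity_residual(a, b):
    """Align point set `a` (N x 3) to `b` (N x 3) with the best-fitting
    displacement, rotation, and uniform scale, then return the RMS
    distance between the aligned points and `b`.

    Assumes row i of `a` corresponds to row i of `b` (an assumption,
    not something stated in the abstract).
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)

    # Remove displacement by centering both point sets.
    mu_a, mu_b = a.mean(axis=0), b.mean(axis=0)
    a0, b0 = a - mu_a, b - mu_b

    # Cross-covariance between the centered sets, then its SVD.
    cov = b0.T @ a0 / len(a)
    u, s, vt = np.linalg.svd(cov)

    # Flip the last axis if needed so the result is a proper rotation
    # rather than a reflection.
    d = np.ones(3)
    if np.linalg.det(u) * np.linalg.det(vt) < 0:
        d[-1] = -1.0
    rot = u @ np.diag(d) @ vt

    # Least-squares uniform scale.
    var_a = (a0 ** 2).sum() / len(a)
    scale = (s * d).sum() / var_a

    aligned = scale * a0 @ rot.T + mu_b
    return float(np.sqrt(((aligned - b) ** 2).sum(axis=1).mean()))


# Usage: a patch and a displaced, rotated, scaled copy of it.
rng = np.random.default_rng(0)
patch = rng.normal(size=(50, 3))
theta = 0.7
rotation = np.array([
    [np.cos(theta), -np.sin(theta), 0.0],
    [np.sin(theta), np.cos(theta), 0.0],
    [0.0, 0.0, 1.0],
])
moved = 2.5 * patch @ rotation.T + np.array([1.0, -2.0, 3.0])
unrelated = rng.normal(size=(50, 3))

residual_same = similarity_residual(patch, moved)
residual_diff = similarity_residual(patch, unrelated)
```

Under these assumptions, a residual near zero indicates that the two patches differ only by displacement, rotation, and scaling, while an unrelated patch leaves a large residual. A real pipeline would need a tolerance suited to sensor noise and a way to establish point correspondences, neither of which is specified in the abstract.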

Meeting abstract presented at VSS 2012
