September 2011
Volume 11, Issue 11
Free
Vision Sciences Society Annual Meeting Abstract  |   September 2011
Shape and albedo from shading using natural image statistics
Author Affiliations
  • Jonathan Barron
    EECS, UC Berkeley, USA
  • Jitendra Malik
    EECS, UC Berkeley, USA
Journal of Vision September 2011, Vol.11, 1194. doi:https://doi.org/10.1167/11.11.1194
  • Views
  • Share
  • Tools
    • Alerts
      ×
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Jonathan Barron, Jitendra Malik; Shape and albedo from shading using natural image statistics. Journal of Vision 2011;11(11):1194. https://doi.org/10.1167/11.11.1194.

      Download citation file:


      © ARVO (1962-2015); The Authors (2016-present)

      ×
  • Supplements
Abstract

A human observer perceives the Mona Lisa neither as paint on a flat canvas nor as a strange shape with uniform albedo, but instead as a familiar woman with light skin and dark hair. This suggests a statistical formulation of the problem, where certain shapes and albedos are more likely than others. We address the problem of recovering the most likely albedo and shape that explain a single image, using techniques from natural image statistics, but applied separately to the albedo and geometry of a scene.

Our technique is based on multiscale generalizations of the Roth & Black Field of Experts, which was originally proposed for image denoising. The FOE consists of a filter bank and a series of “experts” which each model the heavy-tailed response to one of the filters. A multiscale optimization algorithm is presented for searching over the space of shapes and albedos such that the likelihoods of the FOE models are maximized while the target image is exactly reconstructed. We find that multiscale priors, representations, and optimization are central to the success of our approach.

Our approach can be considered a generalization of “shape from shading” algorithms which is robust to variable albedo, or as a generalization of “intrinsic image” algorithms which explicitly recovers shape, rather than simply recovering shading. Our technique solves a superset of these two problems, and outperforms the previous best individual algorithms (such as those based on Retinex) on problems such as recovering shape and albedo from images of terrain and faces, on the MIT Intrinsic Images dataset.

Our results provide a normative model of how humans may solve this fundamental vision task, and suggests that other work in natural image statistics may aid our understanding of shape and albedo interpretation.

×
×

This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.

×