July 2013
Volume 13, Issue 9
Vision Sciences Society Annual Meeting Abstract  |   July 2013
Human and computer face detection under occlusion
Author Affiliations
  • Sam Anthony
    Department of Psychology, Harvard University
  • Ken Nakayama
    Department of Psychology, Harvard University
Journal of Vision July 2013, Vol.13, 166. doi:https://doi.org/10.1167/13.9.166
  • Views
  • Share
  • Tools
    • Alerts
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Sam Anthony, Ken Nakayama; Human and computer face detection under occlusion. Journal of Vision 2013;13(9):166. https://doi.org/10.1167/13.9.166.

      Download citation file:

      © ARVO (1962-2015); The Authors (2016-present)

  • Supplements

The performance of computer algorithms that detect frontal face images has increased dramatically. However, numerous edge cases still exist where computer performance suffers compared to human ability; humans, able to detect faces from birth, perform at what can be considered an asymptotic level. We previously investigated the divergence between human and computer performance on a task that required detecting faces blended with phase-scrambled noise. Human performance greatly exceeded algorithmic performance on this task, but the stimuli were somewhat unrealistic, and thus a suboptimal benchmark for computer algorithms. Unlike phase-scrambled noise, occlusion is a common feature of natural scenes. Thus, in the present work we investigated the ability of humans and computers to detect heavily occluded faces. In the condition that produced the lowest human accuracy (faces composited over generated Portilla-Simoncelli textures with large, black occluding bars) subjects were able to achieve accuracies well above chance (57.6% mean accuracy on a 3-AFC task, N=409), and greatly exceeded the performance of the algorithms tested; successful detections by algorithms were near zero at levels of visibility where human performance approached ceiling. We investigated the nature of the difference between a range of computer algorithms and humans by using human performance information to gauge and scaffold both algorithm performance and the generalizability of trained classifiers. In typical machine learning training paradigms the training set is labeled only with a binary class identifier, but in our approach we integrate item-level accuracy, response time, and computed difficulty. This strategy of applying rich human performance data to the training and evaluation of algorithms points to promising techniques for increasing the performance and biological plausibility of face detection, face processing and other computer vision algorithms.

Meeting abstract presented at VSS 2013


This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.