May 2008
Volume 8, Issue 6
Vision Sciences Society Annual Meeting Abstract  |   May 2008
Learning probability and reward through experience: Impact of value structure on reach planning
Author Affiliations
  • Erik Schlicht
    Harvard University, Psychology, and California Institute of Technology, Computation and Neural Systems
  • Shin Shimojo
    California Institute of Technology, Computation and Neural Systems
  • Ken Nakayama
    Harvard University, Psychology
Journal of Vision May 2008, Vol.8, 543. doi:https://doi.org/10.1167/8.6.543a
      © ARVO (1962-2015); The Authors (2016-present)

Abstract

In everyday life we must constantly act in the face of uncertainty. From a decision-theoretic standpoint, optimal actions are those that maximize the value associated with the task. However, for humans to act optimally, the brain must have an accurate representation of both the reward and the probability associated with each outcome. Previous research on how humans use value structure to perform reaching movements has focused exclusively on asymptotic performance, ignoring how this structure is learned. This project therefore investigates how value is learned by requiring subjects to reach to targets that appear only after they have completed a portion of their movement toward the possible target locations. Because subjects have no information about the target at the beginning of the reach, their initial trajectories provide a way to quantify reach plans. Value is manipulated by varying either the probability or the reward associated with each target. Subjects are awarded points for correctly acquiring the target, receive no points for reaching to the incorrect target, and lose points for taking too much time. Subjects receive bonus money after the experiment based on their point total, ensuring that the value structure in this paradigm has actual utility. Furthermore, we developed a model that learns, through the subject's experience, which initial biases yield maximal points. We use the model to predict the biases people should adopt and which experience is important for forming value estimates. The results show that as the difference in value between the targets increases, subjects' biases also increase, at a rate that closely matches the maximum-point predictions. Moreover, changes in biases across trials are better predicted by recent experience than by global experience. Together, these results suggest that people learn value structure through recent experience and use this knowledge to guide reach planning.
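The abstract does not specify the form of the model, so the following is only an illustrative sketch of the two ideas it contrasts: estimating target probability from recent versus global experience, and choosing the initial bias that maximizes expected points. Every name here (`global_estimate`, `recent_estimate`, `expected_points`, `best_bias`), the learning rate `alpha`, the `cost` parameter, and the quadratic rule linking remaining distance to the chance of acquiring the target in time are assumptions made for illustration, not the authors' method.

```python
def global_estimate(outcomes):
    """Fraction of all trials so far on which target A appeared."""
    return sum(outcomes) / len(outcomes)


def recent_estimate(outcomes, alpha=0.3, prior=0.5):
    """Exponentially weighted estimate of P(target A); later trials weigh more."""
    p = prior
    for outcome in outcomes:
        p += alpha * (outcome - p)
    return p


def expected_points(bias, p_a, reward_a, reward_b, cost=1.0):
    """Expected points for an initial bias in [0, 1] (0 = toward B, 1 = toward A).

    Assumed toy rule: the chance of acquiring a target in time falls off
    with the squared distance left to travel once the target appears.
    """
    hit_a = 1.0 - cost * (1.0 - bias) ** 2
    hit_b = 1.0 - cost * bias ** 2
    return p_a * reward_a * hit_a + (1.0 - p_a) * reward_b * hit_b


def best_bias(p_a, reward_a, reward_b, cost=1.0, steps=101):
    """Grid search for the bias with the highest expected points."""
    candidates = [i / (steps - 1) for i in range(steps)]
    return max(candidates,
               key=lambda b: expected_points(b, p_a, reward_a, reward_b, cost))


# Hypothetical trial history: 1 = target A appeared, 0 = target B appeared.
history = [0, 0, 0, 0, 1, 1, 1, 1]
print("global:", global_estimate(history),
      "-> bias", best_bias(global_estimate(history), 1, 1))
print("recent:", recent_estimate(history),
      "-> bias", best_bias(recent_estimate(history), 1, 1))
```

Under this assumed rule, the maximizing bias works out to target A's share of the total expected value, so it shifts toward A when either A's probability or A's reward is raised. With the hypothetical history above, the recency-weighted estimate leans toward A while the global estimate stays at one half, which is the kind of difference that would let trial-by-trial bias changes distinguish the two learning accounts.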

Schlicht, E., Shimojo, S., & Nakayama, K. (2008). Learning probability and reward through experience: Impact of value structure on reach planning [Abstract]. Journal of Vision, 8(6):543, 543a, http://journalofvision.org/8/6/543/, doi:10.1167/8.6.543.
Footnotes
 Shimojo Implicit Brain Function Grant (JST)