October 2020
Volume 20, Issue 11
Open Access
Vision Sciences Society Annual Meeting Abstract  |   October 2020
Learning visual contour tracing in a deep recurrent network based on a cortical columnar architecture
Author Affiliations
  • Daniel Schmid
    Ulm University
  • Christian Jarvers
    Ulm University
  • Heiko Neumann
    Ulm University
Journal of Vision October 2020, Vol.20, 1216. doi:https://doi.org/10.1167/jov.20.11.1216
  • Views
  • Share
  • Tools
    • Alerts
      This feature is available to authenticated users only.
      Sign In or Create an Account ×
    • Get Citation

      Daniel Schmid, Christian Jarvers, Heiko Neumann; Learning visual contour tracing in a deep recurrent network based on a cortical columnar architecture. Journal of Vision 2020;20(11):1216. https://doi.org/10.1167/jov.20.11.1216.

      Download citation file:

      © ARVO (1962-2015); The Authors (2016-present)

  • Supplements

Problem. Visual segmentation mechanisms comprise rapid base grouping of simple features by pre-configured circuits and slower but more flexible incremental grouping by dynamically enhancing feature representations (Roelfsema, Ann. Rev. Neurosci., 2006). How can neural circuits for low- and intermediate-level vision learn such mechanisms? Deep learning provides a framework to develop neuroscience-related models, in which inductive biases constrain model computations (Richards et al., Nat. Neurosci., 2019). Here, we investigate the task of contour tracing to incrementally group visual items for decision-making in a reinforcement learning scenario (Roelfsema, Lamme & Spekreijse, Nature, 1998) with architectural constraints inspired by cortical columns. Method. We propose a multi-layer, recurrent, convolutional network where each layer is composed of retinotopically organized cortical columns. Such columns are modeled as pairs of interacting excitatory-inhibitory (E-I) neurons in discrete feature channels. E-I pair activations are computed from input filtering, lateral integration, activity normalization over neuronal pools, and top-down modulating feedback. Reinforcement signals are provided during training, from which the network can learn rewarding actions over sequential stimulus presentations, similar to the corresponding psychophysical experiments in monkeys, where the task is to fixate and hold gaze at a fixation cue and finally making a saccade to target items indicated by tagging via learned neural tracing. Results and Conclusion. The network successfully solves incremental grouping tasks after learning to act in accordance with the different phases of contour target presentation. Activity spreads along the target line to enhance its neural representation relative to a distractor. Inspection of the model reveals that maintaining fixations is mediated by E-I interactions, while the tracing operation that constitutes incremental grouping is mainly implemented by feedforward and recurrent excitatory interactions.


This PDF is available to Subscribers Only

Sign in or purchase a subscription to access this content. ×

You must be signed into an individual account to use this feature.