Back to Search Start Over

Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks

Authors :
Heiko Neumann
Pieter R. Roelfsema
Tobias Brosch
Amsterdam Neuroscience
Adult Psychiatry
Integrative Neurophysiology
Neuroscience Campus Amsterdam - Brain Mechanisms in Health & Disease
Netherlands Institute for Neuroscience (NIN)
Source :
PLoS Computational Biology, Vol 11, Iss 10, p e1004489 (2015), PLoS computational biology, 11(10). Public Library of Science, PLoS Computational Biology, 11(10):e1004489. Public Library of Science, PLoS Computational Biology, 11(10). Public Library of Science, PLoS Computational Biology, Brosch, T, Neumann, H & Roelfsema, P R 2015, ' Reinforcement Learning of Linking and Tracing Contours in Recurrent Neural Networks ', PLoS Computational Biology, vol. 11, no. 10, e1004489 . https://doi.org/10.1371/journal.pcbi.1004489
Publication Year :
2015
Publisher :
Public Library of Science (PLoS), 2015.

Abstract

The processing of a visual stimulus can be subdivided into a number of stages. Upon stimulus presentation there is an early phase of feedforward processing where the visual information is propagated from lower to higher visual areas for the extraction of basic and complex stimulus features. This is followed by a later phase where horizontal connections within areas and feedback connections from higher areas back to lower areas come into play. In this later phase, image elements that are behaviorally relevant are grouped by Gestalt grouping rules and are labeled in the cortex with enhanced neuronal activity (object-based attention in psychology). Recent neurophysiological studies revealed that reward-based learning influences these recurrent grouping processes, but it is not well understood how rewards train recurrent circuits for perceptual organization. This paper examines the mechanisms for reward-based learning of new grouping rules. We derive a learning rule that can explain how rewards influence the information flow through feedforward, horizontal and feedback connections. We illustrate the efficiency with two tasks that have been used to study the neuronal correlates of perceptual organization in early visual cortex. The first task is called contour-integration and demands the integration of collinear contour elements into an elongated curve. We show how reward-based learning causes an enhancement of the representation of the to-be-grouped elements at early levels of a recurrent neural network, just as is observed in the visual cortex of monkeys. The second task is curve-tracing where the aim is to determine the endpoint of an elongated curve composed of connected image elements. If trained with the new learning rule, neural networks learn to propagate enhanced activity over the curve, in accordance with neurophysiological data. We close the paper with a number of model predictions that can be tested in future neurophysiological and computational studies.<br />Author Summary Our experience with the visual world allows us to group image elements that belong to the same perceptual object and to segregate them from other objects and the background. If subjects learn to group contour elements, this experience influences neuronal activity in early visual cortical areas, including the primary visual cortex (V1). Learning presumably depends on alterations in the pattern of connections within and between areas of the visual cortex. However, the processes that control changes in connectivity are not well understood. Here we present the first computational model that can train a neural network to integrate collinear contour elements into elongated curves and to trace a curve through the visual field. The new learning algorithm trains fully recurrent neural networks, provided the connectivity causes the networks to reach a stable state. The model reproduces the behavioral performance of monkeys trained in these tasks and explains the patterns of neuronal activity in the visual cortex that emerge during learning, which is remarkable because the only feedback for the model is a reward for successful trials. We discuss a number of the model predictions that can be tested in future neuroscientific work.

Details

Language :
English
ISSN :
15537358 and 1553734X
Volume :
11
Issue :
10
Database :
OpenAIRE
Journal :
PLoS Computational Biology
Accession number :
edsair.doi.dedup.....b1ab99205f7f7f4276bb898fcb69c8bb