Start Over

Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex.

Authors :: Chadderdon GL
Neymotin SA
Kerr CC
Lytton WW
Source :: PloS one [PLoS One] 2012; Vol. 7 (10), pp. e47251. Date of Electronic Publication: 2012 Oct 19.
Publication Year :: 2012
Abstract: Sensorimotor control has traditionally been considered from a control theory perspective, without relation to neurobiology. In contrast, here we utilized a spiking-neuron model of motor cortex and trained it to perform a simple movement task, which consisted of rotating a single-joint "forearm" to a target. Learning was based on a reinforcement mechanism analogous to that of the dopamine system. This provided a global reward or punishment signal in response to decreasing or increasing distance from hand to target, respectively. Output was partially driven by Poisson motor babbling, creating stochastic movements that could then be shaped by learning. The virtual forearm consisted of a single segment rotated around an elbow joint, controlled by flexor and extensor muscles. The model consisted of 144 excitatory and 64 inhibitory event-based neurons, each with AMPA, NMDA, and GABA synapses. Proprioceptive cell input to this model encoded the 2 muscle lengths. Plasticity was only enabled in feedforward connections between input and output excitatory units, using spike-timing-dependent eligibility traces for synaptic credit or blame assignment. Learning resulted from a global 3-valued signal: reward (+1), no learning (0), or punishment (-1), corresponding to phasic increases, lack of change, or phasic decreases of dopaminergic cell firing, respectively. Successful learning only occurred when both reward and punishment were enabled. In this case, 5 target angles were learned successfully within 180 s of simulation time, with a median error of 8 degrees. Motor babbling allowed exploratory learning, but decreased the stability of the learned behavior, since the hand continued moving after reaching the target. Our model demonstrated that a global reinforcement signal, coupled with eligibility traces for synaptic plasticity, can train a spiking sensorimotor network to perform goal-directed motor behavior.

Subjects :: Action Potentials physiology
Animals
Computer Simulation
Dopamine physiology
Humans
Motor Cortex physiology
Neural Networks, Computer
Neuronal Plasticity physiology
Neurons physiology
Reward
Stochastic Processes
Synapses physiology
Synaptic Transmission physiology
Models, Neurological
Movement
Reinforcement, Psychology

Details

Language :: English
ISSN :: 1932-6203
Volume :: 7
Issue :: 10
Database :: MEDLINE
Journal :: PloS one
Publication Type :: Academic Journal
Accession number :: 23094042
Full Text :: https://doi.org/10.1371/journal.pone.0047251

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Reinforcement learning of targeted movement in a spiking neuronal model of motor cortex.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources