Daniel M. Wolpert, J. Randall Flanagan, Jason P. Gallivan, Joshua B. Moskowitz, Daniel J. Gale, Moskowitz, Joshua B [0000-0002-7474-223X], Gale, Daniel J [0000-0002-9082-1659], Gallivan, Jason P [0000-0002-7362-109X], Wolpert, Daniel M [0000-0003-2011-2790], Flanagan, J Randall [0000-0003-2760-6005], Apollo - University of Cambridge Repository, Moskowitz, Joshua B. [0000-0002-7474-223X], Gale, Daniel J. [0000-0002-9082-1659], Gallivan, Jason P. [0000-0002-7362-109X], Wolpert, Daniel M. [0000-0003-2011-2790], and Flanagan, J. Randall [0000-0003-2760-6005]
It is well-established that people can factor into account the distribution of their errors in motor performance so as to optimize reward. Here we asked whether, in the context of motor learning where errors decrease across trials, people take into account their future, improved performance so as to make optimal decisions to maximize reward. One group of participants performed a virtual throwing task in which, periodically, they were given the opportunity to select from a set of smaller targets of increasing value. A second group of participants performed a reaching task under a visuomotor rotation in which, after performing a initial set of trials, they selected a reward structure (ratio of points for target hits and misses) for different exploitation horizons (i.e., numbers of trials they might be asked to perform). Because movement errors decreased exponentially across trials in both learning tasks, optimal target selection (task 1) and optimal reward structure selection (task 2) required taking into account future performance. The results from both tasks indicate that people anticipate their future motor performance so as to make decisions that will improve their expected future reward., Author summary A hallmark of motor learning is the reduction of performance errors with practice, which can have important ramifications for decision making. For example, when purchasing a new surfboard, our choice should anticipate our improvement with practice so that we select an appropriate board. In this paper we asked whether, in the context of two different motor learning tasks, people take into account their future, improved performance so as to make optimal decisions to maximize reward. In the throwing task, which involved learning the dynamics of the thrown object, participants were periodically given the opportunity to select from a set of smaller targets of increasing value. In the reaching task, which involved adapting to a visuomotor rotation, participants—after performing an initial block of trials—selected a reward structure (ratio of points for target hits and misses) for different exploitation horizons (i.e., numbers of trials they might be asked to perform). Performance errors decreased across trials in both tasks and therefore optimal target selection (task 1) and optimal reward structure selection (task 2) required taking into account future performance. In both tasks, participants anticipated their future motor performance so as to make decisions that improved their expected future reward.