Back to Search Start Over

Feature-based methods for large scale dynamic programming

Authors :
Benjamin Van Roy
John N. Tsitsiklis
Source :
Machine Learning. 22:59-94
Publication Year :
1996
Publisher :
Springer Science and Business Media LLC, 1996.

Abstract

We develop a methodological framework and present a few different ways in which dynamic programming and compact representations can be combined to solve large scale stochastic control problems. In particular, we develop algorithms that employ two types of feature-based compact representations; that is, representations that involve feature extraction and a relatively simple approximation architecture. We prove the convergence of these algorithms and provide bounds on the approximation error. As an example, one of these algorithms is used to generate a strategy for the game of Tetris. Furthermore, we provide a counter-example illustrating the difficulties of integrating compact representations with dynamic programming, which exemplifies the shortcomings of certain simple approaches.

Details

ISSN :
15730565 and 08856125
Volume :
22
Database :
OpenAIRE
Journal :
Machine Learning
Accession number :
edsair.doi...........97b1f36d255450ce9312237b63a32795
Full Text :
https://doi.org/10.1007/bf00114724