Back to Search
Start Over
Feature-based methods for large scale dynamic programming
- Source :
- Machine Learning. 22:59-94
- Publication Year :
- 1996
- Publisher :
- Springer Science and Business Media LLC, 1996.
-
Abstract
- We develop a methodological framework and present a few different ways in which dynamic programming and compact representations can be combined to solve large scale stochastic control problems. In particular, we develop algorithms that employ two types of feature-based compact representations; that is, representations that involve feature extraction and a relatively simple approximation architecture. We prove the convergence of these algorithms and provide bounds on the approximation error. As an example, one of these algorithms is used to generate a strategy for the game of Tetris. Furthermore, we provide a counter-example illustrating the difficulties of integrating compact representations with dynamic programming, which exemplifies the shortcomings of certain simple approaches.
- Subjects :
- Theoretical computer science
business.industry
Computer science
Dimensionality reduction
Feature extraction
Machine learning
computer.software_genre
Dynamic programming
Function approximation
Artificial Intelligence
Feature (computer vision)
Approximation error
Reinforcement learning
Artificial intelligence
business
computer
Software
Curse of dimensionality
Subjects
Details
- ISSN :
- 15730565 and 08856125
- Volume :
- 22
- Database :
- OpenAIRE
- Journal :
- Machine Learning
- Accession number :
- edsair.doi...........97b1f36d255450ce9312237b63a32795
- Full Text :
- https://doi.org/10.1007/bf00114724