Back to Search Start Over

GraphMDP: A New Decomposition Tool for Solving Markov Decision Processes

Authors :
Pierre Laroche
Autonomous intelligent machine (MAIA)
INRIA Lorraine
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA)
Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)
Loria, Publications
Source :
International Journal on Artificial Intelligence Tools, International Journal on Artificial Intelligence Tools, World Scientific Publishing, 2001, 10 (3), pp.325-343, Pierre Laroche, International Journal on Artificial Intelligence Tools, 2001, 10 (3), pp.325-343
Publication Year :
2001
Publisher :
HAL CCSD, 2001.

Abstract

Article dans revue scientifique avec comité de lecture.; In this paper, we present a new tool for solving weakly-coupled Markov Decision Processes using decomposition techniques. Using a predefined partition of the MDP, a directed graph is built to decompose the global MDP into small local MDPs which are independently solved. An approximate solution for the global MDP is obtained by combining local solutions. Our approach has been tested on a mobile robotics application. It allows near-optimal solutions to be obtained in significantly reduced time. We also present preliminary results concerning a parallel implantation of our tool.

Details

Language :
English
ISSN :
02182130
Database :
OpenAIRE
Journal :
International Journal on Artificial Intelligence Tools, International Journal on Artificial Intelligence Tools, World Scientific Publishing, 2001, 10 (3), pp.325-343, Pierre Laroche, International Journal on Artificial Intelligence Tools, 2001, 10 (3), pp.325-343
Accession number :
edsair.dedup.wf.001..546504f161fba61a95a98d42618af7d3