Back to Search
Start Over
An online multi-agent co-operative learning algorithm in POMDPs.
- Source :
-
Journal of Experimental & Theoretical Artificial Intelligence . Dec2008, Vol. 20 Issue 4, p335-344. 10p. 3 Graphs. - Publication Year :
- 2008
-
Abstract
- Solving partially observable Markov decision processes (POMDPs) is a complex task that is often intractable. This paper examines the problem of finding an optimal policy for POMDPs. While a lot of effort has been made to develop algorithms to solve POMDPs, the question of automatically finding good low-dimensional spaces in multi-agent co-operative learning domains has not been explored thoroughly. To identify this question, an online algorithm CMEAS is presented to improve the POMDP model. This algorithm is based on a look-ahead search to find the best action to execute at each cycle. Thus the overwhelming complexity of computing a policy for each possible situation is avoided. A series of simulations demonstrate this good strategy and performance of the proposed algorithm when multiple agents co-operate to find an optimal policy for POMDPs. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 0952813X
- Volume :
- 20
- Issue :
- 4
- Database :
- Academic Search Index
- Journal :
- Journal of Experimental & Theoretical Artificial Intelligence
- Publication Type :
- Academic Journal
- Accession number :
- 34716841
- Full Text :
- https://doi.org/10.1080/09528130701679820