Optimization in Markov decision problems with transition-dependent cost functions

Authors :: J. Wang
Xi-Ren Cao
Source :: Proceedings of the 1999 IEEE International Conference on Control Applications (Cat. No.99CH36328).
Publication Year :: 2003
Publisher :: IEEE, 2003.
Abstract :: The traditional Markov decision problem (MDP) assumes a cost function that depends only on the state and the corresponding action. In many real-world applications, however, the cost incurred also depends on the particular transition, which makes the traditional MDP solution infeasible for these problems. We apply performance potential theory as an optimization tool for MDPs. In particular, we introduce the notion of the expanded Markov chain to map this problem to a general form. Both computation-based and sample-path-based algorithms are developed for deriving the potentials. We also address complexity-reduction techniques. Finally, we apply these techniques to the "join the shortest queue" problem, which is a significant component in the analysis of communication systems.

Subjects :: Queueing theory
Mathematical optimization
Computational complexity theory
Markov chain
Decision theory
Component (UML)
Markov process
Decision problem
Queue
Mathematics

Database :: OpenAIRE
Journal :: Proceedings of the 1999 IEEE International Conference on Control Applications (Cat. No.99CH36328)
Accession number :: edsair.doi...........20ec716235835b9c598e9a1dded6ef3a
Full Text :: https://doi.org/10.1109/cca.1999.801190