Back to Search
Start Over
Optimization in Markov decision problems with transition-dependent cost functions
- Source :
- Proceedings of the 1999 IEEE International Conference on Control Applications (Cat. No.99CH36328).
- Publication Year :
- 2003
- Publisher :
- IEEE, 2003.
-
Abstract
- The traditional MDP deals with the cost function which only depends on the state and the corresponding action. In the real world however, there are many applications where the cost incurred depends on the particular transition as well, which makes the traditional MDP solution infeasible for these problems. We apply the performance potential theory as an optimization tool for MDP. In particular the notion of the expanded Markov chain is introduced to map this problem to a general form. Both computation-based and sample-path-based algorithms are developed for potential derivation. We address ourselves to the complexity-reduction techniques. Finally, we apply these techniques to the "join the shortest queue" application, which is a significant component in the analysis of communication systems.
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 1999 IEEE International Conference on Control Applications (Cat. No.99CH36328)
- Accession number :
- edsair.doi...........20ec716235835b9c598e9a1dded6ef3a
- Full Text :
- https://doi.org/10.1109/cca.1999.801190