LEARNING ALGORITHMS FOR TWO-PERSON ZERO-SUM STOCHASTIC GAMES WITH INCOMPLETE INFORMATION.

Authors :
Lakshmivarahan, S.
Narendra, Kumpati S.
Source :
Mathematics of Operations Research; Aug81, Vol. 6 Issue 3, p379-386, 8p
Publication Year :
1981

Abstract

This paper investigates conditions under which two learning algorithms playing a zero-sum sequential stochastic game would arrive at optimal pure strategies. Neither player has knowledge of either the pay-off matrix or the choice of strategies available to the other, and both players update their own strategies at every stage entirely on the basis of the random outcome at that stage. The proposed learning algorithms are shown to converge to the optimal pure strategies, when they exist, with probabilities as close to 1 as desired. [ABSTRACT FROM AUTHOR]
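
The setting described in the abstract can be illustrated with a minimal sketch. The abstract does not name the update rule, so the linear reward-inaction scheme used here is an assumption, as are the function names, the step size, and the example matrix D (whose entries are taken to be the probability that the row player wins a stage). Each player sees only whether it won the stage, never the matrix or the opponent's action.

```python
import random

# Hypothetical win-probability matrix: D[i][j] is the chance that the row
# player wins a stage when row plays i and column plays j.
D = [[0.6, 0.8],
     [0.35, 0.9]]


def find_saddle(d):
    """Return (row, col) of a pure-strategy saddle point of d, or None."""
    for i, row in enumerate(d):
        for j, value in enumerate(row):
            column = [r[j] for r in d]
            # Saddle point: smallest entry in its row, largest in its column.
            if value == min(row) and value == max(column):
                return (i, j)
    return None


def reward_inaction(p, action, step):
    """Assumed update rule: move probability mass toward a rewarded action."""
    return [
        q + step * (1.0 - q) if k == action else q * (1.0 - step)
        for k, q in enumerate(p)
    ]


def play(d, step=0.01, stages=20000, seed=0):
    """Simulate both learners; return their final mixed strategies."""
    rng = random.Random(seed)
    p = [1.0 / len(d)] * len(d)        # row player's action probabilities
    q = [1.0 / len(d[0])] * len(d[0])  # column player's action probabilities
    for _ in range(stages):
        i = rng.choices(range(len(p)), weights=p)[0]
        j = rng.choices(range(len(q)), weights=q)[0]
        row_wins = rng.random() < d[i][j]
        # Only the winner of the stage updates; the loser leaves its
        # probabilities unchanged (the "inaction" part of the rule).
        if row_wins:
            p = reward_inaction(p, i, step)
        else:
            q = reward_inaction(q, j, step)
    return p, q
```

Calling `play(D)` returns the two probability vectors after the last stage, which can be compared against the pair returned by `find_saddle(D)`. A smaller `step` slows learning but, in schemes of this kind, generally makes settling on a wrong action less likely, which is the trade-off behind the "as close to 1 as desired" phrasing.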

Details

Language :
English
ISSN :
0364765X
Volume :
6
Issue :
3
Database :
Complementary Index
Journal :
Mathematics of Operations Research
Publication Type :
Academic Journal
Accession number :
9275291
Full Text :
https://doi.org/10.1287/moor.6.3.379