Back to Search
Start Over
An adaptive strategy via reinforcement learning for the prisoner dilemma game
- Source :
- IEEE/CAA Journal of Automatica Sinica. 5:301-310
- Publication Year :
- 2018
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2018.
-
Abstract
- The iterated prisoner U+02BC s dilemma U+0028 IPD U+0029 is an ideal model for analyzing interactions between agents in complex networks. It has attracted wide interest in the development of novel strategies since the success of tit-for-tat in Axelrod U+02BC s tournament. This paper studies a new adaptive strategy of IPD in different complex networks, where agents can learn and adapt their strategies through reinforcement learning method. A temporal difference learning method is applied for designing the adaptive strategy to optimize the decision making process of the agents. Previous studies indicated that mutual cooperation is hard to emerge in the IPD. Therefore, three examples which based on square lattice network and scale-free network are provided to show two features of the adaptive strategy. First, the mutual cooperation can be achieved by the group with adaptive agents under scale-free network, and once evolution has converged mutual cooperation, it is unlikely to shift. Secondly, the adaptive strategy can earn a better payoff compared with other strategies in the square network. The analytical properties are discussed for verifying evolutionary stability of the adaptive strategy.
- Subjects :
- 0209 industrial biotechnology
Adaptive strategies
Computer science
business.industry
Stochastic game
02 engineering and technology
Prisoner's dilemma
Complex network
020901 industrial engineering & automation
Artificial Intelligence
Control and Systems Engineering
Adaptive system
0202 electrical engineering, electronic engineering, information engineering
Reinforcement learning
020201 artificial intelligence & image processing
Artificial intelligence
Temporal difference learning
business
Game theory
Information Systems
Subjects
Details
- ISSN :
- 23299274 and 23299266
- Volume :
- 5
- Database :
- OpenAIRE
- Journal :
- IEEE/CAA Journal of Automatica Sinica
- Accession number :
- edsair.doi...........b807f450192269252150c0d9af388167
- Full Text :
- https://doi.org/10.1109/jas.2017.7510466