Back to Search
Start Over
Reinforcement learning algorithm based on minimum state method and average reward
- Source :
- Tongxin xuebao, Vol 32, Pp 66-71 (2011)
- Publication Year :
- 2011
- Publisher :
- Editorial Department of Journal on Communications, 2011.
-
Abstract
- In allusion to the problem that Q-Learning,which was used discount reward as the evaluation criterion,could not show the affect of the action to the next situation,AR-Q-Learning was put forward based on the average reward and Q-Learning.In allusion to the curse of dimensionality,which meant that the computational requirement grew exponen-tially with the number of the state variable.Minimum state method was put forward.AR-Q-Learning and minimum state method were used in reinforcement learning for Blocks World,and the result of the experiment shows that the method has the characteristic of aftereffect and converges more faster than Q-Learning,and at the same time,solve the curse of di-mensionality in a certain extent in Blocks World.
Details
- Language :
- Chinese
- ISSN :
- 1000436X
- Volume :
- 32
- Database :
- Directory of Open Access Journals
- Journal :
- Tongxin xuebao
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.9526bf106a34407b8f011453e908acf9
- Document Type :
- article