Back to Search Start Over

Experience Sharing Based Memetic Transfer Learning for Multiagent Reinforcement Learning.

Authors :
Wang, Tonghao
Peng, Xingguang
Jin, Yaochu
Xu, Demin
Source :
Memetic Computing; Mar2022, Vol. 14 Issue 1, p3-17, 15p
Publication Year :
2022

Abstract

In transfer learning (TL) for multiagent reinforcement learning (MARL), most popular methods are based on action advising scheme, in which skilled agents directly transfer actions, i.e., explicit knowledge, to other agents. However, this scheme requires an inquiry-answer process, which quadratically increases the computational load as the number of agents increases. To enhance the scalability of TL for MARL when all the agents learn from scratch, we propose an experience sharing based memetic TL for MARL, called MeTL-ES. In the MeTL-ES, the agents actively share implicit memetic knowledge (experience), which avoids the inquiry-answer process and brings highly scalable and effective acceleration of learning. In particular, we firstly design an experience sharing scheme to share implicit meme based experience among the agents. Within this scheme, experience from the peers is collected and used to speed up the learning process. More importantly, this scheme frees the agents from actively asking for the states and policies of other agents, which enhances scalability. Secondly, an event-triggered scheme is designed to enable the agents to share the experiences at appropriate timings. Simulation studies show that, compared with the existing methods, the proposed MeTL-ES can more effectively enhance the learning speed of learning-from-scratch MARL systems. At the same time, we show that the communication cost and computational load of MeTL-ES increase linearly with the growth of the number of agents, indicating better scalability compared to the popular action advising based methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
18659284
Volume :
14
Issue :
1
Database :
Complementary Index
Journal :
Memetic Computing
Publication Type :
Academic Journal
Accession number :
155688383
Full Text :
https://doi.org/10.1007/s12293-021-00339-4