Back to Search
Start Over
Introducing Symmetries to Black Box Meta Reinforcement Learning
- Source :
- Proceedings of the AAAI Conference on Artificial Intelligence. 36:7202-7210
- Publication Year :
- 2022
- Publisher :
- Association for the Advancement of Artificial Intelligence (AAAI), 2022.
-
Abstract
- Meta reinforcement learning (RL) attempts to discover new RL algorithms automatically from environment interaction. In so-called black-box approaches, the policy and the learning algorithm are jointly represented by a single neural network. These methods are very flexible, but they tend to underperform in terms of generalisation to new, unseen environments. In this paper, we explore the role of symmetries in meta-generalisation. We show that a recent successful meta RL approach that meta-learns an objective for backpropagation-based learning exhibits certain symmetries (specifically the reuse of the learning rule, and invariance to input and output permutations) that are not present in typical black-box meta RL systems. We hypothesise that these symmetries can play an important role in meta-generalisation. Building off recent work in black-box supervised meta learning, we develop a black-box meta RL system that exhibits these same symmetries. We show through careful experimentation that incorporating these symmetries can lead to algorithms with a greater ability to generalise to unseen action & observation spaces, tasks, and environments.<br />AAAI 2022
- Subjects :
- FOS: Computer and information sciences
Computer Science::Machine Learning
Computer Science - Machine Learning
Artificial Intelligence (cs.AI)
Computer Science - Artificial Intelligence
Statistics - Machine Learning
Computer Science - Neural and Evolutionary Computing
Machine Learning (stat.ML)
Neural and Evolutionary Computing (cs.NE)
General Medicine
Machine Learning (cs.LG)
Subjects
Details
- ISSN :
- 23743468 and 21595399
- Volume :
- 36
- Database :
- OpenAIRE
- Journal :
- Proceedings of the AAAI Conference on Artificial Intelligence
- Accession number :
- edsair.doi.dedup.....a25fd4658abaed9747a4b6c2d61294dc
- Full Text :
- https://doi.org/10.1609/aaai.v36i7.20681