1. Intrinsic motivation and episodic memories for robot exploration of high-dimensional sensory spaces
- Author
-
Guido Schillaci, David Colliaux, Peter Hanappe, Verena V. Hafner, Timothée Wintz, and Antonio Pico Villalpando
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer science ,media_common.quotation_subject ,Stability (learning theory) ,Experimental and Cognitive Psychology ,Sensory system ,050105 experimental psychology ,Machine Learning (cs.LG) ,Adaptive models ,Computer Science - Robotics ,03 medical and health sciences ,Behavioral Neuroscience ,0302 clinical medicine ,0501 psychology and cognitive sciences ,Set (psychology) ,Episodic memory ,intrinsic motivation ,media_common ,robotics ,Computational model ,Forgetting ,Artificial neural network ,business.industry ,05 social sciences ,episodic memory ,predictive models ,004 Informatik ,Artificial Intelligence (cs.AI) ,memory consolidation ,Curiosity ,Unsupervised learning ,Artificial intelligence ,ddc:004 ,business ,Robotics (cs.RO) ,030217 neurology & neurosurgery - Abstract
This work presents an architecture that generates curiosity-driven goal-directed exploration behaviours for an image sensor of a microfarming robot. A combination of deep neural networks for offline unsupervised learning of low-dimensional features from images, and of online learning of shallow neural networks representing the inverse and forward kinematics of the system have been used. The artificial curiosity system assigns interest values to a set of pre-defined goals, and drives the exploration towards those that are expected to maximise the learning progress. We propose the integration of an episodic memory in intrinsic motivation systems to face catastrophic forgetting issues, typically experienced when performing online updates of artificial neural networks. Our results show that adopting an episodic memory system not only prevents the computational models from quickly forgetting knowledge that has been previously acquired, but also provides new avenues for modulating the balance between plasticity and stability of the models., Comment: This manuscript has been submitted for consideration for publication in the Adaptive Behaviour Sage Journal, edited by Tom Froese
- Published
- 2020
- Full Text
- View/download PDF