Back to Search
Start Over
A Structure of Restricted Boltzmann Machine for Modeling System Dynamics
- Source :
- IJCNN, IJCNN 2020-International Joint Conference on Neural Networks, IJCNN 2020-International Joint Conference on Neural Networks, IEEE, Jul 2020, Glasgow, United Kingdom. pp.8
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- International audience; This paper presents a new approach for learning transition function in state representation learning (SRL) for control. While state-of-the-art methods use different deterministic neural networks to learn forward and inverse state transition functions independently with auto-supervised learning, we introduce a bidirectional stochastic model to learn both transition functions. We aim at using the uncertainty of the model on its predictions as an intrinsic motivation for exploration to enhance the representation learning. More, using the same model to learn both transition functions allows sharing the parameters, which can reduce their number and should increase the embedding quality of the representation. We use a factored restricted Boltzmann machine (fRBM) based model, enhanced with dedicated structure for learning system dynamics and transitions with shared parameters. The presented work focuses on building the structure of the bidirectional transition model for unsupervised learning. Our fRBM structure is directly inspired from physics interactions between inputs and outputs in reinforcement learning framework. We compare different training algorithms for learning the model that must be able to predict observable random variables to be used in SRL framework. Our structure is not restricted to any type of observable, nevertheless in this paper we focus on learning dynamics from the OpenAI Gym environment Swinging Pendulum. We show that the proposed structure is able to learn bidirectional transition function and performs well in prediction task.
- Subjects :
- Computer Science::Machine Learning
Restricted Boltzmann machine
Artificial neural network
Unsupervised Deep Learning
business.industry
Stochastic process
02 engineering and technology
[INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]
System dynamics
03 medical and health sciences
0302 clinical medicine
State Representation Learning
0202 electrical engineering, electronic engineering, information engineering
Unsupervised learning
Reinforcement learning
020201 artificial intelligence & image processing
Artificial intelligence
business
Representation (mathematics)
Feature learning
Factored Restricted Boltzmann Machine
030217 neurology & neurosurgery
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2020 International Joint Conference on Neural Networks (IJCNN)
- Accession number :
- edsair.doi.dedup.....e1d9cf82812dad1ecc7bc946be082160
- Full Text :
- https://doi.org/10.1109/ijcnn48605.2020.9206758