Author: "Nat Dilokthanakul" / Topic: 0202 electrical engineering, electronic engineering, information engineering - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Nat Dilokthanakul"' showing total 3 results


1. Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning

Author: Christos Kaplanis, Nick Pawlowski, Murray Shanahan, and Nat Dilokthanakul
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Networks and Communications, Computer science, 02 engineering and technology, Machine learning, Machine Learning (cs.LG), Artificial Intelligence, 0202 electrical engineering, electronic engineering, information engineering, Feature (machine learning), Reinforcement learning, Representation (mathematics), Artificial neural network, Computer Science Applications, Visualization, Artificial Intelligence (cs.AI), Task analysis, Key (cryptography), 020201 artificial intelligence & image processing, Artificial intelligence, Software
Abstract: One of the main concerns of deep reinforcement learning (DRL) is the data inefficiency problem, which stems both from an inability to fully utilize acquired data and from naive exploration strategies. To alleviate these problems, we propose a DRL algorithm that aims to improve data efficiency through both the utilization of unrewarded experiences and a better exploration strategy, combining ideas from unsupervised auxiliary tasks, intrinsic motivation, and hierarchical reinforcement learning (HRL). Our method is based on a simple HRL architecture with a metacontroller and a subcontroller. The subcontroller is intrinsically motivated by the metacontroller to learn to control aspects of the environment, with the intention of giving the agent: 1) a neural representation that is generically useful for tasks that involve manipulation of the environment and 2) the ability to explore the environment in a temporally extended manner through the control of the metacontroller. In this way, we reinterpret the notion of pixel- and feature-control auxiliary tasks as reusable skills that can be learned via an intrinsic reward. We evaluate our method on a number of Atari 2600 games. We found that it outperforms the baseline in several environments and significantly improves performance in one of the hardest games, Montezuma's Revenge, for which the ability to utilize sparse data is key. We found that the inclusion of intrinsic reward is crucial for the improvement in performance and that most of the benefit seems to be derived from the representations learned during training.
Published: 2019
Full Text: View/download PDF

2. Dynamical State Forcing on Central Pattern Generators for Efficient Robot Locomotion Control

Author: Kawee Tiraborisute, Thirawat Chuthong, Nat Dilokthanakul, Binggwong Leung, Potiwat Ngamkajornwiwat, and Poramate Manoonpong
Subjects: Computer science, PID controller, Central pattern generator, Motor control, 02 engineering and technology, 03 medical and health sciences, 0302 clinical medicine, Control theory, 0202 electrical engineering, electronic engineering, information engineering, Robot, 020201 artificial intelligence & image processing, 030217 neurology & neurosurgery, Parametric statistics, Robot locomotion
Abstract: Many CPG-based locomotion models suffer from the tracking error problem, where a mismatch between the CPG driving signal and the state of the robot can cause undesirable behaviours in legged robots. To alleviate this problem, we introduce a mechanism that modulates the CPG signal using the robot's interoceptive information. The key concept is to generate a driving signal that is easier for the robot to follow, yet can still drive the locomotion of the robot. This can be done by nudging the CPG signal in the direction of lower tracking error, which can be calculated analytically. Unlike other reactive CPGs, the proposed method does not rely on any parametric learning ability to adjust the shape of the signal, making it a unique option for biological adaptive motor control. Our experimental results show that the proposed method successfully reduces the tracking error. We also show that the CPG signal, regulated by the proposed method, is robust to perturbation and can smoothly return to the default pattern.
Published: 2020
Full Text: View/download PDF

3. Investigating Partner Diversification Methods in Cooperative Multi-agent Deep Reinforcement Learning

Author: Rujikorn Charakorn, Nat Dilokthanakul, and Poramate Manoonpong
Subjects: Knowledge management, Computer science, Multi-agent system, Population, 02 engineering and technology, 010501 environmental sciences, Overfitting, Diversification (marketing strategy), 01 natural sciences, Generalization (learning), 0202 electrical engineering, electronic engineering, information engineering, Reinforcement learning, 020201 artificial intelligence & image processing, Markov decision process, 0105 earth and related environmental sciences, Diversity (business)
Abstract: Overfitting to learning partners is a known problem in multi-agent reinforcement learning (MARL), caused by the co-evolution of learning agents. Previous works explicitly add diversity to learning partners to mitigate this problem. However, since there are many approaches for introducing diversity, it is not clear which one should be used under what circumstances. In this work, we clarify the situation and reveal that widely used methods such as partner sampling and population-based training are unreliable at introducing diversity in fully cooperative multi-agent Markov decision processes. We find that generating pre-trained partners is a simple yet effective procedure to achieve diversity. Finally, we highlight the impact of diversified learning partners on the generalization of learning agents using cross-play and ad-hoc team performance as evaluation metrics.
Published: 2020
Full Text: View/download PDF
