Data-Efficient and Safe Learning for Humanoid Locomotion Aided by a Dynamic Balancing Model

Authors :: Luis Sentis
Jaemin Lee
Junhyeok Ahn
Source :: IEEE Robotics and Automation Letters. 5:4376-4383
Publication Year :: 2020
Publisher :: Institute of Electrical and Electronics Engineers (IEEE), 2020.
Abstract: In this letter, we formulate a novel Markov Decision Process (MDP) for safe and data-efficient learning for humanoid locomotion aided by a dynamic balancing model. In our previous studies of biped locomotion, we relied on a low-dimensional robot model, commonly used in high-level Walking Pattern Generators (WPGs). However, a low-level feedback controller cannot precisely track desired footstep locations due to the discrepancies between the full order model and the simplified model. In this study, we propose mitigating this problem by complementing a WPG with reinforcement learning. More specifically, we propose a structured footstep control method consisting of a WPG, a neural network, and a safety controller. The WPG provides an analytical method that promotes efficient learning while the neural network maximizes long-term rewards, and the safety controller encourages safe exploration based on step capturability and the use of control-barrier functions. Our contributions include the following: (1) a structured learning control method for locomotion, (2) a data-efficient and safe learning process to improve walking using a physics-based model, and (3) the scalability of the procedure to various types of humanoid robots and walking.
Comment: 8 pages, 7 figures

Details

ISSN :: 23773774
Volume :: 5
Database :: OpenAIRE
Journal :: IEEE Robotics and Automation Letters
Accession number :: edsair.doi.dedup.....c998710a87bb57586f4bc03b1eb35536
