Back to Search Start Over

A Heuristics-Based Reinforcement Learning Method to Control Bipedal Robots.

Authors :
Qin, Daoling
Zhang, Guoteng
Zhu, Zhengguo
Chen, Teng
Zhu, Weiliang
Rong, Xuewen
Xie, Anhuan
Li, Yibin
Source :
International Journal of Humanoid Robotics; Jun2024, Vol. 21 Issue 3, p1-22, 22p
Publication Year :
2024

Abstract

A new method is proposed to control bipedal robots to achieve flexible omni-directional motion and robust locomotion under complex disturbances, called the heuristics-based reinforcement learning (HBRL) framework. HBRL shows great training efficiency in simulation. Heuristic reference trajectories play a crucial role in HBRL, which guide the training process. Exploration rewards, leg-foot reset condition, and command curriculum are three significant components to optimize the training process. An estimator network is utilized to supply linear velocities and foot contact information. We train controllers on flat ground in simulation. To demonstrate robustness and versatility, the trained controllers were tested on BRAVER, a point-foot bipedal robot with three joints on each leg. The controllers enabled BRAVER to perform omni-directional locomotion with the maximum forward speed reaching 2 m/s. The robot could also maintain balance under external pushing and over uneven terrains. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02198436
Volume :
21
Issue :
3
Database :
Complementary Index
Journal :
International Journal of Humanoid Robotics
Publication Type :
Academic Journal
Accession number :
177635869
Full Text :
https://doi.org/10.1142/S0219843623500135