Start Over

Reinforcement Learning Control for a 2-DOF Helicopter With State Constraints: Theory and Experiments

Authors :: Zhao, Zhijia
He, Weitian
Mu, Chaoxu
Zou, Tao
Hong, Keum-Shik
Li, Han-Xiong
Source :: IEEE Transactions on Automation Science and Engineering: A Publication of the IEEE Robotics and Automation Society; January 2024, Vol. 21 Issue: 1 p157-167, 11p
Publication Year :: 2024
Abstract: This study focuses on the novel reinforcement learning control strategy of a nonlinear two-degrees-of-freedom (2-DOF) helicopter system for tracking the desired trajectory while minimizing the tracking error. First, gradient descent algorithm is incorporated in the context of the reinforcement learning control scheme to obtain the adaptive laws. Subsequently, considering the uncertainties in the nonlinear system, radial basis function (RBF) neural networks (NNs) are exploited to approximate the unknown internal dynamics. In contrast to the previous studies, aiming at accelerating the convergence in reinforcement learning control, a barrier Lyapunov function is constructed to constrain the states to ensure that the tracking error rapidly converges to a neighborhood of zero. Under the proposed control strategy, the states of the closed-loop system are proven to be semi-globally uniformly ultimately bounded through rigorous Lyapunov analyses, and the state constraints are satisfied. Furthermore, the simulations and experiments conducted on a Quanser laboratory platform reveal that the proposed control functions are suitable and effective. Note to Practitioners—This paper is motivated by designing a reinforcement learning control strategy to enhance online learning capability and control performance of the controller for a nonlinear 2-DOF helicopter system. The control framework is divided into the design of the critic and actor NNs, responsible primarily for evaluating the control performance and approximating uncertainties in the system separately. Unlike the adaptive NN control, the actor NN weights are updated by combining information of states and inputs from the critic NN. In addition, aiming at accelerating the convergence, a barrier Lyapunov function is constructed to constrain the states to ensure that the tracking error rapidly converges to a neighborhood of zero. Finally, the proposed control strategy is validated in simulation and experiment on the Quanser laboratory platform.

Details

Language :: English
ISSN :: 15455955 and 15583783
Volume :: 21
Issue :: 1
Database :: Supplemental Index
Journal :: IEEE Transactions on Automation Science and Engineering: A Publication of the IEEE Robotics and Automation Society
Publication Type :: Periodical
Accession number :: ejs65156459
Full Text :: https://doi.org/10.1109/TASE.2022.3215738

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Reinforcement Learning Control for a 2-DOF Helicopter With State Constraints: Theory and Experiments

Abstract

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Reinforcement Learning Control for a 2-DOF Helicopter With State Constraints: Theory and Experiments

Abstract

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources