1. An Offline Reinforcement Learning Approach for Path Following of an Unmanned Surface Vehicle
- Author
-
Zexing Zhou, Tao Bao, Jun Ding, Yihong Chen, Zhengyi Jiang, and Bo Zhang
- Subjects
soft actor–critic ,offline reinforcement learning ,unmanned surface vehicles ,path following control ,Naval architecture. Shipbuilding. Marine engineering ,VM1-989 ,Oceanography ,GC1-1581 - Abstract
Path following is crucial for enhancing the autonomy of unmanned surface vehicles (USVs) in water monitoring missions. This paper presents an offline reinforcement learning (RL) controller for USVs. The controller employs the soft actor–critic algorithm with a diversified Q-ensemble to optimize the steering control policy using a pre-collected dataset of USV path-following trials. A Markov decision process (MDP) tailored for path following is formulated. The proposed offline RL steering controller, trained on static datasets, demonstrates improved sample efficiency and asymptotic performance due to an expanded ensemble of Q-networks. The accuracy and adaptive learning capabilities of the RL controller are validated through simulations and free-running tests.
- Published
- 2024
- Full Text
- View/download PDF