Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay

Authors :: Minjae Park
Chaneun Park
Nam Kyu Kwon
Source :: Biomimetics, Vol 9, Iss 1, p 51 (2024)
Publication Year :: 2024
Publisher :: MDPI AG, 2024.
Abstract: In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional reinforcement learning techniques simultaneously helps the mobile robot find an optimal policy for reaching the destination without collisions. First, the multifunctional reward-shaping technique guides the agent toward the goal by utilizing information about the destination and obstacles. Second, the hindsight experience replay technique addresses the experience imbalance caused by the sparse reward problem, which helps the agent find the optimal policy. We validated the proposed technique in both simulation and real-world environments. To assess the effectiveness of the proposed method, we compared experimental results across five different cases.
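
The two techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the reward terms, the coefficients, the function names `shaped_reward` and `her_relabel`, and the use of 2D positions as states are all assumptions made for illustration. The sketch shows a shaped reward that uses destination and obstacle information, and a "future"-style hindsight relabeling step that turns positions the robot actually reached into substitute goals, so that an episode that missed its goal still yields rewarded transitions.

```python
import math
import random


def shaped_reward(pos, goal, obstacles, goal_radius=0.2, safe_dist=0.5):
    """Hypothetical shaped reward (coefficients are illustrative assumptions).

    Returns a bonus when the goal is reached; otherwise a penalty that grows
    with distance to the goal and with proximity to any nearby obstacle.
    """
    d_goal = math.dist(pos, goal)
    if d_goal <= goal_radius:
        return 1.0
    reward = -0.01 * d_goal
    for obs in obstacles:
        d_obs = math.dist(pos, obs)
        if d_obs < safe_dist:
            reward -= 0.1 * (safe_dist - d_obs)
    return reward


def her_relabel(episode, obstacles, k=4, rng=random):
    """Hindsight relabeling with the 'future' strategy (assumed variant).

    `episode` is a list of (state, action, next_state, goal) tuples, where
    states are 2D positions. For each transition, k substitute goals are
    drawn from positions reached at the same or a later step, and the
    reward is recomputed against each substitute goal.
    """
    relabeled = []
    for t, (state, action, next_state, _goal) in enumerate(episode):
        future = episode[t:]
        for _ in range(k):
            new_goal = rng.choice(future)[2]  # a position actually reached
            r = shaped_reward(next_state, new_goal, obstacles)
            relabeled.append((state, action, r, next_state, new_goal))
    return relabeled


if __name__ == "__main__":
    # A short episode that never reaches its original goal at (5, 5).
    episode = [
        ((0.0, 0.0), (1.0, 0.0), (1.0, 0.0), (5.0, 5.0)),
        ((1.0, 0.0), (1.0, 0.0), (2.0, 0.0), (5.0, 5.0)),
        ((2.0, 0.0), (0.0, 1.0), (2.0, 1.0), (5.0, 5.0)),
    ]
    extra = her_relabel(episode, obstacles=[(1.0, 0.25)], rng=random.Random(0))
    print(len(extra), "relabeled transitions")
```

In an off-policy learner such as deep deterministic policy gradient, the relabeled transitions would be stored in the replay buffer alongside the original ones; that integration is omitted here.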

Subjects :: deep deterministic policy gradient
multifunctional reward shaping
hindsight experience replay
mobile robot
autonomous driving
Technology
