Back to Search Start Over

Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay

Authors :
Minjae Park
Chaneun Park
Nam Kyu Kwon
Source :
Biomimetics, Vol 9, Iss 1, p 51 (2024)
Publication Year :
2024
Publisher :
MDPI AG, 2024.

Abstract

In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional techniques for reinforcement learning simultaneously helps the mobile robot in finding an optimal policy to reach the destination without collisions. First, the multifunctional reward-shaping technique guides the agent toward the goal by utilizing information about the destination and obstacles. Next, employing the hindsight experience replay technique to address the experience imbalance caused by the sparse reward problem assists the agent in finding the optimal policy. We validated the proposed technique in both simulation and real-world environments. To assess the effectiveness of the proposed method, we compared experiments for five different cases.

Details

Language :
English
ISSN :
23137673
Volume :
9
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Biomimetics
Publication Type :
Academic Journal
Accession number :
edsdoj.950a3daf4b644819c7c65edf2e9359e
Document Type :
article
Full Text :
https://doi.org/10.3390/biomimetics9010051