Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay
- Source :
- Biomimetics, Vol 9, Iss 1, p 51 (2024)
- Publication Year :
- 2024
- Publisher :
- MDPI AG, 2024.
Abstract
- In this paper, we propose a reinforcement learning-based end-to-end learning method for the autonomous driving of a mobile robot in a dynamic environment with obstacles. Applying two additional reinforcement learning techniques simultaneously helps the mobile robot find an optimal policy for reaching the destination without collisions. First, the multifunctional reward-shaping technique guides the agent toward the goal by utilizing information about the destination and obstacles. Second, the hindsight experience replay technique addresses the experience imbalance caused by the sparse reward problem, which assists the agent in finding the optimal policy. We validated the proposed technique in both simulation and real-world environments. To assess the effectiveness of the proposed method, we compared experimental results across five different cases.
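The two techniques named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract gives no formulas, so the reward terms, the thresholds (`tol`, `safe_distance`), and every name below (`Transition`, `sparse_reward`, `shaped_reward`, `relabel_with_final_goal`) are assumptions made for illustration. The shaped reward combines progress toward the goal with a penalty for being close to an obstacle, and the relabeling step follows the generic "final" strategy of hindsight experience replay, in which a failed episode is stored again as if the position it actually reached had been the goal.

```python
import math
from dataclasses import dataclass, replace
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Transition:
    """One step of an episode (hypothetical layout, not taken from the paper)."""
    position: Point       # robot position before the action
    next_position: Point  # robot position after the action
    goal: Point           # goal the agent was pursuing
    reward: float         # reward stored in the replay buffer


def distance(a: Point, b: Point) -> float:
    return math.hypot(a[0] - b[0], a[1] - b[1])


def sparse_reward(next_position: Point, goal: Point, tol: float = 0.2) -> float:
    """0 when the goal is reached within `tol`, otherwise -1 (assumed convention)."""
    return 0.0 if distance(next_position, goal) <= tol else -1.0


def shaped_reward(position: Point, next_position: Point, goal: Point,
                  obstacle_distance: float, safe_distance: float = 0.5) -> float:
    """Assumed shaping: progress toward the goal minus an obstacle-proximity penalty."""
    progress = distance(position, goal) - distance(next_position, goal)
    penalty = max(0.0, safe_distance - obstacle_distance)
    return progress - penalty


def relabel_with_final_goal(episode: List[Transition]) -> List[Transition]:
    """HER 'final' strategy: treat the last achieved position as the goal."""
    achieved = episode[-1].next_position
    return [
        replace(t, goal=achieved, reward=sparse_reward(t.next_position, achieved))
        for t in episode
    ]


# Example: an episode that moves toward, but never reaches, the goal at (10, 0).
goal = (10.0, 0.0)
path = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
episode = [
    Transition(p, q, goal, sparse_reward(q, goal))
    for p, q in zip(path, path[1:])
]
hindsight = relabel_with_final_goal(episode)
```

In this sketch every original transition carries the failure reward, while the relabeled copy ends with a success, which is how hindsight relabeling offsets the imbalance between failed and successful experiences under a sparse reward. Both the original and the relabeled transitions would typically be stored in the replay buffer of the off-policy learner.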
Details
- Language :
- English
- ISSN :
- 2313-7673
- Volume :
- 9
- Issue :
- 1
- Database :
- Directory of Open Access Journals
- Journal :
- Biomimetics
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.950a3daf4b644819c7c65edf2e9359e
- Document Type :
- article
- Full Text :
- https://doi.org/10.3390/biomimetics9010051