1. 基于出租车司机经验的约束深度 强化学习算法路径挖掘.
- Author
-
黄敏, 毛锋, and 钱宇翔
- Subjects
- *
REINFORCEMENT learning , *DEEP learning , *MACHINE learning , *AIR travel , *TIME measurements , *ROUTE choice - Abstract
This paper proposed constrained deep reinforcement learning( CDRL) to compute the fastest route online using taxi drivers' experience in different time period. Firstly, this paper described the extraction of experiential road segment database (ERSD). Then it introduced CDRL method, which mainly comprised of two phase: bounded condition of route and deep Q-learning algorithm. In the first phase, the task was to generate alternative constrained road segments of OD pair. In the second phase, it devised deep Q-learning algorithm to learning the experience of taxi drivers, and computed the fastest route of a given OD according to their departure time. Lastly, this paper tested an empirical studies in CBD of Guangzhou. The results show that the routes computed by CDRL method is approximately equal to shortest route ( SR) and fastest route (FR) method in travel time and route length. Furthermore, the CDRL method notably outperforms FR and SR in computing efficiency, so it is more suitable for online fastest route computation. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF