Back to Search Start Over

TD3LVSL: A lane-level variable speed limit approach based on twin delayed deep deterministic policy gradient in a connected automated vehicle environment.

Authors :
Lu, Wenqi
Yi, Ziwei
Gu, Yuanli
Rui, Yikang
Ran, Bin
Source :
Transportation Research Part C: Emerging Technologies. Aug2023, Vol. 153, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• A lane-level VSL (LVSL) approach with actor-critic framework is proposed to provide differential VSL control. • A hybrid reward function is presented to adjust relationship between the reducing crash risk and improving traffic efficiency. • A twin delayed deep deterministic policy gradient (TD3)-based LVSL approach is established for the first time. • The TD3LVSL shows better performance than baseline models in terms of improving traffic safety and traffic efficiency. Variable speed limit (VSL) control plays a vital role in the emerging connected automated vehicle highway (CAVH) system, which can alleviate recurrent traffic congestion caused by capacity drop and accidents. However, the effect of the VSL on improving traffic efficiency is still controversial. It is necessary to study how to explore the potential benefits of the VSL by balancing its influence on reducing crash risk and enhancing traffic efficiency. To fill the technological gap above, this paper proposes a reinforcement learning-based lane-level VSL (LVSL) control approach to conduct refined traffic control on the mainlines. Firstly, an actor-critic framework is developed to generate and evaluate the discrete speed limits of each lane in continuous action space. To optimize traffic control performance, a hybrid reward function is developed by synchronously considering traffic safety and traffic efficiency of the bottleneck area. Then, to solve the overestimation bias problem of the actor-critic methods caused by function approximation error, a twin delayed deep deterministic policy gradient (TD3) method is introduced to train the framework of the LVSL method. Finally, a real-world recurrent bottleneck of the State Route 91 highway in California is simulated with consideration of the connected automated vehicles to examine the performance of the TD3-based LVSL (TD3LVSL) controller. The simulation results reveal that the proposed method is capable of reducing crash risk and improving traffic efficiency synchronously. Compared with the LVSL controller based on the deep deterministic policy gradient approach, the TD3LVSL controller shows better performance in terms of traffic safety and efficiency. These findings indicate that the proposed controller could contribute to future traffic control in a CAVH environment. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0968090X
Volume :
153
Database :
Academic Search Index
Journal :
Transportation Research Part C: Emerging Technologies
Publication Type :
Academic Journal
Accession number :
164854362
Full Text :
https://doi.org/10.1016/j.trc.2023.104221