Back to Search Start Over

A deep reinforcement learning based distributed control strategy for connected automated vehicles in mixed traffic platoon.

Authors :
Shi, Haotian
Chen, Danjue
Zheng, Nan
Wang, Xin
Zhou, Yang
Ran, Bin
Source :
Transportation Research Part C: Emerging Technologies. Mar2023, Vol. 148, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

• We develop a generic strategy for any CAV-HDV topology in mixed traffic to efficiently stabilize the traffic oscillations. • We propose a novel car-following structure, 'CAV-HDVs-CAV' which can better capture HDVs' stochasticity by aggregation. • Our method marries the merits of traffic flow theory, DRL, and network control theory organically for CAV control, which shows a very promising control performance. This paper proposes an innovative distributed longitudinal control strategy for connected automated vehicles (CAVs) in the mixed traffic environment of CAV and human-driven vehicles (HDVs), incorporating high-dimensional platoon information. For mixed traffic, the traditional CAV control method focuses on microscopic trajectory information, which may not be efficient in handling the HDV stochasticity (e.g., long reaction time; various driving styles) and mixed traffic heterogeneities. Different from traditional methods, our method, for the first time, characterizes consecutive HDVs as a whole (i.e., AHDV) to reduce the HDV stochasticity and utilize its macroscopic features to control the following CAVs. The new control strategy takes advantage of platoon information to anticipate the disturbances and traffic features induced downstream under mixed traffic scenarios and greatly outperforms the traditional methods. In particular, the control algorithm is based on deep reinforcement learning (DRL) to fulfill car-following control efficiency and further address the stochasticity for the aggregated car following behavior by embedding it in the training environment. To better utilize the macroscopic traffic features, a general platoon of mixed traffic is categorized as a CAV-HDVs-CAV pattern and described by corresponding DRL states. The macroscopic traffic flow properties are built upon the Newell car-following model to capture the characteristics of aggregated HDVs' joint behaviors. Simulated experiments are conducted to validate our proposed strategy. The results demonstrate that the proposed control method has outstanding performances in terms of oscillation dampening, eco-driving, and generalization capability. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0968090X
Volume :
148
Database :
Academic Search Index
Journal :
Transportation Research Part C: Emerging Technologies
Publication Type :
Academic Journal
Accession number :
161990603
Full Text :
https://doi.org/10.1016/j.trc.2023.104019