Back to Search
Start Over
Applications in Traffic Signal Control: A Distributed Policy Gradient Decomposition Algorithm
- Source :
- IEEE Transactions on Industrial Informatics; February 2024, Vol. 20 Issue: 2 p2762-2775, 14p
- Publication Year :
- 2024
-
Abstract
- This article explores the application of the multiagent reinforcement learning (MARL) algorithm in addressing the large-scale traffic signal control (TSC) problem. To address the TSC problem in complex urban traffic networks, most existing algorithms focus on optimizing local traffic flow at each intersection through decentralized training based on either the local observations or messages from its neighboring intersections, which, however, lacks the concept of cooperative learning. To conquer such limitations, a novel distributed critic with decentralized actor (DCDA) framework is proposed, which allows the communication messages and temporal difference (TD) losses to be exchanged among neighboring intersections. Specially, by considering the traffic network as a communication network of agents (more precisely, intersections and lanes are considered as agents and edges, respectively), a distributed global average TD loss estimation algorithm is designed in the distributed critic step to estimate the global average TD loss estimation and enhance collaboration among agents. Moreover, in the decentralized actor step, the policy gradient decomposition method is adopted for each agents to learn its local policy solely based on its local action-value function. By adhering to the DCDA framework, a novel distributed policy gradient decomposition (DPGD) algorithm is further proposed to address the TSC problem. Empirical experiments demonstrate that the efficiency, robustness, and stability of the DPGD algorithm outperform the state-of-the-art MARL algorithms in both the environments of cooperative adaptive cruise control and adaptive traffic signal control.
Details
- Language :
- English
- ISSN :
- 15513203
- Volume :
- 20
- Issue :
- 2
- Database :
- Supplemental Index
- Journal :
- IEEE Transactions on Industrial Informatics
- Publication Type :
- Periodical
- Accession number :
- ejs65300927
- Full Text :
- https://doi.org/10.1109/TII.2023.3296887