Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method
Maojiang Deng, Shoufeng Lu, Jiazhao Shi, Wen Zhang
Urban Lifeline, 2026, Vol. 4, Issue (1): 9
This study proposes a novel adaptive traffic signal control method that uses a Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) to optimize signal timing, integrating a variable cell length with a multi-channel state representation. A road partition formula consisting of the sum of a logarithmic function and a linear function is proposed. The state is a vector composed of three channels: the number of vehicles, the average speed, and the space occupancy. The set of available signal phases constitutes the action space, and the selected phase is executed with a fixed green time. The reward function is formulated from the absolute values of three key traffic state metrics: waiting time, speed, and fuel consumption. Each metric is normalized by a typical maximum value and assigned a weight that reflects its priority and optimization direction. The simulation results, obtained with SUMO-TensorFlow-Python, include a cross-range transferability evaluation and show that the proposed variable cell length and multi-channel state representation method outperforms a fixed cell length in optimization performance.
Traffic signal control / Road partition / Variable cell length / Multi-channel state representation / Deep Q network / Proximal policy optimization
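The reward described in the abstract (each metric taken as an absolute value, normalized by a typical maximum, and weighted by priority and direction) can be illustrated with a minimal Python sketch. The weights, the typical maxima, the units, and the clipping of normalized values to 1 are all assumptions made for illustration; the abstract does not give these numbers, and this is not the authors' implementation.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class Metric:
    weight: float       # sign encodes direction: negative penalizes, positive rewards
    typical_max: float  # normalization constant (hypothetical value)


# Hypothetical weights and typical maxima; not taken from the paper.
METRICS = {
    "waiting_time": Metric(weight=-0.5, typical_max=100.0),  # seconds (assumed unit)
    "speed": Metric(weight=0.3, typical_max=15.0),           # m/s (assumed unit)
    "fuel": Metric(weight=-0.2, typical_max=50.0),           # ml (assumed unit)
}


def reward(observed: Mapping[str, float]) -> float:
    """Weighted sum of normalized absolute metric values."""
    total = 0.0
    for name, metric in METRICS.items():
        # Normalize the absolute value; clipping to 1.0 is an assumption.
        normalized = min(abs(observed[name]) / metric.typical_max, 1.0)
        total += metric.weight * normalized
    return total
```

Under these assumed signs, a longer waiting time or higher fuel consumption lowers the reward, while a higher average speed raises it.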