also paves the way for integration with next
generation technologies, especially connected and
automated vehicles, in which real-time data
exchange can further improve system responsiveness
and predictability.
Although it is highly performant when running in
simulated environments, the development of an actual
deployable version of this system does have to
overcome some challenges such as data validity,
infrastructure status, and computational resources.
However, these challenges are not insurmountable
and can be addressed gradually through staggered
integration, policy assistance and technology
enhancement. To the extent that cities are moving
towards smarter infrastructure, systems similar to the
adaptive, learning-based traffic control proposed in
this paper could be a linchpin for smart & sustainable
urban mobility.
In summary, the framework of reinforcement
learning for the control of traffic signal is a
revolutionary method, consistent with the
requirements of the current urban traffic system. It
helps to maintain the intelligence, scalability and
environment friendliness of the traffic management
system, creating opportunity for the smart city and
intelligent transportation network for further
development.
REFERENCES
Bie, Y., Ji, Y., & Ma, D. (2024). Multi-agent deep
reinforcement learning collaborative traffic signal
control method considering intersection heterogeneity.
Transportation Research Part C: Emerging
Technologies, 164, 104663.
Busch, J. V. S., Voelckner, R., Sossalla, P., Vielhaus, C. L.,
Calandra, R., & Fitzek, F. H. P. (2023). Deep
reinforcement learning for the joint control of traffic
light signaling and vehicle speed advice. arXiv preprint
arXiv:2309.09881.
El-Tantawy, S., Abdulhai, B., & Abdelgawad, H. (2013).
Multiagent reinforcement learning for integrated
network of adaptive traffic signal controllers
(MARLIN-ATSC): Methodology and large-scale
application on downtown Toronto. IEEE Transactions
on Intelligent Transportation Systems, 14(3), 1140–
1150.
Fan, L., Yang, Y., Ji, H., & Xiong, S. (2025). Optimization
of traffic signal cooperative control with sparse deep
reinforcement learning based on knowledge sharing.
Electronics, 14.
Feng, S., Yan, X., Sun, H., Feng, Y., & Liu, H. X. (2021).
Intelligent driving intelligence test for autonomous
vehicles with naturalistic and adversarial environment.
Nature Communications, 12, 748. https://doi.org/10.10
38/s41467-021-21007-8
Feng, S., Sun, H., Yan, X., Zhu, H., Zou, Z., Shen, S., &
Liu, H. X. (2023). Dense reinforcement learning for
safety validation of autonomous vehicles. Nature, 615,
620–627. https://doi.org/10.1038/s41586-023-05732-2
Li, D., Zhu, F., Wu, J., Wong, Y. D., & Chen, T. (2024).
Adaptive urban traffic signal control based on enhanced
deep reinforcement learning. Scientific Reports, 14,
Article 64885. https://doi.org/10.1038/s41598-024-
64885-w
Li, D., Zhu, F., Wu, J., Wong, Y. D., & Chen, T. (2024).
Managing mixed traffic at signalized intersections: An
adaptive signal control and CAV coordination system
based on deep reinforcement learning. Expert Systems
with Applications, 238, 121959.
Liu, H. X., Wu, X., Ma, W., & Hu, H. (2009). Real-time
queue length estimation for congested signalized
intersections. Transportation Research Part C:
Emerging Technologies, 17(4), 412–427.
Peng, X., Gao, H., Wang, H., & Zhang, H. M. (2023).
Combat urban congestion via collaboration:
Heterogeneous GNN-based MARL for coordinated
platooning and traffic signal control. arXiv preprint
arXiv:2310.10948.
Qi, R., Wang, Y., Liu, Y., & Zhang, Y. (2022). Random
ensemble reinforcement learning for traffic signal
control. arXiv preprint arXiv:2203.05961.
Rafique, M. T., Mustafa, A., & Sajid, H. (2024). Adaptive
traffic signal control using reinforcement learning.
ResearchGate.
Rafique, M. T., Mustafa, A., & Sajid, H. (2024).
Reinforcement learning for adaptive traffic signal
control: Turn-based and time-based approaches to
reduce congestion. arXiv preprint arXiv:2408.15751.
Rahman Swapno, S. M. M., Nobel, S. M. N., Meena, P.,
Meena, V. P., Azar, A. T., Haider, Z., & Tounsi, M.
(2024). A reinforcement learning approach for reducing
traffic congestion using deep Q learning. Scientific
Reports, 14, Article 30452. https://doi.org/10.1038/s4
1598-024-75638-0
Wang, M., Xiong, X., Kan, Y., Xu, C., & Pun, M.-O.
(2023). UniTSA: A universal reinforcement learning
framework for V2X traffic signal control. arXiv
preprint arXiv:2312.05090.
Wang, X., Jerome, Z., Wang, Z., Zhang, C., Shen, S.,
Kumar, V., ... & Liu, H. X. (2024). Traffic light
optimization with low penetration rate vehicle
trajectory data. Nature Communications, 15, 1306
https://doi.org/10.1038/s41467-024-45427-4
Xie, X.-F., Smith, S. F., & Barlow, G. J. (2022). Scalable
urban traffic control: Decentralized and schedule-
driven coordination. Transportation Research Part C:
Emerging Technologies, 135, 103465
Yan, X., Zou, Z., Zhu, H., Sun, H., & Liu, H. X. (2023).
Learning naturalistic driving environment with
statistical realism. Nature Communications, 14, 2037.
https://doi.org/10.1038/s41467-023-37677-5