Heterogeneous Multi-Agent Reinforcement Learning for Large-Scale Cooperative Control of Connected Vehicles and Traffic Lights in Mixed Traffic Environments

35 Pages Posted: 12 Apr 2025

See all articles by Bowen Chen

Bowen Chen

Kunming University of Science and Technology

Fuxing Wei

Kunming University of Science and Technology

Fengxiang Guo

Kunming University of Science and Technology

shiquan shen

Kunming University of Science and Technology

Jiangwei Shen

Kunming University of Science and Technology

Yuanjian Zhang

Tongji University

Zheng Chen

Kunming University of Science and Technology

Xing Shu

affiliation not provided to SSRN

Abstract

With the rapid development of vehicle networking and autonomous driving technology, mixed traffic scenarios—where human-driven vehicles (HDVs) coexist with connected and autonomous vehicles (CAVs) and interact with traffic lights (TLs)—have become increasingly prevalent. However, the heterogeneity of CAVs and TLs poses significant challenges to promotion of overall traffic efficiency through traditional control methods. To address this challenge, this study proposes a heterogeneous multi-agent reinforcement learning (HMARL) algorithm based on potential game (PG). Specifically, an information-sharing framework that leverages a heterogeneous graph structure and an encoder is employed to effectively integrate diverse data from CAVs and TLs. Secondly, the CAVs and TLs are integrated into a PG structure, wherein all agents share the same goal, which enables theoretically validated reward convergence and individual policy adjustments to be uniformly quantified, and a computationally efficient safety projection mechanism is introduced in PG to enhance motion safety. Extensive simulations on the city-scale demonstrate the preferable performance of the algorithm. The average speed is promoted by [[EQUATION]] and the traffic density is reduced by [[EQUATION]] during peak hours compared to baselines. In a custom urban network, the HMARL increases the average vehicle speed by approximately [[EQUATION]], exhibiting more stable synergistic effects in heterogeneous multi-agent strategy convergence.

Keywords: Heterogeneous multi-agent reinforcement learning (HMARL), connected and automated vehicles (CAVs), mixed traffic, potential game (PG), safety projection

Suggested Citation

Chen, Bowen and Wei, Fuxing and Guo, Fengxiang and shen, shiquan and Shen, Jiangwei and Zhang, Yuanjian and Chen, Zheng and Shu, Xing, Heterogeneous Multi-Agent Reinforcement Learning for Large-Scale Cooperative Control of Connected Vehicles and Traffic Lights in Mixed Traffic Environments. Available at SSRN: https://ssrn.com/abstract=5214485 or http://dx.doi.org/10.2139/ssrn.5214485

Bowen Chen

Kunming University of Science and Technology ( email )

Kunming Yunnan China
Kunming
China

Fuxing Wei

Kunming University of Science and Technology ( email )

Kunming Yunnan China
Kunming
China

Fengxiang Guo

Kunming University of Science and Technology ( email )

Kunming Yunnan China
Kunming
China

Shiquan Shen

Kunming University of Science and Technology ( email )

Jiangwei Shen

Kunming University of Science and Technology ( email )

Kunming Yunnan China
Kunming
China

Yuanjian Zhang

Tongji University ( email )

1239 Siping Road
Shanghai, 200092
China

Zheng Chen (Contact Author)

Kunming University of Science and Technology ( email )

Kunming Yunnan China
Kunming
China

Xing Shu

affiliation not provided to SSRN ( email )

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
23
Abstract Views
139
PlumX Metrics