Spatio-temporal dual-stage hypergraph MARL for human-centric multimodal corridor traffic signal control

About

Human-centric traffic signal control in corridor networks must increasingly account for multimodal travelers, particularly high-occupancy public transportation, rather than focusing solely on vehicle-centric performance. This paper proposes STDSH-MARL (Spatio-Temporal Dual-Stage Hypergraph based Multi-Agent Reinforcement Learning), a scalable multi-agent deep reinforcement learning framework that follows a centralized training and decentralized execution paradigm. The proposed method captures spatio-temporal dependencies through a novel dual-stage hypergraph attention mechanism that models interactions across both spatial and temporal hyperedges. In addition, a hybrid discrete action space is introduced to jointly determine the next signal phase configuration and its corresponding green duration, enabling more adaptive signal timing decisions. Experiments conducted on a corridor network under five traffic scenarios demonstrate that STDSH-MARL consistently improves multimodal performance and provides clear benefits for public transportation priority. Compared with state-of-the-art baseline methods, the proposed approach achieves superior overall performance. Further ablation studies confirm the contribution of each component of STDSH-MARL, with temporal hyperedges identified as the most influential factor driving the observed performance gains.

Xiaocai Zhang, Neema Nassir, Milad Haghani• 2026

Related benchmarks

Task	Dataset	Result
Traffic Signal Control	VISSIM Scenario 2 (off-peak to peak period)	ANP1.55e+3	7
Traffic Signal Control	VISSIM Scenario 5	ANP1.57e+3	7
Traffic Signal Control	VISSIM Scenario 4 (morning school period) (test)	ANP1.55e+3	7
Traffic Signal Control	Scenario VISSIM corridor 1	ANP1.38e+3	7
Traffic Signal Control	VISSIM Synthetic Road Corridor Scenario 3	ANP1.65e+3	7

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord