Spatio-temporal dual-stage hypergraph MARL for human-centric multimodal corridor traffic signal control
About
Human-centric traffic signal control in corridor networks must increasingly account for multimodal travelers, particularly high-occupancy public transportation, rather than focusing solely on vehicle-centric performance. This paper proposes STDSH-MARL (Spatio-Temporal Dual-Stage Hypergraph based Multi-Agent Reinforcement Learning), a scalable multi-agent deep reinforcement learning framework that follows a centralized training and decentralized execution paradigm. The proposed method captures spatio-temporal dependencies through a novel dual-stage hypergraph attention mechanism that models interactions across both spatial and temporal hyperedges. In addition, a hybrid discrete action space is introduced to jointly determine the next signal phase configuration and its corresponding green duration, enabling more adaptive signal timing decisions. Experiments conducted on a corridor network under five traffic scenarios demonstrate that STDSH-MARL consistently improves multimodal performance and provides clear benefits for public transportation priority. Compared with state-of-the-art baseline methods, the proposed approach achieves superior overall performance. Further ablation studies confirm the contribution of each component of STDSH-MARL, with temporal hyperedges identified as the most influential factor driving the observed performance gains.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Traffic Signal Control | VISSIM Scenario 2 (off-peak to peak period) | ANP1.55e+3 | 7 | |
| Traffic Signal Control | VISSIM Scenario 5 | ANP1.57e+3 | 7 | |
| Traffic Signal Control | VISSIM Scenario 4 (morning school period) (test) | ANP1.55e+3 | 7 | |
| Traffic Signal Control | Scenario VISSIM corridor 1 | ANP1.38e+3 | 7 | |
| Traffic Signal Control | VISSIM Synthetic Road Corridor Scenario 3 | ANP1.65e+3 | 7 |