CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal Control
About
Effective multi-intersection collaboration is pivotal for reinforcement-learning-based traffic signal control to alleviate congestion. Existing work mainly chooses neighboring intersections as collaborators. However, quite an amount of congestion, even some wide-range congestion, is caused by non-neighbors failing to collaborate. To address these issues, we propose to separate the collaborator selection as a second policy to be learned, concurrently being updated with the original signal-controlling policy. Specifically, the selection policy in real-time adaptively selects the best teammates according to phase- and intersection-level features. Empirical results on both synthetic and real-world datasets provide robust validation for the superiority of our approach, offering significant improvements over existing state-of-the-art methods. The code is available at https://github.com/bonaldli/CoSLight.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Adaptive Traffic Signal Control | Grid5x5 | Average Trip Time (s)220.3 | 20 | |
| Adaptive Traffic Signal Control | Grid4x4 | Average Trip Time (s)159.1 | 12 | |
| Adaptive Traffic Signal Control | Cologne8 | Average Trip Time (s)90.46 | 12 | |
| Adaptive Traffic Signal Control | Arterial4x4 | Avg Trip Time (s)364.2 | 12 | |
| Adaptive Traffic Signal Control | Ingolstadt21 | Average Trip Time (s)284.6 | 12 | |
| Adaptive Traffic Signal Control | Manhattan2668 Peak Transition | Average Trip Time (s)783.1 | 12 | |
| Adaptive Traffic Signal Control | Manhattan2668 Adverse Weather | Average Trip Time (s)1.09e+3 | 12 | |
| Adaptive Traffic Signal Control | Manhattan2668 | Avg Trip Time (s)1.05e+3 | 12 | |
| Adaptive Traffic Signal Control | Manhattan2668 (Holiday Rush) | Average Trip Time (seconds)1.28e+3 | 12 |