AttendLight: Universal Attention-Based Reinforcement Learning Model for Traffic Signal Control
About
We propose AttendLight, an end-to-end Reinforcement Learning (RL) algorithm for the problem of traffic signal control. Previous approaches for this problem have the shortcoming that they require training for each new intersection with a different structure or traffic flow distribution. AttendLight solves this issue by training a single, universal model for intersections with any number of roads, lanes, phases (possible signals), and traffic flow. To this end, we propose a deep RL model which incorporates two attention models. The first attention model is introduced to handle different numbers of roads-lanes; and the second attention model is intended for enabling decision-making with any number of phases in an intersection. As a result, our proposed model works for any intersection configuration, as long as a similar configuration is represented in the training set. Experiments were conducted with both synthetic and real-world standard benchmark data-sets. The results we show cover intersections with three or four approaching roads; one-directional/bi-directional roads with one, two, and three lanes; different number of phases; and different traffic flows. We consider two regimes: (i) single-environment training, single-deployment, and (ii) multi-environment training, multi-deployment. AttendLight outperforms both classical and other RL-based approaches on all cases in both regimes.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Traffic Signal Control | Jinan-2 | Average Travel Time (ATT)280.9 | 48 | |
| Traffic Signal Control | Jinan-1 | Avg Travel Time (ATT)273 | 38 | |
| Traffic Signal Control | Hangzhou D_HZ(2) | Average Travel Time (s)411.2 | 32 | |
| Traffic Signal Control | Hangzhou (HZ-1) | Average Travel Time (ATT)491.4 | 24 | |
| Traffic Signal Control | Jinan (JN-3) | Average Travel Time (ATT)446.2 | 22 | |
| Adaptive Traffic Signal Control | Grid5x5 | Average Trip Time (s)767.4 | 20 | |
| Traffic Signal Control | Monaco network heterogeneous (test) | Queue Length1.46 | 18 | |
| Multi-agent Traffic Signal Control | Homogeneous Grid 5x5 network (test) | Queue Length3.37 | 15 | |
| Traffic Signal Control | Hangzhou | ATT (Avg Travel Time)322.9 | 14 | |
| Adaptive Traffic Signal Control | Heterogeneous Monaco Network | Average Trip Duration (s)909.5 | 8 |