T-Graphormer: Using Transformers for Spatiotemporal Forecasting
About
Spatiotemporal data is ubiquitous, and forecasting it has important applications in many domains. However, its complex cross-component dependencies and non-linear temporal dynamics can be challenging for traditional techniques. Existing methods address this by learning the spatial and temporal dimensions separately. Here, we introduce Temporal Graphormer (T-Graphormer), a Transformer-based approach capable of modelling spatiotemporal correlations simultaneously. Adding temporal encodings to the Graphormer architecture lets each node attend to all other tokens within the graph sequence, enabling the model to learn rich spacetime patterns with minimal predefined inductive biases. We show the effectiveness of T-Graphormer on real-world traffic prediction benchmark datasets. Compared to state-of-the-art methods, T-Graphormer reduces root mean squared error (RMSE) and mean absolute percentage error (MAPE) by up to 20% and 10%, respectively.
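The idea of letting each node attend to every token in the graph sequence can be illustrated with a small sketch. This is not the authors' implementation: the function names, the additive node and time embeddings, the scalar value projection, and the use of identity query/key projections are all assumptions made for illustration, and Graphormer's structural encodings are omitted. It only shows how a grid of T time steps by N nodes can be flattened into T*N tokens, each tagged with a temporal encoding, so that a single attention pass spans both space and time.

```python
import math
import random


def build_tokens(readings, node_emb, time_emb, value_proj):
    """Flatten a [T][N] grid of scalar readings into T*N token vectors.

    Each token is value * value_proj + node embedding + time embedding.
    All three terms are hypothetical stand-ins for learned parameters.
    """
    tokens = []
    for t, row in enumerate(readings):
        for n, value in enumerate(row):
            tokens.append([
                value * value_proj[k] + node_emb[n][k] + time_emb[t][k]
                for k in range(len(value_proj))
            ])
    return tokens


def full_attention_weights(tokens):
    """Softmax attention of every token over every token.

    Queries and keys are the tokens themselves (identity projections),
    an assumption that keeps the sketch short.
    """
    d = len(tokens[0])
    weights = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


def random_vector(dim):
    return [random.gauss(0.0, 0.1) for _ in range(dim)]


random.seed(0)
T, N, D = 3, 4, 8  # time steps, graph nodes, embedding size (toy values)
readings = [[random.uniform(40.0, 70.0) for _ in range(N)] for _ in range(T)]
node_emb = [random_vector(D) for _ in range(N)]
time_emb = [random_vector(D) for _ in range(T)]
value_proj = random_vector(D)

tokens = build_tokens(readings, node_emb, time_emb, value_proj)
weights = full_attention_weights(tokens)
print(len(tokens), len(weights[0]))
```

Because the attention matrix has one row and one column per (time step, node) pair, a sensor at one time step can weight any other sensor at any other time step directly, rather than passing through separate spatial and temporal modules.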
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Traffic speed forecasting | METR-LA (test) | -- | 200 |
| Spatiotemporal forecasting | PEMS-BAY (test) | MAE (Horizon 3): 1.16 | 11 |
| Spatiotemporal forecasting | PEMS03 Horizon 12 (test) | MAE: 15.88 | 8 |
| Spatiotemporal forecasting | PEMS04 Horizon 12 (test) | MAE: 20.4 | 8 |
| Spatiotemporal forecasting | PEMS08 Horizon 12 (test) | MAE: 16.88 | 8 |
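For reference, the error metrics named on this page (MAE in the table, RMSE and MAPE in the abstract) can be computed as in the sketch below. The function names and the toy numbers are illustrative only. Skipping zero-valued targets in MAPE is an assumption made here to avoid division by zero; the source does not state how the benchmarks handle missing or zero readings.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent.

    Targets equal to zero are skipped (an assumption, see the lead-in).
    """
    pairs = [(t, p) for t, p in zip(y_true, y_pred) if t != 0]
    return 100.0 * sum(abs((t - p) / t) for t, p in pairs) / len(pairs)


# Toy speeds, not taken from any benchmark.
observed = [60.0, 50.0, 40.0]
predicted = [57.0, 54.0, 40.0]
print(mae(observed, predicted), rmse(observed, predicted), mape(observed, predicted))
```

MAE and RMSE are in the units of the target (for example, speed or flow), while MAPE is scale-free, which is why results on different datasets are usually compared per metric rather than across metrics.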