Breaking the Regional Barrier: Inductive Semantic Topology Learning for Worldwide Air Quality Forecasting
About
Global air quality forecasting grapples with extreme spatial heterogeneity and the poor generalization of existing transductive models to unseen regions. To tackle this, we propose OmniAir, a semantic topology learning framework tailored for global station-level prediction. By encoding invariant physical environmental attributes into generalizable station identities and dynamically constructing adaptive sparse topologies, our approach effectively captures long-range non-Euclidean correlations and physical diffusion patterns across unevenly distributed global networks. We further curate WorldAir, a massive dataset covering over 7,800 stations worldwide. Extensive experiments show that OmniAir achieves state-of-the-art performance against 18 baselines, maintaining high efficiency and scalability with speeds nearly 10 times faster than existing models, while effectively bridging the monitoring gap in data-sparse regions.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Air quality forecasting | KnowAir | MAE7.58 | 45 | |
| Air quality forecasting | LargeAQ | MAE13.42 | 45 | |
| Air quality forecasting | Global air quality dataset | MAE11.28 | 29 | |
| Air quality forecasting | USA regional air quality | MAE10.38 | 24 | |
| Air quality forecasting | China regional air quality subset | MAE7.72 | 24 | |
| Air quality forecasting | Europe regional air quality | MAE11.7 | 24 |