Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vision-LLMs for Spatiotemporal Traffic Forecasting

About

Accurate spatiotemporal traffic forecasting is a critical prerequisite for proactive resource management in dense urban mobile networks. While large language models have shown promise in time series analysis, they inherently struggle to model the complex spatial dependencies of grid-based traffic data. Effectively extending large language models to this domain is challenging, as representing the vast amount of information from dense geographical grids can be inefficient and overwhelm the model's context. To address these challenges, we propose ST-Vision-LLM, a novel framework that reframes spatiotemporal forecasting as a vision-language fusion problem. Our approach leverages a Vision-LLM visual encoder to process historical global traffic matrices as image sequences, providing the model with a comprehensive global view to inform cell-level predictions. To overcome the inefficiency of large language models in handling numerical data, we introduce an efficient encoding scheme that represents floating-point values as single tokens via a specialized vocabulary, coupled with a two-stage numerical alignment fine-tuning process. The model is first trained with supervised fine-tuning and then further optimized for predictive accuracy using group relative policy optimization, a memory-efficient reinforcement learning method. Evaluations on real-world mobile traffic datasets demonstrate that ST-Vision-LLM outperforms existing methods by 15.6% in long-term prediction accuracy and exceeds the best baseline by around 30% on average in cross-domain few-shot scenarios. Our extensive experiments validate the model's strong generalization capabilities across various data-scarce environments.

Ning Yang, Hengyu Zhong, Haijun Zhang, Randall Berry• 2025

Related benchmarks

TaskDatasetResultRank
Spatiotemporal Traffic ForecastingMilan-Internet
NRMSE0.1018
63
Spatiotemporal Traffic ForecastingTrentino-SMS
NRMSE0.9548
24
Spatiotemporal Traffic ForecastingMilan-SMS
NRMSE0.6801
24
Spatiotemporal Traffic ForecastingTrentino INTERNET
NRMSE0.6638
24
Traffic PredictionTrentino INTERNET
MAE14.5816
20
Spatio-temporal traffic forecastingMilan-Internet
MAE47.341
13
Cross-domain Traffic PredictionMilan-Internet
MAE47.1007
9
Cross-domain Traffic PredictionMilan-SMS
MAE7.2017
9
Cross-domain Traffic PredictionTrentino-SMS
MAE2.9704
9
Spatio-temporal predictionTrentino-SMS (10% train)
NRMSE0.9882
8
Showing 10 of 13 rows

Other info

Follow for update