Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

When Does Multimodality Lead to Better Time Series Forecasting?

About

Recently, there has been growing interest in incorporating textual information into foundation models for time series forecasting. However, it remains unclear whether and under what conditions such multimodal integration consistently yields gains. We systematically investigate these questions across a diverse benchmark of 16 forecasting tasks spanning 7 domains, including health, environment, and economics. We evaluate two popular multimodal forecasting paradigms: aligning-based methods, which align time series and text representations; and prompting-based methods, which directly prompt large language models for forecasting. Our findings reveal that the benefits of multimodality are highly condition-dependent. While we confirm reported gains in some settings, these improvements are not universal across datasets or models. To move beyond empirical observations, we disentangle the effects of model architectural properties and data characteristics, drawing data-agnostic insights that generalize across domains. Our findings highlight that on the modeling side, incorporating text information is most helpful given (1) high-capacity text models, (2) comparatively weaker time series models, and (3) appropriate aligning strategies. On the data side, performance gains are more likely when (4) sufficient training data is available and (5) the text offers complementary predictive signal beyond what is already captured from the time series alone. Our study offers a rigorous, quantitative foundation for understanding when multimodality can be expected to aid forecasting tasks, and reveals that its benefits are neither universal nor always aligned with intuition.

Xiyuan Zhang, Boran Han, Haoyang Fang, Abdul Fatir Ansari, Shuai Zhang, Danielle C. Maddix, Cuixiong Hu, Andrew Gordon Wilson, Michael W. Mahoney, Hao Wang, Yan Liu, Huzefa Rangwala, George Karypis, Bernie Wang• 2025

Related benchmarks

TaskDatasetResultRank
Time Series ForecastingMTBench Finance Short
MAPE2.545
10
Time Series ForecastingMTBench Finance (Long)
MAPE3.757
10
Time Series ForecastingMTBench Weather Long
MSE13.331
10
Trend PredictionMTBench Weather Short
Past Trend Prediction Score91.758
10
Time Series ForecastingTimeMMD Social Good
MSE1.908
10
Time Series ForecastingTimeMMD Health (US)
MSE0.802
10
Trend PredictionMTBench Finance Short
3-way Score43.478
10
Trend PredictionMTBench Finance (Long)
3-way Accuracy48.288
10
Time Series ForecastingMTBench Weather Short
MSE11.233
10
Time Series ForecastingTimeMMD Environment
MAPE26.886
10
Showing 10 of 13 rows

Other info

Follow for update