Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Beyond All-to-All: Causal-Aligned Transformer with Dynamic Structure Learning for Multivariate Time Series Forecasting

About

Most existing multivariate time series forecasting methods adopt an all-to-all paradigm that feeds all variable histories into a unified model to predict their future values without distinguishing their individual roles. However, this undifferentiated paradigm makes it difficult to identify variable-specific causal influences and often entangles causally relevant information with spurious correlations. To address this limitation, we propose an all-to-one forecasting paradigm that predicts each target variable separately. Specifically, we first construct a Structural Causal Model from observational data and then, for each target variable, we partition the historical sequence into four subsegments according to the inferred causal structure: endogenous, direct causal, collider causal, and spurious correlation. Furthermore, we propose the Causal Decomposition Transformer (CDT), which integrates a dynamic causal adapter to learn causal structures initialized by the inferred graph, enabling correction of imperfect causal discovery during training. Furthermore, motivated by causal theory, we apply a projection-based output constraint to mitigate collider induced bias and improve robustness. Extensive experiments on multiple benchmark datasets demonstrate the effectiveness of the CDT.

Xingyu Zhang, Hanyun Du, Zeen Song, Siyu Zhao, Changwen Zheng, Wenwen Qiang• 2025

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.406
686
Multivariate Time-series ForecastingETTm1
MSE0.365
466
Multivariate Time-series ForecastingETTm2
MSE0.268
389
Multivariate ForecastingETTh2
MSE0.358
350
Multivariate Time-series ForecastingWeather
MSE0.239
340
Multivariate Time-series ForecastingTraffic
MSE0.411
264
Multivariate Time-series ForecastingExchange
MAE0.395
181
Multivariate Time-series ForecastingECL
MSE0.165
66
Multivariate long-term forecastingETTm1 T=96 (test)
MSE0.307
39
Multivariate Time-series ForecastingTraffic S=720 (test)
MSE0.441
14
Showing 10 of 10 rows

Other info

Follow for update