Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Beyond All-to-All: Causal-Aligned Transformer with Dynamic Structure Learning for Multivariate Time Series Forecasting

About

Most existing multivariate time series forecasting methods adopt an all-to-all paradigm that feeds all variable histories into a unified model to predict their future values without distinguishing their individual roles. However, this undifferentiated paradigm makes it difficult to identify variable-specific causal influences and often entangles causally relevant information with spurious correlations. To address this limitation, we propose an all-to-one forecasting paradigm that predicts each target variable separately. Specifically, we first construct a Structural Causal Model from observational data and then, for each target variable, we partition the historical sequence into four subsegments according to the inferred causal structure: endogenous, direct causal, collider causal, and spurious correlation. Furthermore, we propose the Causal Decomposition Transformer (CDT), which integrates a dynamic causal adapter to learn causal structures initialized by the inferred graph, enabling correction of imperfect causal discovery during training. Furthermore, motivated by causal theory, we apply a projection-based output constraint to mitigate collider induced bias and improve robustness. Extensive experiments on multiple benchmark datasets demonstrate the effectiveness of the CDT.

Xingyu Zhang, Hanyun Du, Zeen Song, Siyu Zhao, Changwen Zheng, Wenwen Qiang• 2025

Related benchmarks

TaskDatasetResultRank
Multivariate ForecastingETTh1
MSE0.406
830
Multivariate Time-series ForecastingETTm1
MSE0.365
686
Multivariate Time-series ForecastingETTm2
MSE0.268
539
Multivariate Time-series ForecastingWeather
MSE0.239
409
Multivariate ForecastingETTh2
MSE0.358
359
Multivariate Time-series ForecastingTraffic
MSE0.411
310
Multivariate Time-series ForecastingExchange
MAE0.395
262
Multivariate Time-series ForecastingECL
MSE0.165
84
Multivariate long-term forecastingETTm1 T=96 (test)
MSE0.307
39
Multivariate Time-series ForecastingTraffic S=720 (test)
MSE0.441
14
Showing 10 of 10 rows

Other info

Follow for update