Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

The Procrustean Bed of Time Series: The Optimization Bias in Point-wise Loss Functions

About

Intuitively, a more deterministic time series should be easier to forecast. However, point-wise loss functions (e.g., MSE and MAE), serving as differentiable surrogates for the ideal optimization target, score each timestamp independently and therefore disregard temporal dependence. This mismatch induces a systematic optimization bias that cannot be eliminated merely by improving model expressiveness or optimizer. To formalize this issue, we define the Expectation of Optimization Bias (EOB) as the Kullback--Leibler divergence between the true joint distribution and the factorized i.i.d. surrogate induced by the point-wise paradigm. Under covariance-stationary Gaussian assumptions, we derive closed-form expressions for the stochastic component of EOB, establishing it as an irreducible lower bound on the total bias in linear systems, and further extend it to nonlinear regimes through a Gaussian mixture model lower bound. Crucially, we prove this bias is governed intrinsically by two data properties, i.e., sequence length and Structural Signal-to-Noise Ratio (SSNR), regardless of specific model architecture, optimizer, or point-wise loss forms. This theory motivates a principled debiasing program based on sequence length reduction and structural orthogonalization, which we instantiate through DFT/DWT combined with a novel harmonized $\ell_p$ norm. Extensive experiments validate the predicted SSNR--horizon dynamics, resolve the classic trigonometric fitting failure as an objective-induced pathology, and demonstrate substantial plug-and-play gains. Notably, on iTransformer, our proposed objective reduces average MSE/MAE by 5.2%/5.0% in forecasting across 11 datasets and by 27.4%/19.4% in imputation across 9 datasets.

Rongyao Cai, Yuxi Wan, Kexin Zhang, Ming Jin, Zhiqiang Ge, Daoyi Dong, Hang Yu, Yong Liu, Qingsong Wen• 2025

Related benchmarks

TaskDatasetResultRank
Long-term time-series forecastingETTh1 (test)
MSE0.436
410
Long-term time-series forecastingWeather (test)
MSE0.256
223
Long-term time-series forecastingETTm1 (test)
MSE0.393
199
Long-term time-series forecastingTraffic (test)
MSE0.429
182
Long-term forecastingExchange (test)
MAE0.409
144
Long-term time-series forecastingECL (test)
MSE0.17
90
Time Series ImputationETTh1 (test)
MSE0.0018
63
Time Series ImputationECL
MSE4.50e-4
57
Time Series ImputationETTm1 (test)
MSE0.0016
45
Time Series ImputationETTm2 (test)
MSE0.0027
45
Showing 10 of 18 rows

Other info

Follow for update