Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Noise Titration: Exact Distributional Benchmarking for Probabilistic Time Series Forecasting

About

Modern time series forecasting is evaluated almost entirely through passive observation of single historical trajectories, rendering claims about a model's robustness to non-stationarity fundamentally unfalsifiable. We propose a paradigm shift toward interventionist, exact-statistical benchmarking. By systematically titrating calibrated Gaussian observation noise into known chaotic and stochastic dynamical systems, we transform forecasting from a black-box sequence matching game into an exact distributional inference task. Because the underlying data-generating process and noise variance are mathematically explicit, evaluation can rely on exact negative log-likelihoods and calibrated distributional tests rather than heuristic approximations. To fully leverage this framework, we extend the Fern architecture into a probabilistic generative model that natively parameterizes the Symmetric Positive Definite (SPD) cone, outputting calibrated joint covariance structures without the computational bottleneck of generic Jacobian modeling. Under this rigorous evaluation, we find that state-of-the-art zero-shot foundation models behave consistently with the context-parroting mechanism, failing systematically under non-stationary regime shifts and elevated noise. In contrast, Fern explicitly captures the invariant measure and multivariate geometry of the underlying dynamics, maintaining structural fidelity and statistically sharp calibration precisely where massive sequence-matching models collapse.

Qilin Wang• 2026

Related benchmarks

TaskDatasetResultRank
Time Series ForecastingRossler Base
MSE0.016
17
Time Series ForecastingLorenz Base
MSE1.34
17
Time Series ForecastingLorenz-State
MSE1.82
17
Time Series ForecastingLorenz-Param
MSE4.48
17
Time Series ForecastingLorenz96 Base
MSE0.436
17
Time Series ForecastingLorenz96-Switch
MSE1.58
17
Time Series ForecastingChua Base
MSE0.006
17
Time Series ForecastingChua-Param
MSE0.008
17
Time Series ForecastingChua-Switch
MSE0.004
17
Time Series ForecastingSLDS-Switch
MSE3.87
17
Showing 10 of 59 rows

Other info

Follow for update