Controlling Transient Amplification Improves Long-horizon Rollouts
About
Autoregressive neural simulators now match classical solvers on short-horizon prediction of physical systems, yet their accuracy degrades rapidly when rolled out over long horizons. In this work, we identify transient amplification of perturbations around rollout trajectories as a structural mechanism driving rollout error. Using a linearization analysis we show that when the Jacobians along an autoregressive trajectory are non-normal and non-commuting, the model amplifies errors transiently, resulting in model rollout drift even when the overall system is asymptotically stable. Building on the analysis, we propose commutativity regularization: a combination of two penalties designed to reduce the normality defect of individual Jacobians and the commutator norm of Jacobians across steps. The penalties are estimated with Jacobian-vector products and have no inference-time cost. We show a propagator bound that quantifies rollout error under approximate commutativity and normality. We evaluate UNet and FNO variants with commutativity regularization on 1D and 2D spatio-temporal data in synthetic and real settings, showing successful long-horizon rollouts over thousands of steps. Further, we show that the method improves FourCastNet climate forecasts on ERA5 without using any new data. The gain is most pronounced out-of-distribution: trained on trajectories of a few hundred steps, regularized models remain in-distribution for thousands of rollout steps on initial conditions where baselines diverge.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Rollout Forecasting | BVE 30-trajectory (test) | RMSE (mean)0.0028 | 20 | |
| Sea Surface Temperature Forecasting | OISST 1°×1° (test) | -- | 6 | |
| 1D KdV forecasting | KdV single-soliton in-distribution (test) | NMSE (Step 50)2.1 | 4 | |
| 1D PDE forecasting | KdV multi-soliton out-of-distribution (test) | nMSE (Step 50)1.6 | 3 | |
| Long-horizon rollout forecasting | KdV single-soliton in-distribution (test) | nMSE (Step 50)1.3 | 3 | |
| Geopotential (z500) Rollout Prediction | ERA5 2018 | RMSE (1d)82.7 | 3 | |
| Geopotential at 500hPa (z500) rollout forecasting | ERA5 2019 | Forecast Error (1 Day)84.7 | 3 | |
| 2m Temperature (t2m) rollout forecasting | ERA5 2019 | T2m Error (1d)0.97 | 3 | |
| 2m Temperature (t2m) Rollout Prediction | ERA5 2018 | RMSE (1 day)0.97 | 3 | |
| Eastward wind at 850hPa (u850) rollout forecasting | ERA5 2019 | Forecast Error (1 day)1.64 | 3 |