SurF: A Generative Model for Multivariate Irregular Time Series Forecasting
About
Irregularly sampled multivariate event streams remain a stubbornly difficult modality for generative modeling: tokenization-based approaches break down when inter-event intervals vary by orders of magnitude, and neural temporal point processes are bottlenecked by window-level numerical quadrature. We (i) propose SurF, a generative model that uses the Time Rescaling Theorem (TRT) as a learnable bijection between event sequences and i.i.d.\ unit-rate exponential noise, enabling a single model to be trained across heterogeneous event-stream datasets; (ii) three efficient parameterizations of the cumulative intensity that scale to long sequences; and (iii) a Transformer-based encoder for multi-dataset pretraining. On six real-world benchmarks, SurF achieves the best reported time RMSE on Earthquake, Retweet, and Taobao, and is within trial-level noise of the strongest specialist on the remaining three. Under a strict leave-one-out protocol, the held-out checkpoint beats every classical and neural-autoregressive baseline on 5/6 datasets and beats every baseline on Amazon and Earthquake, an initial step toward foundation models over asynchronous event streams.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Event Prediction | StackOverflow | ACC45.1 | 58 | |
| Event Prediction | taxi | RMSEΔt0.286 | 40 | |
| Next event prediction | Taobao | Time RMSE0.126 | 33 | |
| Next event prediction | AMAZON | RMSE0.336 | 32 | |
| Event Prediction | Retweet | RMSE (Time)15.78 | 28 | |
| Next-event time and location prediction | Earthquake | Temporal RMSE1.228 | 27 | |
| Long-horizon prediction | AMAZON | RMSE (Δt)0.327 | 26 | |
| Multi-horizon forecasting | Retweet | Inter-event Time RMSE14.71 | 15 | |
| Multi-horizon forecasting | Earthquake | Inter-event Time RMSE1.229 | 15 | |
| Multi-horizon forecasting | StackOverflow | Inter-event Time RMSE0.825 | 15 |