Oscillators Are All You Need: Irregular Time Series Modelling via Damped Harmonic Oscillators with Closed-Form Solutions
About
Transformers excel at time series modelling through attention mechanisms that capture long-term temporal patterns. However, they assume uniform time intervals and therefore struggle with irregular time series. Neural Ordinary Differential Equations (NODEs) effectively handle irregular time series by modelling hidden states as continuously evolving trajectories. ContiFormers arxiv:2402.10635 combine NODEs with Transformers, but inherit the computational bottleneck of the former by using heavy numerical solvers. This bottleneck can be removed by using a closed-form solution for the given dynamical system - but this is known to be intractable in general! We obviate this by replacing NODEs with a novel linear damped harmonic oscillator analogy - which has a known closed-form solution. We model keys and values as damped, driven oscillators and expand the query in a sinusoidal basis up to a suitable number of modes. This analogy naturally captures the query-key coupling that is fundamental to any transformer architecture by modelling attention as a resonance phenomenon. Our closed-form solution eliminates the computational overhead of numerical ODE solvers while preserving expressivity. We prove that this oscillator-based parameterisation maintains the universal approximation property of continuous-time attention; specifically, any discrete attention matrix realisable by ContiFormer's continuous keys can be approximated arbitrarily well by our fixed oscillator modes. Our approach delivers both theoretical guarantees and scalability, achieving state-of-the-art performance on irregular time series benchmarks while being orders of magnitude faster.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time-series classification | SelfRegulationSCP2 | Accuracy58.7 | 55 | |
| Time-series classification | SelfRegulationSCP1 | Accuracy84.1 | 45 | |
| Event Prediction | StackOverflow | RMSE0.951 | 42 | |
| Event Prediction | MIMIC | Accuracy83.4 | 15 | |
| Binary sequence classification | Synthetic Event-based (irregular) encoding | Accuracy99.83 | 13 | |
| Binary sequence classification | Synthetic Equidistant encoding | Accuracy100 | 13 | |
| Time-series classification | MI | Accuracy91.8 | 10 | |
| Time-series classification | ETC | Accuracy31.5 | 10 | |
| Time-series classification | HB | Accuracy71.8 | 10 | |
| Clinical time series prediction | HR | RMSE2.56 | 9 |