Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data
About
Irregular sampling intervals and missing values in real-world time series data present challenges for conventional methods that assume consistent intervals and complete data. Neural Ordinary Differential Equations (Neural ODEs) offer an alternative approach, utilizing neural networks combined with ODE solvers to learn continuous latent representations through parameterized vector fields. Neural Stochastic Differential Equations (Neural SDEs) extend Neural ODEs by incorporating a diffusion term, although this addition is not trivial, particularly when addressing irregular intervals and missing values. Consequently, careful design of drift and diffusion functions is crucial for maintaining stability and enhancing performance, while incautious choices can result in adverse properties such as the absence of strong solutions, stochastic destabilization, or unstable Euler discretizations, significantly affecting Neural SDEs' performance. In this study, we propose three stable classes of Neural SDEs: Langevin-type SDE, Linear Noise SDE, and Geometric SDE. Then, we rigorously demonstrate their robustness in maintaining excellent performance under distribution shift, while effectively preventing overfitting. To assess the effectiveness of our approach, we conduct extensive experiments on four benchmark datasets for interpolation, forecasting, and classification tasks, and analyze the robustness of our methods with 30 public datasets under different missing rates. Our results demonstrate the efficacy of the proposed method in handling real-world irregular time series data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time-series classification | PhysioNet Sepsis (test) | AUROC91.1 | 30 | |
| Time-series classification | 30 benchmark datasets Regular (test) | Accuracy72.7 | 25 | |
| Time-series classification | benchmark datasets 30% Missing (test) | Accuracy72.3 | 25 | |
| Time-series classification | LSST Scenario 2 v1 (test) | Accuracy39.6 | 21 | |
| Novelty Detection | LSST Scenario 4 v1 (test) | AUROC54.2 | 21 | |
| Time-series classification | LSST Scenario 3 v1 (test) | Accuracy39.8 | 21 | |
| Time-series classification | LSST Scenario 1 v1 (test) | Accuracy39.5 | 21 | |
| Classification | PhysioNet Sepsis 2020 (test) | AUROC (With OI)0.911 | 17 | |
| Time-series classification | 30 benchmark datasets Pairwise Comparison (test) | Win Count (vs Neural ODE, TANDEM)59 | 4 | |
| Time-series classification | 30 benchmark datasets 70% Missing (test) | Accuracy70.3 | 2 |