ACE-NODE: Attentive Co-Evolving Neural Ordinary Differential Equations
About
Neural ordinary differential equations (NODEs) presented a new paradigm to construct (continuous-time) neural networks. While showing several good characteristics in terms of the number of parameters and the flexibility in constructing neural networks, they also have a couple of well-known limitations: i) theoretically NODEs learn homeomorphic mapping functions only, and ii) sometimes NODEs show numerical instability in solving integral problems. To handle this, many enhancements have been proposed. To our knowledge, however, integrating attention into NODEs has been overlooked for a while. To this end, we present a novel method of attentive dual co-evolving NODE (ACE-NODE): one main NODE for a downstream machine learning task and the other for providing attention to the main NODE. Our ACE-NODE supports both pairwise and elementwise attention. In our experiments, our method outperforms existing NODE-based and non-NODE-based baselines in almost all cases by non-trivial margins.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Time-series classification | PhysioNet Sepsis (test) | AUROC80.4 | 30 | |
| Classification | PhysioNet Sepsis 2019 (test) | AUROC80.4 | 20 | |
| Forecasting | MuJoCo 30% Dropped (test) | MSE0.053 | 12 | |
| Forecasting | MuJoCo 50% Dropped (test) | MSE0.053 | 12 | |
| Forecasting | MuJoCo 70% Dropped (test) | MSE0.052 | 12 | |
| Forecasting | MuJoCo Regular (test) | MSE0.039 | 12 | |
| Forecasting | Google stock data Regular scenario | MSE0.0022 | 11 | |
| Forecasting | Google stock data 30% Dropped scenario | MSE0.0024 | 11 | |
| Forecasting | Google stock data 50% Dropped scenario | MSE0.0022 | 11 | |
| Forecasting | Google stock data 70% Dropped scenario | MSE0.0025 | 11 |