Learning Long-Term Dependencies in Irregularly-Sampled Time Series
About
Recurrent neural networks (RNNs) with continuous-time hidden states are a natural fit for modeling irregularly-sampled time series. These models, however, face difficulties when the input data possess long-term dependencies. We prove that similar to standard RNNs, the underlying reason for this issue is the vanishing or exploding of the gradient during training. This phenomenon is expressed by the ordinary differential equation (ODE) representation of the hidden state, regardless of the ODE solver's choice. We provide a solution by designing a new algorithm based on the long short-term memory (LSTM) that separates its memory from its time-continuous state. This way, we encode a continuous-time dynamical flow within the RNN, allowing it to respond to inputs arriving at arbitrary time-lags while ensuring a constant error propagation through the memory path. We call these RNN models ODE-LSTMs. We experimentally show that ODE-LSTMs outperform advanced RNN-based counterparts on non-uniformly sampled data with long-term dependencies. All code and data is available at https://github.com/mlech26l/ode-lstms.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Temporal Point Process modeling | MOOC real-world (test) | NLL-0.2277 | 25 | |
| Temporal Point Process modeling | Reddit real-world (test) | Negative Log-Likelihood-1.0888 | 25 | |
| Per time-step regression | Walker2D | Squared Error0.883 | 19 | |
| Temporal Point Process modeling | Wiki real-world (test) | Negative Log-Likelihood-1.3727 | 18 | |
| Sequence Classification | Bit-stream XOR Event-based (irregular) encoding (test) | Accuracy98.89 | 18 | |
| Sequence Classification | Bit-stream XOR Equidistant encoding (test) | Accuracy100 | 18 | |
| Event sequence classification | Irregular sequential MNIST (test) | Accuracy95.73 | 11 | |
| Sequence Classification | Bit-stream sequence Event-based encoding (test) | Accuracy98.89 | 11 | |
| Temporal Point Process modeling | Hawkes2 synthetic (test) | NLL0.1253 | 11 | |
| Temporal Point Process modeling | Renewal synthetic (test) | NLL0.2605 | 11 |