Neural Ordinary Differential Equations

About

We introduce a new family of deep neural network models. Instead of specifying a discrete sequence of hidden layers, we parameterize the derivative of the hidden state using a neural network. The output of the network is computed using a black-box differential equation solver. These continuous-depth models have constant memory cost, adapt their evaluation strategy to each input, and can explicitly trade numerical precision for speed. We demonstrate these properties in continuous-depth residual networks and continuous-time latent variable models. We also construct continuous normalizing flows, a generative model that can train by maximum likelihood, without partitioning or ordering the data dimensions. For training, we show how to scalably backpropagate through any ODE solver, without access to its internal operations. This allows end-to-end training of ODEs within larger models.

Ricky T. Q. Chen, Yulia Rubanova, Jesse Bettencourt, David Duvenaud• 2018

Related benchmarks

Task	Dataset	Result
Image Classification	MNIST (test)	--	894
RUL prediction	N-CMAPSS	RMSE22.74	72
Irregularly Sampled Time Series Forecasting	USHCN (test)	MSE0.96	68
Clinical prediction	MIMIC-III	AUROC77.34	59
Forecasting	MIMIC-III (test)	MSE0.89	51
Generative Modeling	CIFAR-10	BPD3.4	46
Remaining Useful Life Estimation	C-MAPSS FD002 (test)	RMSE19.9	44
Long-term time-series forecasting	Sinewave (SIN) synthetic (test)	MSE98.8	36
Anomaly Detection	SMAP (test)	Precision87.5	35
Classification	RotMNIST (test)	--	32

Showing 10 of 257 rows

...

Other info

Code

Follow for update

@wizwand_team Discord