Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GP-VAE: Deep Probabilistic Time Series Imputation

About

Multivariate time series with missing values are common in areas such as healthcare and finance, and have grown in number and complexity over the years. This raises the question whether deep learning methodologies can outperform classical data imputation methods in this domain. However, naive applications of deep learning fall short in giving reliable confidence estimates and lack interpretability. We propose a new deep sequential latent variable model for dimensionality reduction and data imputation. Our modeling assumption is simple and interpretable: the high dimensional time series has a lower-dimensional representation which evolves smoothly in time according to a Gaussian process. The non-linear dimensionality reduction in the presence of missing data is achieved using a VAE approach with a novel structured variational approximation. We demonstrate that our approach outperforms several classical and deep learning-based data imputation methods on high-dimensional data from the domains of computer vision and healthcare, while additionally improving the smoothness of the imputations and providing interpretable uncertainty estimates.

Vincent Fortuin, Dmitry Baranchuk, Gunnar R\"atsch, Stephan Mandt• 2019

Related benchmarks

TaskDatasetResultRank
ImputationPhysioNet Challenge 2012 (test)
MAE0.398
21
Time Series ImputationPhysioNet Challenge healthcare hourly time series with 48 time steps 2012 (test)
CRPS0.574
15
Time Series ImputationAir-Quality 10% missing (test)
MAE0.268
10
Time Series ImputationETT 10% missing (test)
MAE0.274
10
ClassificationPhysioNet 2012
ROC AUC0.834
10
ImputationElectricity 70% missing rate
MAE1.037
8
ImputationElectricity 90% missing rate
MAE1.004
8
ImputationElectricity 60% missing rate
MAE1.101
8
ImputationElectricity 80% missing rate
MAE1.062
8
ImputationElectricity 20% missing rate
MAE1.124
8
Showing 10 of 18 rows

Other info

Follow for update