Multivariate Time Series Data Imputation via Distributionally Robust Regularization
About
Multivariate time series imputation is often compromised by mismatch between the observed and true data distributions, a bias induced by the combined effects of time-series non-stationarity and systematic missingness. Standard methods that encourage point-wise reconstruction or direct distributional alignment may overfit these biased observations. We propose the Distributionally Robust Regularized Imputer Objective (DRIO), which jointly minimizes reconstruction error and the worst-case divergence between the imputer distribution and data distributions within a Wasserstein ambiguity set. We derive a tractable upper-bound surrogate that reduces infinite-dimensional optimization over measures to adversarial search over sample trajectories, and develop an alternating learning algorithm compatible with modern deep learning backbones. Comprehensive experiments on diverse real-world datasets show that DRIO consistently provides robust imputation and suggests improved downstream forecasting under various missingness scenarios.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Imputation | CNNpred | MSE0.74 | 8 | |
| Time Series Imputation | CNNpred MNAR averaged across missing ratios (test) | MSE0.74 | 8 | |
| Imputation | PeMS08 | MSE0.52 | 8 | |
| Imputation | PM2.5 | MSE0.63 | 8 | |
| Imputation | Gait | MSE0.15 | 8 | |
| Imputation | CMAPSS | MSE0.27 | 8 | |
| Imputation | AirQuality | MSE0.32 | 8 | |
| Imputation | HAR | MSE0.33 | 8 | |
| Time Series Imputation | PEMS08 MNAR (test) | MSE0.52 | 8 | |
| Time Series Imputation | PM2.5 MNAR averaged across missing ratios (test) | MSE0.63 | 8 |