Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MIDA: Multiple Imputation using Denoising Autoencoders

About

Missing data is a significant problem impacting all domains. State-of-the-art framework for minimizing missing data bias is multiple imputation, for which the choice of an imputation model remains nontrivial. We propose a multiple imputation model based on overcomplete deep denoising autoencoders. Our proposed model is capable of handling different data types, missingness patterns, missingness proportions and distributions. Evaluation on several real life datasets show our proposed model significantly outperforms current state-of-the-art methods under varying conditions while simultaneously improving end of the line analytics.

Lovedeep Gondara, Ke Wang• 2017

Related benchmarks

TaskDatasetResultRank
Missing data estimationBiobank
Mean RMSE0.0805
13
Patient state predictionMIMIC-III (test)
AUROC83.99
13
Patient state predictionBiobank (test)
AUROC87.85
13
Missing data estimationMIMIC-III v1.4 (test)
Mean RMSE0.0412
13
Missing data estimationDeterioration
RMSE0.0309
13
Patient state predictionDeterioration (test)
AUROC0.7488
13
Missing data estimationUNOS Heart
Mean RMSE0.0589
12
Missing data estimationUNOS-Lung
Mean RMSE0.0712
12
Patient state predictionUNOS-Heart (test)
AUROC66.33
12
Patient state predictionUNOS-Lung (test)
AUROC0.6574
12
Showing 10 of 19 rows

Other info

Follow for update