Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Missing Pattern Recognized Diffusion Imputation Model for Missing Not At Random

About

Missing data frequently arises across diverse domains, including time-series and image domains. In the real world, missing occurrences often depend on the unobservable values themselves, which are referred to as Missing Not at Random (MNAR). In this work, we introduce the Missing Pattern Recognized Diffusion Imputation Model (PRDIM), a novel framework that explicitly captures the missing pattern and precisely imputes unobserved values. PRDIM iteratively maximizes the likelihood of the joint distribution for observed values and missing mask under an Expectation-Maximization (EM) algorithm. In this sense, we first employ a pattern recognizer, which approximates the underlying missing pattern and provides guidance during every inference toward more plausible imputations with respect to the missing information. Through extensive experiments, we demonstrate that PRDIM consistently achieves strong imputation performance under MNAR settings across multiple data modalities.

Gyuwon Sim, Sumin Lee, Heesun Bae, Byeonghu Na, Doyun Kwon, Ju-Hee Hwang, Jae-Young Lim, Il-Chul Moon• 2026

Related benchmarks

TaskDatasetResultRank
Time Series ImputationETT Original (Out-of-Sample)
MAE0.663
22
Time Series ImputationETT (Original In-Sample)
MAE0.303
22
Image InpaintingCelebA-HQ (test)
LPIPS2.09
18
Time Series ImputationSTOCK (Original Out-of-Sample)
MAE0.254
11
Time Series ImputationPEMS-Bay (Original Out-of-Sample)
MAE0.17
11
Time Series ImputationSTOCK Original (In-Sample)
MAE0.275
11
Time Series ImputationPEMS-Bay (Original In-Sample)
MAE0.154
11
Out-of-sample ImputationPhysioNet (out-of-sample)
MAE0.2737
9
Tabular Imputationadult out-of-sample
MAE0.482
6
Tabular Imputationdefault out-of-sample
MAE0.279
6
Showing 10 of 23 rows

Other info

Follow for update