SMART: Towards Pre-trained Missing-Aware Model for Patient Health Status Prediction

About

Electronic health record (EHR) data has emerged as a valuable resource for analyzing patient health status. However, the prevalence of missing data in EHRs poses significant challenges to existing methods, leading to spurious correlations and suboptimal predictions. While various imputation techniques have been developed to address this issue, they often fixate on unnecessary details and may introduce additional noise when making clinical predictions. To tackle this problem, we propose SMART, a Self-Supervised Missing-Aware RepresenTation Learning approach for patient health status prediction. SMART encodes missing information via elaborate attention mechanisms and learns to impute missing values through a novel self-supervised pre-training approach that reconstructs missing data representations in the latent space. By adopting missing-aware attention and focusing on learning higher-order representations, SMART promotes better generalization and robustness to missing data. We validate the effectiveness of SMART through extensive experiments on six EHR tasks, demonstrating its superiority over state-of-the-art methods.
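
The two ideas named in the abstract, attention that is aware of which observations are missing and a reconstruction objective defined on latent representations rather than raw values, can be illustrated with a minimal NumPy sketch. This is a generic illustration under stated assumptions, not the SMART architecture: the function names `masked_attention` and `latent_reconstruction_loss`, the single-head formulation, and the mean-squared-error objective are all hypothetical choices made for this example.

```python
import numpy as np


def masked_attention(x, observed):
    """Single-head self-attention over time steps that ignores missing steps.

    x:        (T, D) array of per-time-step embeddings.
    observed: (T,) boolean array, True where the step was actually recorded.
              Assumes at least one step is observed.
    """
    # Zero out missing rows so filler values (e.g. NaN) cannot leak into the output.
    x = np.where(observed[:, None], x, 0.0)
    d = x.shape[1]
    scores = x @ x.T / np.sqrt(d)                           # (T, T) similarities
    scores = np.where(observed[None, :], scores, -np.inf)   # hide missing keys
    scores = scores - scores.max(axis=1, keepdims=True)     # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)  # softmax over observed keys
    return weights @ x, weights


def latent_reconstruction_loss(pred_latent, target_latent, hidden):
    """Mean squared error in representation space, restricted to the steps
    that were artificially hidden during pre-training (assumes at least one)."""
    diff = (pred_latent - target_latent)[hidden]
    return float(np.mean(diff ** 2))


# Toy data: 5 time steps, 4 features, two steps missing (hypothetical values).
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 4))
observed = np.array([True, True, False, True, False])

out, weights = masked_attention(x, observed)

# Pretend the attention output is the predicted latent and compare it with a
# target latent on the hidden steps only.
target = rng.normal(size=out.shape)
loss = latent_reconstruction_loss(out, target, ~observed)
```

In this sketch, missing time steps receive exactly zero attention weight, so they never contribute to another step's representation, and the loss compares representations rather than raw feature values, which is the general sense in which a latent-space objective avoids reconstructing every input detail.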

Zhihao Yu, Xu Chu, Yujie Jin, Yasha Wang, Junfeng Zhao • 2024

Related benchmarks

Task                             Dataset                Metric         Result  Rank
-------------------------------  ---------------------  -------------  ------  ----
Readmission prediction           MIMIC IV               AUC-ROC        0.6375  70
Mortality Prediction             MIMIC-III              AUROC          75.24   46
Readmission Prediction (RA)      MIMIC-IV (test)        ROC AUC        0.6132  33
Length-of-Stay Prediction        MIMIC-III              Macro ROC AUC  62.03   28
Length of Stay (LOS) prediction  MIMIC-III (test)       Macro ROC AUC  63.02   14
Mortality Prediction             MIMIC-III (test)       AUROC          85.02   14
Cardiology                       Cardiology (test)      AUROC          85.8    10
Cardiology Prediction            Cardiology (test)      AUPRC          53.84   10
Decompensation                   Decompensation (test)  AUROC          94.2    10
Decompensation Prediction        Decompensation (test)  AUPRC          71.26   10
Showing 10 of 18 rows
