Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Countdown Regression: Sharp and Calibrated Survival Predictions

About

Probabilistic survival predictions from models trained with Maximum Likelihood Estimation (MLE) can have high, and sometimes unacceptably high variance. The field of meteorology, where the paradigm of maximizing sharpness subject to calibration is popular, has addressed this problem by using scoring rules beyond MLE, such as the Continuous Ranked Probability Score (CRPS). In this paper we present the \emph{Survival-CRPS}, a generalization of the CRPS to the survival prediction setting, with right-censored and interval-censored variants. We evaluate our ideas on the mortality prediction task using two different Electronic Health Record (EHR) data sets (STARR and MIMIC-III) covering millions of patients, with suitable deep neural network architectures: a Recurrent Neural Network (RNN) for STARR and a Fully Connected Network (FCN) for MIMIC-III. We compare results between the two scoring rules while keeping the network architecture and data fixed, and show that models trained with Survival-CRPS result in sharper predictive distributions compared to those trained by MLE, while still maintaining calibration.

Anand Avati, Tony Duan, Sharon Zhou, Kenneth Jung, Nigam H. Shah, Andrew Ng• 2018

Related benchmarks

TaskDatasetResultRank
Censored Quantile RegressionLogNorm (test)
MSE0.247
5
Censored Quantile RegressionLogNorm heavy (test)
MSE1.17
5
Censored Quantile RegressionLogNorm med. (test)
MSE0.907
5
Censored Quantile RegressionLogNorm light (test)
MSE0.432
5
Censored Quantile RegressionLogNorm same (test)
MSE to True Quantile0.067
5
Censored Quantile RegressionNorm linear (test)
MSE (Quantile)0.184
5
Censored Quantile RegressionNorm same (test)
MSE to True Quantile0.114
5
Censored Quantile RegressionNorm non-lin (test)
MSE to True Quantile0.323
5
Censored Quantile RegressionExponential (test)
MSE to true quantile17.825
5
Censored Quantile RegressionWeibull (test)
MSE (Quantile)1.586
5
Showing 10 of 14 rows

Other info

Follow for update