Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
About
EHR systems lack a unified code system forrepresenting medical concepts, which acts asa barrier for the deployment of deep learningmodels in large scale to multiple clinics and hos-pitals. To overcome this problem, we introduceDescription-based Embedding,DescEmb, a code-agnostic representation learning framework forEHR. DescEmb takes advantage of the flexibil-ity of neural language understanding models toembed clinical events using their textual descrip-tions rather than directly mapping each event toa dedicated embedding. DescEmb outperformedtraditional code-based embedding in extensiveexperiments, especially in a zero-shot transfertask (one hospital to another), and was able totrain a single unified model for heterogeneousEHR datasets.
Kyunghoon Hur, Jiyoung Lee, Jungwoo Oh, Wesley Price, Young-Hak Kim, Edward Choi• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| In-hospital mortality prediction | In-hospital Mortality | AUROC93.3 | 16 | |
| Length-of-Stay Prediction | Long LOS 7d | AUROC0.849 | 16 | |
| Mortality Prediction | 1YR Mortality | AUROC76.6 | 16 | |
| ICU readmission prediction | ICU-Readmit 30d | AUROC0.66 | 16 |
Showing 4 of 4 rows