Guiding Masked Representation Learning to Capture Spatio-Temporal Relationship of Electrocardiogram
About
Electrocardiograms (ECG) are widely employed as a diagnostic tool for monitoring electrical signals originating from a heart. Recent machine learning research efforts have focused on the application of screening various diseases using ECG signals. However, adapting to the application of screening disease is challenging in that labeled ECG data are limited. Achieving general representation through self-supervised learning (SSL) is a well-known approach to overcome the scarcity of labeled data; however, a naive application of SSL to ECG data, without considering the spatial-temporal relationships inherent in ECG signals, may yield suboptimal results. In this paper, we introduce ST-MEM (Spatio-Temporal Masked Electrocardiogram Modeling), designed to learn spatio-temporal features by reconstructing masked 12-lead ECG data. ST-MEM outperforms other SSL baseline methods in various experimental settings for arrhythmia classification tasks. Moreover, we demonstrate that ST-MEM is adaptable to various lead combinations. Through quantitative and qualitative analysis, we show a spatio-temporal relationship within ECG data. Our code is available at https://github.com/bakqui/ST-MEM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| ECG Classification | PTBXL Form | Macro AUC (1%)55.71 | 17 | |
| ECG Classification | CSN | Macro AUC (1%)59.77 | 17 | |
| ECG Classification | PTBXL Rhythm | Macro AUC (1%)51.12 | 17 | |
| ECG Classification | CPSC 2018 | Macro AUC (1%)56.69 | 17 | |
| ECG Classification | PTBXL Sub | Macro AUC (1%)0.5412 | 17 | |
| ECG Classification | PTBXL Super | Macro AUC (1%)61.12 | 17 | |
| Disposition after emergency department visit (ED dispo) | MC-MED | AUROC53.23 | 16 | |
| LVEF Prediction | MIMIC IV | MAE (LVEF)7.149 | 8 | |
| SCD outcome prediction | music | AUROC0.5389 | 8 | |
| Beat recognition | Icentia11K | AUROC0.9452 | 8 |