EVL-ECG: Efficient ECG Interpretation With Multi-Aspect Heterogeneous Knowledge Distillation
About
High-fidelity ECG interpretation is increasingly reliant on massive foundation models, yet their deployment in clinical edge-care remains hindered by extreme computational demands. While knowledge distillation (KD) is a promising solution, traditional methods fail to capture the complex spatio-temporal dependencies of ECG signals when transferring knowledge across heterogeneous architectures. In this paper, we propose EVL-ECG, a framework specifically designed for cross-architecture distillation of cardiac diagnostic logic. EVL-ECG introduces three ECG-aware innovations: (1) Multi-Head Cross-Attention Alignment, which harmonizes architectural discrepancies to preserve fine-grained morphological features; (2) Optimal Transport-based Visual Feature Matching, utilizing optimal transport to maintain global structural relationships across ECG leads despite mismatched token representations; and (3) Geometric Intra-Architecture Relation Matching, which distills the latent diagnostic reasoning of the teacher model. Evaluations across ECG benchmarks demonstrate that EVL-ECG yields improvements of up to 2.4% AUC and 1.1% clinical accuracy over existing baselines. Notably, EVL-ECG establishes an efficient 2B-parameter ECG foundation model, suitable for resource-constrained clinical environments.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| ECG Interpretation | PTB-XL Super | AUC75.2 | 15 | |
| ECG Interpretation | CODE 15% | AUC86.4 | 15 | |
| ECG Interpretation | CPSC 2018 | AUC66.8 | 15 | |
| ECG Interpretation | CSN | Accuracy89.7 | 14 | |
| ECG Interpretation | G12EC | Accuracy77.1 | 14 | |
| ECG Interpretation | MMMU ECG | Accuracy48.5 | 14 | |
| ECG Question Answering | ECG-QA | Accuracy64.8 | 14 |