Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model

About

The electrocardiogram (ECG) is essential for the clinical diagnosis of arrhythmias and other heart diseases, but deep learning methods based on ECG are often limited by the need for high-quality annotations. Although previous ECG self-supervised learning (eSSL) methods have made significant progress in representation learning from unannotated ECG data, they typically treat ECG signals as ordinary time-series data and segment them using fixed-size, fixed-step time windows, which often ignores the form and rhythm characteristics and the latent semantic relationships in ECG signals. In this work, we introduce a novel perspective on ECG signals, treating heartbeats as words and rhythms as sentences. Based on this perspective, we first design the QRS-Tokenizer, which generates semantically meaningful ECG sentences from raw ECG signals. Building on this, we then propose HeartLang, a novel self-supervised learning framework for ECG language processing that learns general representations at the form and rhythm levels. Additionally, we construct the largest heartbeat-based ECG vocabulary to date, which will further advance the development of ECG language processing. We evaluated HeartLang across six public ECG datasets, where it demonstrated robust competitiveness against other eSSL methods. Our data and code are publicly available at https://github.com/PKUDigitalHealth/HeartLang.
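
To make the "heartbeats as words, rhythms as sentences" idea concrete, here is a minimal sketch of heartbeat-centred segmentation. It is not the authors' QRS-Tokenizer: the threshold-based peak detector, the window length, and the names `synthetic_ecg`, `find_r_peaks`, and `ecg_to_sentence` are all assumptions made for illustration, and a real pipeline would use a proper QRS detector on filtered signals.

```python
from typing import List


def synthetic_ecg(fs: int = 100, seconds: int = 5, beat_every_s: float = 1.0) -> List[float]:
    """Toy signal (hypothetical): a flat baseline with one triangular spike per beat."""
    signal = [0.0] * (fs * seconds)
    step = int(beat_every_s * fs)
    for p in range(step // 2, len(signal) - 1, step):
        signal[p - 1], signal[p], signal[p + 1] = 0.5, 1.0, 0.5
    return signal


def find_r_peaks(signal: List[float], fs: int, threshold_ratio: float = 0.6,
                 refractory_s: float = 0.2) -> List[int]:
    """Return indices of local maxima above a fraction of the global maximum.

    Peaks closer than `refractory_s` seconds to the previously accepted peak are skipped.
    """
    if not signal:
        return []
    threshold = threshold_ratio * max(signal)
    refractory = int(refractory_s * fs)
    peaks: List[int] = []
    for i in range(1, len(signal) - 1):
        is_local_max = signal[i - 1] < signal[i] >= signal[i + 1]
        if is_local_max and signal[i] >= threshold:
            if not peaks or i - peaks[-1] > refractory:
                peaks.append(i)
    return peaks


def ecg_to_sentence(signal: List[float], fs: int, half_window_s: float = 0.25) -> List[List[float]]:
    """Cut one window (a "word") around each detected peak; the ordered list is the "sentence"."""
    half = int(half_window_s * fs)
    words: List[List[float]] = []
    for p in find_r_peaks(signal, fs):
        start, end = p - half, p + half
        if start < 0 or end > len(signal):
            continue  # drop beats truncated by the record boundary
        words.append(signal[start:end])
    return words


sentence = ecg_to_sentence(synthetic_ecg(), fs=100)
print(len(sentence), "heartbeat words of length", len(sentence[0]))
```

Unlike fixed-step windowing, each segment here is aligned to a heartbeat, so every "word" covers the same part of the cardiac cycle regardless of heart rate.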

Jiarui Jin, Haoyu Wang, Hongyan Li, Jun Li, Jiahui Pan, Shenda Hong • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| ECG Classification | PTBXL Super | Macro AUC 88 | 136 |
| ECG Classification | CSN | Macro AUC 82.49 | 51 |
| ECG Classification | CPSC 2018 | AUC 77.87 | 32 |
| ECG Classification | PTBXL Sub | Macro AUC 0.8891 | 27 |
| ECG Classification | PTBXL Rhythm | Macro AUC 90.34 | 27 |
| ECG Classification | PTB-XL Form | AUC 80.23 | 17 |
| Chronic Kidney Disease Detection | MIMIC-IV-ECG-Ext-ICD Chronic Kidney Disease (test) | AUC 76.24 | 9 |
| Sepsis Detection | MIMIC-IV-ECG-Ext-ICD Sepsis (test) | AUC 74.34 | 9 |
| Diabetes Detection | MIMIC-IV-ECG-Ext-ICD (Diabetes) (test) | AUC 65.08 | 9 |
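
For readers unfamiliar with the Macro AUC metric used in the results above, the following is a minimal sketch of how it is commonly computed for multi-label classification: one AUC per class, then an unweighted mean. The function names and toy data are hypothetical, and this is not the evaluation code behind the listed results. AUC is a fraction between 0 and 1 and is often reported multiplied by 100.

```python
from typing import List, Sequence


def binary_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC is undefined unless both classes are present")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def macro_auc(label_matrix: List[List[int]], score_matrix: List[List[float]]) -> float:
    """Unweighted mean of per-class AUCs; rows are samples, columns are classes."""
    n_classes = len(label_matrix[0])
    per_class = []
    for c in range(n_classes):
        labels = [row[c] for row in label_matrix]
        scores = [row[c] for row in score_matrix]
        per_class.append(binary_auc(labels, scores))
    return sum(per_class) / n_classes


# Toy data (made up): four recordings, two diagnostic classes.
y_true = [[1, 0], [0, 1], [1, 1], [0, 0]]
y_score = [[0.9, 0.2], [0.1, 0.8], [0.8, 0.3], [0.3, 0.6]]
print(macro_auc(y_true, y_score))
```

Because every class contributes equally to the mean, rare diagnoses weigh as much as common ones in the final score.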