Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images

About

While recent multimodal large language models (MLLMs) have advanced automated ECG interpretation, they still face two key limitations: (1) insufficient multimodal synergy between time series signals and visual ECG representations, and (2) limited explainability in linking diagnoses to granular waveform evidence. We introduce GEM, the first MLLM unifying ECG time series, 12-lead ECG images and text for grounded and clinician-aligned ECG interpretation. GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process through three core innovations: a dual-encoder framework extracting complementary time series and image features, cross-modal alignment for effective multimodal understanding, and knowledge-guided instruction generation for generating high-granularity grounding data (ECG-Grounding) linking diagnoses to measurable parameters ($e.g.$, QRS/PR Intervals). Additionally, we propose the Grounded ECG Understanding task, a clinically motivated benchmark designed to comprehensively assess the MLLM's capability in grounded ECG understanding. Experimental results on both existing and our proposed benchmarks show GEM significantly improves predictive performance (CSN $7.4\% \uparrow$), explainability ($22.7\% \uparrow$), and grounding ($24.8\% \uparrow$), making it more suitable for real-world clinical applications. GitHub repository: https://github.com/lanxiang1017/GEM.git

Xiang Lan, Feng Wu, Kai He, Qinghao Zhao, Shenda Hong, Mengling Feng• 2025

Related benchmarks

TaskDatasetResultRank
ForecastingIcentia
F1 Score33.33
128
Time Series ReasoningTSUR Reasoning (test)
Inductive Accuracy58.73
19
Grounded ECG InterpretationECG-Grounding
Diagnosis Accuracy74.7
17
ECG Abnormality DetectionPTB-XL Super
AUC83.4
10
Cardiovascular Disease DetectionCPSC 2018
AUC0.791
10
ECG Abnormality DetectionCODE 15%
AUC91.5
8
ECG Abnormality DetectionCSN
ACC86.2
8
ECG Abnormality DetectionG12EC
Accuracy80.5
8
Medical Time Series ClassificationPTB-XL
F1-Score29.82
7
ECG dialogue response quality evaluationECG-MTD 12-lead (test)
Accuracy2.48
7
Showing 10 of 29 rows

Other info

Follow for update