CARE-ECG: Causal Agent-based Reasoning for Explainable and Counterfactual ECG Interpretation
About
Large language models (LLMs) enable waveform-to-text ECG interpretation and interactive clinical questioning, yet most ECG-LLM systems still rely on weak signal-text alignment and retrieval without explicit physiological or causal structure. This limits grounding, temporal reasoning, and counterfactual "what-if" analysis central to clinical decision-making. We propose CARE-ECG, a causally structured ECG-language reasoning framework that unifies representation learning, diagnosis, and explanation in a single pipeline. CARE-ECG encodes multi-lead ECGs into temporally organized latent biomarkers, performs causal graph inference for probabilistic diagnosis, and supports counterfactual assessment via structural causal models. To improve faithfulness, CARE-ECG grounds language outputs through causal retrieval-augmented generation and a modular agentic pipeline that integrates history, diagnosis, and response with verification. Across multiple ECG benchmarks and expert QA settings, CARE-ECG improves diagnostic accuracy and explanation faithfulness while reducing hallucinations (e.g., 0.84 accuracy on Expert-ECG-QA and 0.76 on SCP-mapped PTB-XL under GPT-4). Overall, CARE-ECG provides traceable reasoning by exposing key latent drivers, causal evidence paths, and how alternative physiological states would change outcomes.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clinical Reasoning | Expert-ECG-QA | Accuracy84 | 8 | |
| Diagnosis Grounding | PTB-XL SCP-Mapped | Accuracy76 | 8 | |
| Medical Explanation Generation | PTB-XL | CRC0.91 | 8 | |
| Medical Explanation Generation | MIMIC-IV ECG | CRC94 | 8 | |
| Medical Explanation Generation | Expert-ECG-QA | CRC96 | 8 | |
| ECG Question Answering | Expert-ECG-QA | HR8 | 8 | |
| ECG report generation | PTB-XL | Heart Rate (HR)0.08 | 8 | |
| ECG report generation | MIMIC-IV ECG | HR0.09 | 8 |