GraphWalker: Graph-Guided In-Context Learning for Clinical Reasoning on Electronic Health Records
About
Clinical Reasoning on Electronic Health Records (EHRs) is a fundamental yet challenging task in modern healthcare. While in-context learning (ICL) offers a promising inference-time adaptation paradigm for large language models (LLMs) in EHR reasoning, existing methods face three fundamental challenges: (1) Perspective Limitation, where data-driven similarity fails to align with LLM reasoning needs and model-driven signals are constrained by limited clinical competence; (2) Cohort Awareness, as demonstrations are selected independently without modeling population-level structure; and (3) Information Aggregation, where redundancy and interaction effects among demonstrations are ignored, leading to diminishing marginal gains. To address these challenges, we propose GraphWalker, a principled demonstration selection framework for EHR-oriented ICL. GraphWalker (i) jointly models patient clinical information and LLM-estimated information gain by integrating data-driven and model-driven perspectives, (ii) incorporates Cohort Discovery to avoid noisy local optima, and (iii) employs a Lazy Greedy Search with Frontier Expansion algorithm to mitigate diminishing marginal returns in information aggregation. Extensive experiments on multiple real-world EHR benchmarks demonstrate that GraphWalker consistently outperforms state-of-the-art ICL baselines, yielding substantial improvements in clinical reasoning performance. Our code is open-sourced at https://github.com/PuppyKnightUniversity/GraphWalker
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Readmission prediction | MIMIC IV | AUC-ROC0.7591 | 70 | |
| Mortality Prediction | MIMIC-III | AUROC84.19 | 46 | |
| Readmission Prediction (RA) | MIMIC-IV (test) | ROC AUC0.7757 | 33 | |
| Length-of-Stay Prediction | MIMIC-III | Macro ROC AUC69.51 | 28 | |
| Length of Stay (LOS) prediction | MIMIC-III (test) | Macro ROC AUC67.4 | 14 | |
| Mortality Prediction | MIMIC-III (test) | AUROC85.88 | 14 | |
| Medical Reasoning | CMB | Exact Match (EM)84.05 | 12 | |
| Medical Reasoning | MedQA | EM61.1 | 12 | |
| Medical Reasoning | CMB clin | BLEU-129.68 | 12 |