C-MIG: Multi-view Information Gain-based Retrieval-Augmented Generation for Clinical Diagnosis Reasoning
About
Retrieval-augmented generation combined with reinforcement learning has shown promise for grounding large language models in trustworthy medical evidence. However, existing methods rely on exact-match binary rewards, which in clinical diagnosis cause two issues: (i) semantically relevant but non-verbatim steps receive zero signal, discarding valuable learning signals; and (ii) uni-dimensional rewards cannot effectively supervise heterogeneous reasoning capabilities. To address these issues, we propose C-MIG, a Multi-view Information Gain-based retrieval-augmented generation framework for Clinical diagnosis. C-MIG estimates information gain under a frozen reference model from two complementary views, retrieved-document and document-refinement, to jointly guide what to retrieve and how to refine, alleviating the issues of valuable reward signal loss and credit assignment. We further design a multi-subquery retrieval augmentation strategy that improves knowledge recall coverage in clinical diagnostic scenarios. Comprehensive experiments on four medical benchmarks demonstrate that C-MIG achieves the best performance among all RAG-RL methods on both in-domain and out-of-domain sets, and outperforms state-of-the-art general-purpose LLMs for clinical diagnosis.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Clinical Diagnosis | MedDDx-Plus (In-domain) | Exact Match (EM)57.43 | 14 | |
| Clinical Diagnosis | MedXpertQA (OOD) | Exact Match (EM)7.33 | 14 | |
| Clinical Diagnosis | RJUA (OOD) | Exact Match (EM)9.95 | 14 | |
| Clinical Diagnosis | MedQA (OOD) | EM6.5 | 14 | |
| Clinical Diagnosis | MedDDx-Plus | EM Score60.82 | 9 | |
| Clinical Diagnosis | MedXpertQA | EM7.52 | 9 | |
| Clinical Diagnosis | RJUA | EM11.37 | 9 | |
| Clinical Diagnosis | MedQA | Exact Match (EM)11 | 9 |