Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

About

The field of medical diagnosis has undergone a significant transformation with the advent of large language models (LLMs), yet the challenges of interpretability within these models remain largely unaddressed. This study introduces Chain-of-Diagnosis (CoD) to enhance the interpretability of LLM-based medical diagnostics. CoD transforms the diagnostic process into a diagnostic chain that mirrors a physician's thought process, providing a transparent reasoning pathway. Additionally, CoD outputs the disease confidence distribution to ensure transparency in decision-making. This interpretability makes model diagnostics controllable and aids in identifying critical symptoms for inquiry through the entropy reduction of confidences. With CoD, we developed DiagnosisGPT, capable of diagnosing 9604 diseases. Experimental results demonstrate that DiagnosisGPT outperforms other LLMs on diagnostic benchmarks. Moreover, DiagnosisGPT provides interpretability while ensuring controllability in diagnostic rigor.

Junying Chen, Chi Gui, Anningzhe Gao, Ke Ji, Xidong Wang, Xiang Wan, Benyou Wang• 2024

Related benchmarks

TaskDatasetResultRank
Automatic DiagnosisDxy
Number of Turns0.6
46
Medical DiagnosisMedQA agent
Rounds13.32
25
Medical Diagnosisagent-CMB
Rounds11.99
25
Automatic DiagnosisMuzhi Dataset
Accuracy (w/ Inquiry)65.5
14
Automatic DiagnosisDxBench
Accuracy (w/o inquiry)56.9
10
Diagnostic DialogueHG (test)
Recall@110.6
8
Showing 6 of 6 rows

Other info

Code

Follow for update