CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis

About

The field of medical diagnosis has undergone a significant transformation with the advent of large language models (LLMs), yet the challenges of interpretability within these models remain largely unaddressed. This study introduces Chain-of-Diagnosis (CoD) to enhance the interpretability of LLM-based medical diagnostics. CoD transforms the diagnostic process into a diagnostic chain that mirrors a physician's thought process, providing a transparent reasoning pathway. Additionally, CoD outputs the disease confidence distribution to ensure transparency in decision-making. This interpretability makes model diagnostics controllable and aids in identifying critical symptoms for inquiry through the entropy reduction of confidences. With CoD, we developed DiagnosisGPT, capable of diagnosing 9604 diseases. Experimental results demonstrate that DiagnosisGPT outperforms other LLMs on diagnostic benchmarks. Moreover, DiagnosisGPT provides interpretability while ensuring controllability in diagnostic rigor.

Junying Chen, Chi Gui, Anningzhe Gao, Ke Ji, Xidong Wang, Xiang Wan, Benyou Wang• 2024

Related benchmarks

Task	Dataset	Result
Automatic Diagnosis	Dxy	Number of Turns0.6	46
Medical Diagnosis	MedQA agent	Rounds13.32	25
Medical Diagnosis	agent-CMB	Rounds11.99	25
Automatic Diagnosis	Muzhi Dataset	Accuracy (w/ Inquiry)65.5	14
Automatic Diagnosis	DxBench	Accuracy (w/o inquiry)56.9	10
Diagnostic Dialogue	HG (test)	Recall@110.6	8

Showing 6 of 6 rows

Other info

Code

Follow for update

@wizwand_team Discord