MedLA: A Logic-Driven Multi-Agent Framework for Complex Medical Reasoning with Large Language Models

About

Answering complex medical questions requires not only domain expertise and patient-specific information, but also structured and multi-perspective reasoning. Existing multi-agent approaches often rely on fixed roles or shallow interaction prompts, limiting their ability to detect and resolve fine-grained logical inconsistencies. To address this, we propose \textsc{MedLA}, a logic-driven multi-agent framework built on large language models. Each agent organizes its reasoning process into an explicit logical tree based on syllogistic triads (major premise, minor premise, and conclusion), enabling transparent inference and premise-level alignment. Agents engage in a multi-round, graph-guided discussion to compare and iteratively refine their logic trees, achieving consensus through error correction and contradiction resolution. We demonstrate that \textsc{MedLA} consistently outperforms both static role-based systems and single-agent baselines on challenging benchmarks such as MedDDx and standard medical QA tasks. Furthermore, \textsc{MedLA} scales effectively across both open-source and commercial LLM backbones, achieving state-of-the-art performance and offering a generalizable paradigm for trustworthy medical reasoning.

Siqi Ma, Jiajie Huang, Fan Zhang, Yue Shen, Jinlin Wu, Guohui Fan, Zhu Zhang, Zelin Zang• 2025

Related benchmarks

Task	Dataset	Result
Medical Reasoning	MedDDx (test)	Basic Accuracy48.2	28
Multi-choice medical QA	Multi-choice medical QA benchmarks (test)	MMLU-Med Accuracy70.7	28
Medical Reasoning	MedDDx	Basic Accuracy48.2	22
Multi-choice Medical Question Answering	Medical QA Multi-choice	MMLU-Med Accuracy70.7	22
Medical Reasoning	MedXpertQA	Accuracy36	20
Diagnostic Precision	ClinicalBench (CB) (test)	Accuracy63.4	11
Diagnostic Precision	MIMIC-IV (test)	Accuracy49.6	11

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord