Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MedLA: A Logic-Driven Multi-Agent Framework for Complex Medical Reasoning with Large Language Models

About

Answering complex medical questions requires not only domain expertise and patient-specific information, but also structured and multi-perspective reasoning. Existing multi-agent approaches often rely on fixed roles or shallow interaction prompts, limiting their ability to detect and resolve fine-grained logical inconsistencies. To address this, we propose \textsc{MedLA}, a logic-driven multi-agent framework built on large language models. Each agent organizes its reasoning process into an explicit logical tree based on syllogistic triads (major premise, minor premise, and conclusion), enabling transparent inference and premise-level alignment. Agents engage in a multi-round, graph-guided discussion to compare and iteratively refine their logic trees, achieving consensus through error correction and contradiction resolution. We demonstrate that \textsc{MedLA} consistently outperforms both static role-based systems and single-agent baselines on challenging benchmarks such as MedDDx and standard medical QA tasks. Furthermore, \textsc{MedLA} scales effectively across both open-source and commercial LLM backbones, achieving state-of-the-art performance and offering a generalizable paradigm for trustworthy medical reasoning.

Siqi Ma, Jiajie Huang, Fan Zhang, Yue Shen, Jinlin Wu, Guohui Fan, Zhu Zhang, Zelin Zang• 2025

Related benchmarks

TaskDatasetResultRank
Medical ReasoningMedDDx (test)
Basic Accuracy48.2
28
Multi-choice medical QAMulti-choice medical QA benchmarks (test)
MMLU-Med Accuracy70.7
28
Medical ReasoningMedDDx
Basic Accuracy48.2
22
Multi-choice Medical Question AnsweringMedical QA Multi-choice
MMLU-Med Accuracy70.7
22
Diagnostic PrecisionClinicalBench (CB) (test)
Accuracy63.4
11
Diagnostic PrecisionMIMIC-IV (test)
Accuracy49.6
11
Medical ReasoningMedXpertQA
Accuracy36
4
Showing 7 of 7 rows

Other info

Follow for update