Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TeamMedAgents: Enhancing Medical Decision-Making of LLMs Through Structured Teamwork

About

We present TeamMedAgents, a modular multi-agent framework that systematically translates evidence-based teamwork principles from organizational psychology into large language model collaboration for medical decision-making. Building upon Salas et al.'s "Big Five" teamwork model, we operationalize five core components as independently configurable mechanisms: shared mental models, team leadership, team orientation, trust networks, and mutual monitoring. Our architecture dynamically recruits 2-4 specialist agents and employs structured four-phase deliberation with adaptive component selection. Evaluation across eight medical benchmarks encompassing 11,545 questions demonstrates TeamMedAgents achieves 77.63% overall accuracy (text-based: 81.30%, vision-language: 66.60%). Systematic ablation studies comparing three single-agent baselines (Zero-Shot, Few-Shot, CoT) against individual teamwork components reveal task-specific optimization patterns: shared mental models excel on knowledge tasks, trust mechanisms improve differential diagnosis, while comprehensive integration degrades performance. Adaptive component selection yields 2-10 percentage point improvements over strongest baselines, with 96.2% agent convergence validating structured coordination effectiveness. TeamMedAgents establishes principled methodology for translating human teamwork theory into multi-agent systems, demonstrating that evidence-based collaboration patterns enhance AI performance in safety-critical domains through modular component design and selective activation strategies.

Pranav Pushkar Mishra, Mohammad Arvan, Mohan Zalake• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringPubMedQA
Accuracy79.2
145
Medical Question AnsweringMedQA
Accuracy88.1
109
Medical Question AnsweringDDXPlus
Accuracy82.4
28
Medical Question AnsweringMedbullets
Accuracy80.3
9
Medical Question AnsweringSymCat
Accuracy88.9
9
Multi-Agent Clinical EvaluationCombined Medical Benchmarks Aggregate
Expert Rating8
9
Showing 6 of 6 rows

Other info

Follow for update