Share your thoughts, 1 month free Claude Pro on usSee more

Medical Reasoning on MMLU Pro

80.7Accuracy

MDAgents

Updated 29d ago

Evaluation Results

Method	Links
MDAgents 2025.08		80.7
DyLAN 2025.08		80
TMA-AllCompon 2025.08		79.7
ReConcile 2025.08		78.8
m1k 2026.06		64.5
Middle Perplexity 2026.06		63.6
Ours 2026.06		63
Random 2026.06		62.7
Embedding Diversity 2026.06		62.5
S2L 2026.06		62.5
Learnability 2026.06		60.8
MedAgents 2025.08		46.7
TMA-AllCompon 2025.08		46.4
ReConcile 2025.08		35.8
TMA-AllCompon 2025.08		35.6
DyLAN 2025.08		27.4
MDAgents 2025.08		24.4
MedAgents 2025.08		23.6