Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical Question Answering on MedBullets (test)
Loading...
82.79
Accuracy
ClinicalAgents
57.1332
63.7941
70.455
77.1159
Mar 27, 2026
Accuracy
Updated 2mo ago
Evaluation Results
Method
Method
Links
Accuracy
ClinicalAgents
Category=Multi-Agents
2026.03
82.79
MedChain-Agents
Category=Multi-Agents
2026.03
81.82
MedAgents
Category=Multi-Agents
2026.03
80.84
ReConcile
Category=Multi-Agents
2026.03
80.52
ReAct
Category=Single Agent
2026.03
80.19
DeepSeek-R1
Category=Single LLM
2026.03
79.87
Few-shot + CoT
Category=Single Agent
2026.03
79.87
AutoGen
Category=Multi-Agents
2026.03
79.55
RAG
Category=Single Agent
2026.03
79.22
GPT-5.2
Category=Single LLM
2026.03
78.9
MDAgents
Category=Multi-Agents
2026.03
77.27
Llama-4-Maverick-17B
Category=Single LLM
2026.03
76.95
ColaCare
Category=Multi-Agents
2026.03
76.62
Intern-S1
Category=Single LLM
2026.03
72.4
Qwen3-VL-235B
Category=Single LLM
2026.03
67.21
FineMedLM-o1-8B
Category=Single LLM
2026.03
60.39
HuaTuoGPT-o1-7B
Category=Single LLM
2026.03
59.74
MedGemma-4B
Category=Single LLM
2026.03
58.12
Feedback
Search any
task
Search any
task