Agentic medical interaction on AgentClinic MedQA
[Chart: Accuracy over time. Best result: o3, 65.8 (Jul 7, 2025).]
Updated 11d ago
Evaluation Results
Method           Model category   Date      Accuracy
o3               Large M...       2025.07   65.8
Gemini 2.5 Pro   Large M...       2025.07   58.3
DeepSeek R1      Large M...       2025.07   58.1
MedGemma 27B     Small M...       2025.07   56.2
Human physician  Human m...       2025.07   54
Gemma 3 27B      Small M...       2025.07   50.7