Agentic medical interaction on AgentClinic-MIMIC OOD
[Chart: Accuracy. Highlighted point: o3, 50.6 (Jul 7, 2025)]
Updated 11d ago
Evaluation Results
Method           Model category   Date      Accuracy
o3               Large M...       2025.07   50.6
Gemini 2.5 Pro   Large M...       2025.07   48.4
MedGemma 27B     Small M...       2025.07   46
DeepSeek R1      Large M...       2025.07   43.8
Gemma 3 27B      Small M...       2025.07   35.2