Medical Test Recommendation on MedAction 300-Hard
[Chart: Precision by method; leading entry OpenAI-o3-mini at 54, as of May 8, 2026. Selectable metrics: Precision, Recall, F1 Score.]
Updated 23d ago
Evaluation Results
| Method                | Model Source | Date    | Precision | Recall | F1 Score |
|-----------------------|--------------|---------|-----------|--------|----------|
| OpenAI-o3-mini        | Closed-So... | 2026.05 | 54        | 47     | 50       |
| MedGemma              | Open-Sour... | 2026.05 | 52        | 52     | 52       |
| Qwen-QwQ              | Open-Sour... | 2026.05 | 51        | 41     | 45       |
| II-Medical-8B SFT     | Open-Sour... | 2026.05 | 51        | 51     | 51       |
| Baichuan-M3           | Open-Sour... | 2026.05 | 50        | 54     | 52       |
| Gemini 2.5 Flash-Lite | Closed-So... | 2026.05 | 47        | 48     | 47       |
| Baichuan-M1           | Open-Sour... | 2026.05 | 47        | 41     | 44       |
| II-Medical-8B         | Open-Sour... | 2026.05 | 45        | 45     | 45       |
| GPT-5.4               | Closed-So... | 2026.05 | 42        | 60     | 49       |
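
The three metric columns can be related with a short sketch. It assumes the F1 Score column is the standard harmonic mean of Precision and Recall, which the page does not state; the benchmark may instead average per-case F1, in which case small differences are expected. The function name `f1_score` and the `rows` dictionary are illustrative only, with values copied from the results above.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F1)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) taken from the leaderboard rows.
rows = {
    "OpenAI-o3-mini": (54, 47, 50),
    "MedGemma": (52, 52, 52),
    "Qwen-QwQ": (51, 41, 45),
    "II-Medical-8B SFT": (51, 51, 51),
    "Baichuan-M3": (50, 54, 52),
    "Gemini 2.5 Flash-Lite": (47, 48, 47),
    "Baichuan-M1": (47, 41, 44),
    "II-Medical-8B": (45, 45, 45),
    "GPT-5.4": (42, 60, 49),
}

for name, (p, r, reported) in rows.items():
    computed = f1_score(p, r)
    # Compare the recomputed value (rounded to an integer) with the listed one.
    print(f"{name}: computed F1 = {computed:.2f}, reported = {reported}")
```

Under this assumption, a large gap between Precision and Recall pulls F1 toward the lower of the two, which is why GPT-5.4 (highest Recall, lowest Precision) does not lead on F1.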