Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Reasoning on MedBullets 4-option multiple choice
Loading...
62.3
Accuracy
MedVerse
45.14
49.595
54.05
58.505
Feb 7, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
MedVerse
Backbone=LLaMA-3.1-8B-...
2026.02
62.3
MedReason
Backbone=LLaMA-3.1-8B-...
2026.02
57.1
Huatuo-o1
Backbone=LLaMA-3.1-8B-...
2026.02
55.8
MedVerse
Backbone=Qwen2.5-7B-In...
2026.02
55.2
MedReason
Backbone=Qwen2.5-7B-In...
2026.02
49.7
Original (LLaMA-3.1-8B-Instruct)
Backbone=LLaMA-3.1-8B-...
2026.02
48.7
Original (Qwen2.5-7B-Instruct)
Backbone=Qwen2.5-7B-In...
2026.02
45.8
Feedback
Search any
task
Search any
task