Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Knowledge Question Answering on MMLU Clinical Knowledge
Loading...
92
Accuracy
GPT-4o
67.872
74.136
80.4
86.664
Oct 25, 2024
Jan 9, 2025
Mar 27, 2025
Jun 11, 2025
Aug 27, 2025
Nov 11, 2025
Jan 27, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4o
Prompting (shots)=0-shot
2024.10
92
GPT-4o
Prompting (shots)=5-shot
2024.10
92
GPT-4T (May 2024)
Prompting (shots)=5-shot
2024.10
87
GPT-4T (May 2024)
Prompting (shots)=0-shot
2024.10
85
Dense Model
Base Model=Llama 3.1-8...
2026.01
73.5
GradPruner
Base Model=Llama 3.1-8...
2026.01
71.7
Laco
Base Model=Llama 3.1-8...
2026.01
71.1
SAT
Base Model=Llama 3.1-8...
2026.01
70.9
MINITRON
Base Model=Llama 3.1-8...
2026.01
70.3
LLMPruner
Base Model=Llama 3.1-8...
2026.01
68.8
Feedback
Search any
task
Search any
task