| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MMedbench English subset (val) | LLaMA3-8B | Accuracy60.33 | 36 | 4d ago | |
| MMLU Med | DOS-CPT | Accuracy82.11 | 5 | 4d ago | |
| CMMLU Med | Accuracy86.89 | 5 | 4d ago | ||
| CEVAL Med | Accuracy91.46 | 5 | 4d ago | ||
| MedQA-USMLE | DOS-CPT | Accuracy73.61 | 5 | 4d ago | |
| NEJMQA | DOS-CPT | Accuracy66.14 | 5 | 4d ago | |
| Medbullets | Accuracy58.44 | 5 | 4d ago | ||
| GPQA Med | DOS-CPT | Accuracy63.16 | 5 | 4d ago | |
| DiagnosisArena | DOS-CPT | Accuracy61.11 | 5 | 4d ago |