| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-choice medical QA | Multi-choice medical QA benchmarks (test) | MMLU-Med Accuracy70.7 | 28 | |
| Medical Question Answering | Medical QA Benchmarks (MedQA, MedMCQA, MMLU*, CMB, CMExam, CMMLU*) (test) | MedQA Accuracy64.1 | 20 |