| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Image Classification | MEDIC (In-Domain) | Top-1 Accuracy (Damage Severity)81.39 | 12 | |
| CR label prediction | MEDIC | Accuracy82.9 | 5 | |
| ER label prediction | MEDIC | Accuracy82.2 | 5 | |
| EE label prediction | MEDIC | Accuracy88.3 | 5 | |
| Medical Note Summarization | MEDIC Note Summ | Kendall's tau_b-0.275 | 3 | |
| Medical Generation | MEDIC ACI Bench | Kendall's tau_b0.341 | 3 | |
| Open-Ended Medical Evaluation | MEDIC Open-Ended | Kendall's tau_b0.485 | 3 | |
| Medical Safety Evaluation | MEDIC MedSafety | Kendall's tau_b0.552 | 3 | |
| Medical Chat Evaluation | MEDIC HealthBench | Kendall's Tau-b0.546 | 3 | |
| Classification | MEDIC OOD Evaluation | Top-1 Accuracy (DS)80.98 | 2 |