| Dataset Name | SOTA Method | Metric | Score | Trend | |
|---|---|---|---|---|---|
| S3 1.0 (test) | Qwen3-VL-32B | Precision | 98 | 36 | 12d ago |
| MIMIC-CDM (test) | | Appendicitis Accuracy | 99.4 | 21 | 1mo ago |
| MIMIC-IV | Full Hybrid | Accuracy | 89.3 | 15 | 8d ago |
| RJUA (OOD) | AutoRefine-ICDTree | Exact Match (EM) | 13.27 | 14 | 6d ago |
| MedXpertQA (OOD) | | Exact Match (EM) | 22.97 | 14 | 6d ago |
| MedQA (OOD) | | EM | 20 | 14 | 6d ago |
| MedDDx-Plus (In-domain) | C-MIG | Exact Match (EM) | 57.43 | 14 | 6d ago |
| Sleep Stag (test) | PatchTST (Supervised Learning) | Top-1 Accuracy | 84.29 | 10 | 2mo ago |
| Hypertension (test) | SLIPBase | Top-1 Accuracy | 50 | 10 | 2mo ago |
| Stroke (test) | SLIPBase | Top-1 Accuracy | 91.67 | 10 | 2mo ago |
| RJUA | C-MIG | EM | 11.37 | 9 | 6d ago |
| MedXpertQA | C-MIG | EM | 7.52 | 9 | 6d ago |
| MedQA | AutoRefine-Embedding | Exact Match (EM) | 13.5 | 9 | 6d ago |
| MedDDx-Plus | C-MIG | EM Score | 60.82 | 9 | 6d ago |
| Medical Clinical Diagnosis | GPT-4o | IDC | 97.88 | 9 | 8d ago |
| RareBench HMS subset n=88 | Mixed-Vendor MAC | Recall@1 | 51.14 | 7 | 2mo ago |
| RareBench (combined) | Mixed-Vendor MAC | Recall@1 | 39.31 | 7 | 2mo ago |
| RareBench LIRICAL | | Recall@1 | 36.24 | 7 | 2mo ago |
| CD | SUPERMAN | ECE | 0.028 | 7 | 3mo ago |
| OASST (test) | | Accuracy (Appendicitis) | 82 | 1 | 3mo ago |