| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CT-RATE | U-VLM | F1 Score41.4 | 26 | 5d ago | |
| MIMIC-CXR (test) | MOTOR | BLEU-415.6 | 20 | 1mo ago | |
| DeepResearch Bench 2025 (test) | Comprehensiveness49.5 | 16 | 1mo ago | ||
| MIMIC-CXR-JPG (test) | VILA-M3-13B | BLEU-421.6 | 16 | 1mo ago | |
| WSI-Bench | MLLM-HWSI | BLEU-155.6 | 15 | 24d ago | |
| Heartcare-Bench I (test) | ScoreGPT78.8 | 14 | 11d ago | ||
| Heartcare-Bench S (test) | HeartcareGPT-7B | ScoreGPT76.55 | 14 | 11d ago | |
| TMALL | RecPilot | Accuracy4.6 | 14 | 1mo ago | |
| L-MIMIC | Maira2 | Precision61.5 | 14 | 1mo ago | |
| MIMIC IV | GEM | METEOR35.06 | 12 | 1mo ago | |
| WSI | HistoSelect | BLEU-143.1 | 12 | 1mo ago | |
| MMTT | ForgeryTalker | CIDEr59.3 | 11 | 9d ago | |
| CT-RATE (val) | CT-Agent | BLEU-150.2 | 11 | 1mo ago | |
| IU-Xray | VALOR | ROUGE-L33.1 | 10 | 1mo ago | |
| HistGen | MLLM-HWSI | BLEU-166.7 | 9 | 24d ago | |
| Radiology Report Generation | RadAgents | CheXbert Macro F1 (14)53.2 | 6 | 3d ago | |
| IXI | LLaBIT | ROUGE37.33 | 6 | 13d ago | |
| ATLAS 2.0 | LLaBIT | ROUGE33.69 | 6 | 13d ago | |
| BraTS MEN 2023 | LLaBIT | ROUGE Score33.36 | 6 | 13d ago | |
| BraTS 2021 | LLaBIT | ROUGE35.03 | 6 | 13d ago | |
| PTB-XL | GEM | LLM Score20.45 | 6 | 1mo ago | |
| TREC NeuCLIR 2024 | BulletPoints | Nugget Recall50.8 | 6 | 1mo ago | |
| MIMIC-CXR Exp. (test) | VILA-M3 | BLEU-421.6 | 6 | 1mo ago | |
| MIMIC CX | MedGemma 1.5 | Radgraph F127.2 | 5 | 11d ago | |
| DQ_F++ zero-shot 2024b | ForgeryTalker | BLEU-148.5 | 4 | 9d ago |