| Dataset Name | SOTA Method | Metric | Score | Count | Updated |
|---|---|---|---|---|---|
| MIMIC-CXR (test) | MOTOR | BLEU-4 | 15.6 | 20 | 4d ago |
| DeepResearch Bench 2025 (test) | | Comprehensiveness | 49.5 | 16 | 4d ago |
| MIMIC-CXR-JPG (test) | VILA-M3-13B | BLEU-4 | 21.6 | 16 | 4d ago |
| L-MIMIC | Maira2 | Precision | 61.5 | 14 | 2d ago |
| IU-Xray | VALOR | ROUGE-L | 33.1 | 10 | 2d ago |
| PTB-XL | GEM | LLM Score | 20.45 | 6 | 2d ago |
| TREC NeuCLIR 2024 | BulletPoints | Nugget Recall | 50.8 | 6 | 4d ago |
| MIMIC-CXR Exp. (test) | VILA-M3 | BLEU-4 | 21.6 | 6 | 4d ago |
| WSI | CPath-Omni | BLEU-1 | 33.7 | 5 | 4d ago |
| USAFact | EvidFuse | API Score | 33.13 | 4 | 4d ago |
| OurWorldInData | EvidFuse | API Interactions | 43.2 | 4 | 4d ago |
| Tableau | EvidFuse | API Score | 41.85 | 4 | 4d ago |
| CXR | NVILA | BLEU-4 | 22.8 | 4 | 4d ago |
| MIMIC | CAMEL | LLM Score | 62.59 | 3 | 2d ago |
| MIMIC IV | CAMEL | LLM Score | 62.59 | 3 | 2d ago |
| MIMIC-CXR | Proposed (NN) | NEM | 17.55 | 3 | 4d ago |
| Chest ImaGenome (test) | TRACE | BLEU-4 | 0.26 | 1 | 4d ago |