| Dataset Name | SOTA Method | Metric | Value | Trend | Last Updated |
|---|---|---|---|---|---|
| TopiOCQA | Ours | F1 Score | 40.3 | 17 | 3mo ago |
| QReCC | ChatR1 | F1 Score | 31 | 16 | 3mo ago |
| INSCIT | UniConv | F1 Score | 33.2 | 16 | 3mo ago |
| CORAL | Ours | F1 | 28.9 | 13 | 3mo ago |
| Mix domain | Hyper-RAG | Prompt Tokens | 19,235 | 8 | 1mo ago |
| LongMemEval-s | All-Mem | 4o-J Score | 60.2 | 8 | 2mo ago |
| LoCoMo | All-Mem | 4o-J | 54.63 | 8 | 2mo ago |
| ArchEHR-QA 2026 (test) | | Overall Score | 36.3 | 6 | 21d ago |
| MS-MARCO | GAVA-SDA | Accuracy | 87.7 | 4 | 3mo ago |
| WQA | GAVA-SDA | Accuracy | 86.8 | 4 | 3mo ago |
| ArchEHR-QA | HealthNLP_Retrievers | SARI | 59.2 | 3 | 1mo ago |
| IQAD (internal) | GAVA-DDA | Accuracy | 9.85 | 3 | 3mo ago |