| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| TruthfulQA MCQ | STAR-1 | MCQ Accuracy39.33 | 18 | 19d ago | |
| HaluEval General | Domain-Grounded Tiered Retrieval | Baseline Wins30 | 2 | 1mo ago | |
| TruthfulQA | Domain-Grounded Tiered Retrieval | Baseline Wins12 | 2 | 1mo ago | |
| FreshQA v2 | Domain-Grounded Tiered Retrieval | Baseline Wins16 | 2 | 1mo ago | |
| MMLU Global Facts | Domain-Grounded Tiered Retrieval | Baseline Wins3 | 2 | 1mo ago | |
| TimeQA v2 | Domain-Grounded Tiered Retrieval | Baseline Wins4 | 2 | 1mo ago |