| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Explain (+ domain) | Conf + Probe (SCAO) | Accuracy64.87 | 20 | 1mo ago | |
| Explain original | Conf + Probe (SCAO) | Accuracy80.91 | 20 | 1mo ago | |
| Mintaka refined by question type and domain | Conf (SCAO) | AUROC75.51 | 20 | 1mo ago | |
| Mintaka refined by question type | Conf + Probe (SCAO) | AUROC77.89 | 20 | 1mo ago | |
| Overshadowing | CoDA | Accuracy (Time)65 | 16 | 1mo ago | |
| HotpotQA + type | Conf + Probe (SCAO) | AUROC75.51 | 10 | 1mo ago | |
| HotpotQA (original) | Conf + Probe (SCAO) | AUROC83.39 | 10 | 1mo ago | |
| ParaRel + domain | Conf + Probe (SCAO) | Accuracy69.24 | 10 | 1mo ago | |
| ParaRel original | Probe_dnn | Accuracy82.29 | 10 | 1mo ago | |
| Mintaka (original) | Conf + Probe (SCAO) | AUROC79.41 | 10 | 1mo ago | |
| Mintaka unrefined (original) | Conf + Probe (SCAO) | AUROC79.41 | 10 | 1mo ago | |
| Explain domain refined | Conf + Probe (SCAO) | AUROC70.04 | 10 | 1mo ago | |
| Explain unrefined (original) | Conf + Probe (SCAO) | AUROC85.42 | 10 | 1mo ago | |
| MemoTrap | GAME-LoRA | Accuracy65 | 6 | 1mo ago | |
| NQ-Swap | Alarmer | Accuracy (entity)29.4 | 4 | 1mo ago |