| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| NQ (Natural Questions) | CoopRAG (GPT-4o-mini) | EM72 | 52 | 21d ago | |
| NQ | SLEA-RL | Accuracy48.5 | 17 | 2mo ago | |
| NQ | HELP | EM43 | 13 | 3mo ago | |
| NQ (test) | Token-level F142 | 9 | 14d ago | ||
| SimpleQuestions | GPT-4+RFKG-CoT | Accuracy0.87 | 9 | 3mo ago | |
| Assembly Knowledge Graph QA Single-hop (test) | AssemMate | Accuracy82.1 | 5 | 3mo ago |