| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Question Answering | NQ-Open (val) | Accuracy30.7 | 28 | |
| Hallucination detection | NQ-Open | AUROC0.8843 | 27 | |
| Factual Question Answering | NQ-Open ID | Precision57.34 | 24 | |
| Question Answering | NQ-open v1.0 (test) | A179.08 | 16 | |
| Question Answering | NQ-Open (out-of-domain) | Precision0.705 | 12 |