| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| PopQA | CorVer | Accuracy35.3 | 56 | 5d ago | |
| DyKnow 130 time-sensitive facts Wikidata-derived | Correctness80 | 24 | 2mo ago | ||
| Wikidata knowledge infusion | PretrainRL | Accuracy64.69 | 18 | 3mo ago | |
| Factual Evaluation Suite HHEM, PopQA, TriviaQA | HHEM Accuracy96.22 | 12 | 2mo ago | ||
| WikiBench | Genius | WikiBench Score28.75 | 3 | 3mo ago |