| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Must-C & Spoken-SQuAD | contr-cos-all + giga | Normalized Average1.1418 | 15 | 4d ago | |
| Average across all benchmarks | LaVer | Average Score59.94 | 12 | 4d ago | |
| LoCoMo | BLEU48.7 | 8 | 4d ago | ||
| CoP-QA-F | Talk2DM | AC Score97.6 | 6 | 4d ago | |
| AfroNLG (test) | Cheetah | AfroNLG Score14.25 | 5 | 4d ago | |
| Aggregate General, Math, Coding | NBDiff-7B-BASE | Average Accuracy65.3 | 4 | 4d ago |