| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LongMemEval | EFFGEN | Accuracy34.72 | 25 | 3mo ago | |
| LoCoMo | EFFGEN | Accuracy30.18 | 25 | 3mo ago | |
| LoCoMo | EFFGEN | Execution Time (min)21.7 | 25 | 3mo ago | |
| RULER HotpotQA | DARE + TIES | Score67 | 24 | 1d ago | |
| SQuAD-32K | RAM | Score80 | 12 | 1d ago | |
| RULER SQuAD | MemAgent (Memory) | F1 Score (32K Context)81.25 | 11 | 3mo ago | |
| POPGym Copy k=10 | LinOSS | Temporal Range16.715 | 4 | 3mo ago | |
| POPGym Copy k=5 | Temporal Range17.255 | 4 | 3mo ago | ||
| POPGym Copy k=3 | Temporal Range17.312 | 4 | 3mo ago | ||
| POPGym Copy k=1 | Temporal Range12.294 | 4 | 3mo ago | ||
| POPGym RepeatFirst | LinOSS | Temporal Range21.177 | 4 | 3mo ago | |
| RulerQA | WUDI | RulerQA Acc (32k Context)82.03 | 4 | 3mo ago |