| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Real-world data | StaticHybrid | SQuAD Accuracy50.1 | 30 | 26d ago | |
| In-context retrieval (lm-evaluation-harness) zero-shot | KDA | FDA Accuracy23.23 | 23 | 1mo ago | |
| DROP | BERT-Judge | Accuracy88.6 | 16 | 1mo ago | |
| Real-world retrieval tasks 2K tokens (test) | Gated DeltaNet-2 | SWDE Score41.96 | 13 | 12d ago | |
| SWDE | Hybrid Gated DeltaNet + M2RNN-1 | Accuracy62.5 | 13 | 2mo ago | |
| FDA | Hybrid M2RNN | Accuracy74.5 | 13 | 2mo ago | |
| TriviaQA | Hybrid Mamba-2 + M2RNN-3 | Accuracy56.7 | 13 | 2mo ago | |
| SQuAD | Hybrid M2RNN | Accuracy41.3 | 13 | 2mo ago | |
| RULER (test) | S1 Score100 | 8 | 1mo ago | ||
| RULER | MQ Score99.55 | 4 | 1mo ago |