| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Needle-in-a-Haystack (test) | Accuracy100 | 56 | 16d ago | ||
| RULER | MrRoPE-Pro | Retrieval Accuracy (8K)96.2 | 34 | 9d ago | |
| S-NIAH | Latency (s)10.5 | 27 | 26d ago | ||
| NIAH 128k | Single Score24.4 | 20 | 10d ago | ||
| NIAH 64k | Single Score49.3 | 20 | 10d ago | ||
| Lost-in-the-Middle 30-passage contexts | PRISM-∆ | Average Exact Match62.57 | 20 | 1mo ago | |
| NIAH multivalue | FLy | Speedup4.1 | 20 | 1mo ago | |
| MLDR | Ettin-Enc-1B | MLDR40.2 | 17 | 1mo ago | |
| RULER | Accuracy80.1 | 14 | 16d ago | ||
| RULER 64K context | WindowedManifoldKV | Accuracy84.3 | 13 | 1mo ago | |
| NIAH-Multi | Kimi-K2 | Accuracy100 | 13 | 1mo ago | |
| Needle-in-a-Haystack | Retrieval Accuracy100 | 10 | 1mo ago | ||
| NIAH (Needle-In-A-Haystack) Retrieval Variants long-context | Gated DeltaNet | Single-1 Acc (1K)100 | 8 | 8d ago | |
| RULER 4K-32K context | ManifoldKV | Accuracy95.73 | 8 | 1mo ago | |
| Probing (val) | FILM-7B | Document Avg85.4 | 8 | 1mo ago | |
| NIAH (avg) | Qwen2.5-14B-Instruct-1M | Score (4k Context)100 | 7 | 1mo ago | |
| NIAH 32k | PHSA | NIAH Score99 | 6 | 1mo ago | |
| NIAH 16k | PHSA | NIAH Score98.6 | 6 | 1mo ago | |
| NIH | Multi-needle Avg Recall100 | 6 | 1mo ago | ||
| LITM | Accuracy100 | 5 | 1mo ago | ||
| NIAH | Accuracy100 | 5 | 1mo ago | ||
| Needle-in-a-Haystack 1.0 (test) | FastKV | Score99.9 | 5 | 1mo ago | |
| RULER | INF-V2 | Retrieval Accuracy (4K Context)95.9 | 5 | 1mo ago | |
| Needle-in-a-Haystack (NiH) | Accuracy (512 tokens)100 | 3 | 1mo ago |