| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| RULER | GA-S2 | RULER Score | 0.911 | 148 | 4d ago |
| LongBench | MXFP8 | Single-Document QA | 42.77 | 44 | 2d ago |
| LongBench-E 1.0 (test) | Elastic Attention | S-Doc QA Perf. | 49.92 | 37 | 4d ago |
| RULER | | Accuracy (8K Context) | 90.97 | 34 | 4d ago |
| HELMET | | Summarization Score | 247 | 27 | 4d ago |
| ZeroSCROLLS (test) | GDWM | GovReport Score | 35.8 | 24 | 4d ago |
| RULER 1.0 (test) | MInference | Accuracy (4K Context) | 0.977 | 16 | 4d ago |
| LongBench v2 | AdmTree | Single Doc QA | 34.9 | 10 | 4d ago |
| InfiniteBench (test) | SnapKV | En Sum Score | 1 | 10 | 4d ago |
| LongBench MultiFieldQA, MuSiQue, GovReport 2023 (test) | DroPE | MultiFieldQA Score | 32.18 | 8 | 4d ago |
| LongBench (test) | MoQAE | Qasper Score | 9.79 | 5 | 4d ago |
| LongPPL 32k | Engram-27B | Book Perplexity | 4.14 | 4 | 4d ago |
| LongBench V2 (test) | TRIM-KV | Acc (Short) | 35.39 | 3 | 4d ago |
| LongBench | Qwen3-1.7B | SAMSum | 42.04 | 3 | 4d ago |
| RULER | CoPE | Accuracy (8k Context) | 81.5 | 2 | 4d ago |