| Dataset Name | SOTA Method | Metric | Trend | |
|---|---|---|---|---|
| FDA (test) | GA-S2 | Score 0.8004 | 120 | 1mo ago |
| RULER Context Length = 8K | | Average Accuracy (RULER 8K) 89.59 | 72 | 10d ago |
| HELMET | FullAttention | Average Sparsity 0 | 28 | 1mo ago |
| RULER | FlashPrefill | Score (4K) 97.27 | 18 | 1mo ago |
| HELMET held-out eval | Qwen 2.5 32B | Accuracy (8K Context) 57.61 | 13 | 1mo ago |
| RULER | | Single-key Accuracy 100 | 8 | 1mo ago |
| RULER 32K | SnapKV | S1 Score (RULER 32K) 100 | 7 | 25d ago |