Long-Context Language Modeling on RULER (Context Sweep)
[Chart: Accuracy (8k Context) over time. Highlighted point: CoPE, 81.5, Feb 5, 2026. Selectable metrics: Accuracy at 8k, 16k, 32k, 64k, 128k, and 256k context.]
Updated 4d ago
Evaluation Results
Method | Date    | Accuracy (8k Context) | Accuracy (16k Context) | Accuracy (32k Context) | Accuracy (64k Context) | Accuracy (128k Context) | Accuracy (256k Context)
CoPE   | 2026.02 | 81.5                  | 82.84                  | 82.75                  | 76.71                  | 61.95                   | 46.86
RoPE   | 2026.02 | 80.52                 | 82.33                  | 82.11                  | 76.93                  | 61.19                   | 28.86
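
To make the comparison across context lengths easier to read, the per-context accuracy gap between the two methods can be computed directly from the values listed above. The following is a minimal sketch, not part of the benchmark itself: the names `CONTEXTS`, `ACCURACY`, and `accuracy_gap` are hypothetical helpers introduced here for illustration, and the numbers are copied from the results as listed.

```python
# Accuracy values copied from the Evaluation Results listed above.
# CONTEXTS, ACCURACY, and accuracy_gap are illustrative names, not part of RULER.
CONTEXTS = ["8k", "16k", "32k", "64k", "128k", "256k"]
ACCURACY = {
    "CoPE": [81.5, 82.84, 82.75, 76.71, 61.95, 46.86],
    "RoPE": [80.52, 82.33, 82.11, 76.93, 61.19, 28.86],
}


def accuracy_gap(a, b):
    """Return {context: accuracy of a minus accuracy of b}, rounded to 2 decimals."""
    return {
        context: round(x - y, 2)
        for context, x, y in zip(CONTEXTS, ACCURACY[a], ACCURACY[b])
    }


gaps = accuracy_gap("CoPE", "RoPE")
for context, gap in gaps.items():
    # Positive values mean CoPE scored higher at that context length.
    print(f"{context:>4}: {gap:+.2f}")
```

On these numbers the two methods stay within about one point of each other up to 128k context, with RoPE slightly ahead only at 64k, and the gap widens to 18 points at 256k.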