Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-Context Language Modeling on RULER (Context Sweep)
Loading...
81.5
Accuracy (8k Context)
CoPE
80.4808
80.7454
81.01
81.2746
Feb 5, 2026
Accuracy (8k Context)
Accuracy (16k Context)
Accuracy (32k Context)
Accuracy (64k Context)
Accuracy (128k Context)
Accuracy (256k Context)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy (8k Context)
Accuracy (16k Context)
Accuracy (32k Context)
Accuracy (64k Context)
Accuracy (128k Context)
Accuracy (256k Context)
CoPE
2026.02
81.5
82.84
82.75
76.71
61.95
46.86
RoPE
2026.02
80.52
82.33
82.11
76.93
61.19
28.86
Feedback
Search any
task
Search any
task