Long-context language modeling evaluation on RULER 32K

Chart: S1 Score (RULER 32K). Method shown: SnapKV. Date: Mar 21, 2026.
Updated 25d ago

Evaluation Results

Method, Links
2026.03
10085.889.813.67.2516.520.8862.7745.849.16
2026.03
10080.678.814.43.695.0520.4860.2744.645.31
2026.03
10090.6924424.856.520.9264.2750.860.43
2026.03
10095.29638.431.0564.3521.665.851.462.64
2026.03
99.698989746.694.1522.2869.7853.275.4
2026.03
98.485.692.48.47.7517.121.2466.4751.249.84
2026.03
0.8000004.0424.223.65.85