
Long-context language understanding on RULER 16K

CWE Score: 89.28

Full Cache

May 29, 2026
Updated 1d ago

Evaluation Results

Method    Links
2026.05
89.2890.3399.61009998.998.910010010080.857.299.7293.36-
2026.05
82.6488.0797.69577.688.588.6594.496.499.67249.896.0486.64-
2026.05
77.6681.3325.26.21.418.917.84435.82.435.637.241.7632.710
2026.05
77.482.1326.661.619.518.245.4382.435.636.841.8833.198
2026.05
70.388516.65.61.411.359.845.414.62.428.633.616.2826.239
2026.05
68.3284.6714.45.61.210.759.3546112.428.434.615.825.580
2026.05
65.6479.072130.817.0515.6546.4352.627.434.420.628.352
2026.05
59.9683.5312.43.20.410.359.6545.810.82.42631.67.6423.361
2026.05
43.388.7323.84.6115.7514.643.629.42.43632.220.427.372
2026.05
42.4291.4144.41.49.69.0522.610.82.430.432.66.6821.373
2026.05
42.2278.891.215.23.690.1581.2595.2996.237.637.485.858.7410
2026.05
41.977.0790.414.63.690.380.995995.83736.684.8858.230
2026.05
22.6872.4759124.643.238.8589.887.23.228.631.876.0843.810
2026.05
20.375.076514.24.846.6541.6591.291328.632.277.4845.4710
2026.05
12.970.1386.45.41.686.975.5594.296.43.823.83180.0451.390
2026.05
4.948480.68.43.27366.8588.693.84.637.433.638.1647.472
2026.05
3.968.5350.841.636.3529.391.481.23.221.82967.6437.591
2026.05
3.5883.845.294.231.929.266.876.22.82932.827.834.023