
Long-context language understanding on RULER 16K 1.0 (test)

89.28 CWE Score

Full Cache

May 29, 2026
Updated 1d ago

Evaluation Results

Method Links
2026.05
89.2890.3399.61009998.998.910010010080.857.299.7293.36-
2026.05
82.6488.0797.69577.688.588.6594.496.499.67249.896.0486.64-
2026.05
59.8479.3313.62.8010.159.7539.29.62.42831.61423.110
2026.05
59.178.2132.4010.059.4536.89.22.428.231.413.2422.570
2026.05
47.3283.3310.820.49.89.742.24.42.424.630.414.3221.6710
2026.05
45.9882.810.41.80.49.69.540.83.42.424.231.812.3221.180
2026.05
30.7471.339.80.208.79.540.45.82.422294.5618.032
2026.05
22.7873.138.2008.79.243.242.421.628.62.6817.272
2026.05
10.3264.7310.21.409.059.25.442.425.427.41.4413.152
2026.05
9.7258.7350.25.61.645.146.393.694.42.623.830.468.7240.830
2026.05
8.864.93505.81.645.646.6593.695.42.6243169.641.518
2026.05
3.245923.241.41715.178.454.22.4212849.7227.440
2026.05
2.0619.811.80.209.159.205.62.42626.808.690
2026.05
1.7866.425.64.41.417.615.882.458.22.420.227.854.2429.098
2026.05
0.2853.836.41036.9532.4593.880.22.41526.252.2833.141
2026.05
0.1256.217.62.21.613.9513.241.431.22.42328.25.4418.193
2026.05
0.157.8734.22.21.228.6528.0550.6712.625.625.86.4825.721
2026.05
0.0257.816.40.40.212.711.2579.847.62.414.623.836.623.351