
Long-context language understanding on RULER 64k context length

[Chart: FWE (Error) over time, Sep 10, 2024 to May 12, 2026; labeled series: Full Attention]
Updated 5d ago

Evaluation Results

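Method | Links
2026.02 | 0 | 99.4 | 98.4 | 96.8 | 93.7 | 19.8 | 70.4 | 47.6 | 65.8 | - | - | - | - | - | - | -
2026.02 | 0 | 99.6 | 73.2 | 90.3 | 81.7 | 19.9 | 63.4 | 44.4 | 59 | - | - | - | - | - | - | -
2024.09 | 6.33 | - | - | - | - | 0.6 | - | - | - | - | 29.5 | 2.6 | 34.9 | - | - | -
2024.09 | 10.67 | - | - | - | - | 4.4 | - | - | - | - | 15 | 0.1 | 32.6 | - | - | -
2024.09 | 13.33 | - | - | - | - | 0.2 | - | - | - | - | 15 | 23.2 | 50.3 | - | - | -
2024.09 | 15.67 | - | - | - | - | 0 | - | - | - | - | 9 | 0.3 | 12.4 | - | - | -
2025.03 | 18.4 | 35.42 | - | 1.56 | 1.82 | 1.46 | 20.83 | 30.21 | 13.78 | - | - | - | - | 12.5 | 13.54 | 2.08
2024.09 | 32.67 | - | - | - | - | 6.2 | - | - | - | - | 34.5 | 9.3 | 40.2 | - | - | -
        | 34.17 | - | - | - | - | - | - | - | - | - | - | - | 2 | - | - | -
2025.03 | 58.33 | 89.58 | - | 21.35 | 14.06 | 49.38 | 61.46 | 35.42 | 45.04 | - | - | - | - | 66.67 | 43.75 | 10.42
2025.03 | 61.11 | 25 | - | 44.79 | 28.91 | 3.54 | 47.92 | 36.46 | 45.71 | - | - | - | - | 51.04 | 61.46 | 96.88
2025.03 | 68.06 | 100 | - | 98.44 | 100 | 95.42 | 83.33 | 57.29 | 89.94 | - | - | - | - | 100 | 100 | 96.88
2025.03 | 68.75 | 100 | - | 97.66 | 100 | 95 | 83.33 | 58.33 | 89.68 | - | - | - | - | 100 | 98.96 | 94.79
2025.03 | 78.47 | 100 | - | 96.62 | 96.09 | 86.67 | 78.13 | 56.25 | 88.5 | - | - | - | - | 96.88 | 97.92 | 97.92
2025.03 | 80.21 | 94.79 | - | 66.15 | 82.81 | 46.04 | 48.96 | 50 | 71.79 | - | - | - | - | 66.67 | 83.33 | 98.96
2025.03 | 84.38 | 100 | - | 89.58 | 91.41 | 81.46 | 80.21 | 55.21 | 86.87 | - | - | - | - | 96.88 | 98.96 | 90.63
2025.03 | 85.42 | 100 | - | 97.66 | 98.96 | 97.29 | 83.33 | 59.38 | 91.89 | - | - | - | - | 100 | 98.96 | 97.92
2025.03 | 85.76 | 100 | - | 91.93 | 100 | 95 | 71.88 | 60.42 | 90.19 | - | - | - | - | 97.92 | 100 | 98.96
2025.03 | 86.11 | 100 | - | 97.14 | 99.74 | 99.58 | 83.33 | 64.58 | 91.17 | - | - | - | - | 100 | 100 | 81.25
2025.03 | 89.58 | 100 | - | 92.97 | 98.18 | 97.71 | 78.13 | 64.58 | 90.45 | - | - | - | - | 98.96 | 95.83 | 88.54
2025.03 | 90.28 | 100 | - | 95.57 | 99.74 | 99.79 | 81.25 | 65.63 | 91.66 | - | - | - | - | 100 | 100 | 84.38
2025.03 | 91.32 | 100 | - | 98.96 | 100 | 99.58 | 80.21 | 64.58 | 93.36 | - | - | - | - | 100 | 100 | 98.96
2026.02 | - | - | 86.88 | 86.67 | - | - | - | - | 83.78 | 98.27 | 63.3 | - | - | - | - | -
2026.02 | - | - | 58.2 | 88.33 | - | - | - | - | 73.12 | 82.23 | 63.7 | - | - | - | - | -
2026.02 | - | - | 87.16 | 86.67 | - | - | - | - | 81.28 | 87.69 | 63.6 | - | - | - | - | -
2026.02 | - | - | 87.72 | 86.67 | - | - | - | - | 82.01 | 89.85 | 63.8 | - | - | - | - | -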