Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on RULER 64k context length

63.8QA Score

CLAA

-1.51215.44432.449.356Sep 10, 2024Dec 6, 2024Mar 4, 2025May 30, 2025Aug 26, 2025Nov 21, 2025Feb 17, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.02
63.8-87.7286.67-----82.0189.85--
2026.02
63.7-58.288.33-----73.1282.23--
2026.02
63.6-87.1686.67-----81.2887.69--
2026.02
63.3-86.8886.67-----83.7898.27--
2024.09
34.5----6.232.67----9.340.2
2024.09
29.5----0.66.33----2.634.9
2024.09
15----0.213.33----23.250.3
2024.09
9----015.67----0.312.4
2024.09
1----4.410.67----50.132.6
2026.02
-99.498.496.893.719.8070.447.665.8---
2026.02
-99.673.290.381.719.9063.444.459---