
Long-context language understanding on LongBench English subsets

NarrativeQA: 25.38 (W4A8+GlowQ)

Mar 26, 2026
Updated 22d ago

Evaluation Results

Method  Links
2026.03  25.38  12.61  25.71  15.37  9.93  15.3  34.07  22.67  27.14  52.06  38.55  88.49  43.6  71  72.73  36.97
2026.03  25.36  12.61  25.62  15.13  9.82  15.2  34  22.59  26.96  51.12  38.84  88.67  43.44  71  73.5  36.92
2026.03  23.5  13.54  27.87  16.83  10.94  16.44  34.27  22.87  26.87  52.81  48.04  90.77  43.94  71  73.13  38.19
2026.03  23.45  12.2  27.41  15.34  9.21  16.15  33.87  22.78  26.39  48.73  38.83  88.78  42.43  70.5  70.52  36.44