Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Average Performance across Context Lengths (8k-128k) on Average across tasks

57.2Performance @ 8k Context Length

CLP

51.27252.81154.3555.889Mar 18, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
57.254.152.149.344
2026.03
56.853.749.53831.7
2026.03
56.853.95034.54.1
2026.03
5652.14945.440.7
2026.03
55.753.450.647.242.5
2026.03
55.251.443.63627.8
2026.03
54.751.547.739.431.8
2026.03
54.751.647.434.121
2026.03
53.449.248.34641
2026.03
53.348.745.742.537.8
2026.03
5349.747.544.341.3
2026.03
52.849.945.840.930.7
2026.03
51.540.329.625.321.7