Share your thoughts, 1 month free Claude Pro on usSee more

Long-context Reasoning on LongBench 2

50.3P@4

MPD

Updated 2mo ago

Evaluation Results

Method	Links
MPD 2026.05		50.3	700
CRISP 2026.05		49.9	1,000
Vanilla LLM 2026.05		48.7	900
Direct Comp. 2026.05		48.1	800
Chain-of-Draft 2026.05		40.8	1,800
LiteCoT 2026.05		36.6	2,000