Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agentic Long-context Reasoning on CL-bench (test)

26Solve Rate

RLM + PEEK

11.4415.221922.78May 19, 2026
Updated 14d ago

Evaluation Results

MethodLinks
2026.05
2663.4
2026.05
2054.6
2026.05
2053.5
2026.05
1454.5
2026.05
1455.6
2026.05
1251.3