Share your thoughts, 1 month free Claude Pro on usSee more

Agentic Long-context Reasoning on CL-bench (test)

26Solve Rate

RLM + PEEK

Updated 2mo ago

Evaluation Results

Method	Links
RLM + PEEK 2026.05		26	63.4
RLM + Compaction Agent 2026.05		20	54.6
RLM + ACE (Online Adaptation) 2026.05		20	53.5
RLM 2026.05		14	54.5
RLM + RAG 2026.05		14	55.6
RLM + Shared Chat 2026.05		12	51.3