Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context Reasoning on LoCoMo (test)

72.3LLM Score

FullContext

28.51639.88351.2562.617Jan 13, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
72.3-5,806
2026.01
72.18353,448
2026.01
65.2111,289
2026.01
61.37843,539
2026.01
58.55223,255
2026.01
51.319,82922,082
2026.01
30.25442,884