Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-context Reasoning on LoCoMo (test)

72.3LLM Score

FullContext

28.51639.88351.2562.617Jan 13, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
72.3-5,806
2026.01
72.18353,448
2026.01
65.2111,289
2026.01
61.37843,539
2026.01
58.55223,255
2026.01
51.319,82922,082
2026.01
30.25442,884