Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-context Reasoning on LoCoMo (test)
Loading...
72.3
LLM Score
FullContext
28.516
39.883
51.25
62.617
Jan 13, 2026
LLM Score
Search Latency (ms)
Total Latency (ms)
Updated 4d ago
Evaluation Results
Method
Method
Links
LLM Score
Search Latency (ms)
Total Latency (ms)
FullContext
Evaluator LLM=GPT-4o-mini
2026.01
72.3
-
5,806
Nemori
Evaluator LLM=GPT-4o-mini
2026.01
72.1
835
3,448
SwiftMem
Evaluator LLM=GPT-4o-mini
2026.01
65.2
11
1,289
Mem0
Evaluator LLM=GPT-4o-mini
2026.01
61.3
784
3,539
Zep
Evaluator LLM=GPT-4o-mini
2026.01
58.5
522
3,255
LangMem
Evaluator LLM=GPT-4o-mini
2026.01
51.3
19,829
22,082
RAG-4096
Evaluator LLM=GPT-4o-m...
2026.01
30.2
544
2,884
Feedback
Search any
task
Search any
task