Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Overall reasoning performance on LoCoMo (test)
Loading...
80.6
LLM Score
FullContext
30.992
43.871
56.75
69.629
Jan 13, 2026
LLM Score
F1 Score
BLEU-1
Updated 4d ago
Evaluation Results
Method
Method
Links
LLM Score
F1 Score
BLEU-1
FullContext
LLM=GPT-4.1-mini
2026.01
80.6
53.3
45
Nemori
LLM=GPT-4.1-mini
2026.01
79.2
51.9
44.5
LangMem
LLM=GPT-4.1-mini
2026.01
73.4
47.6
40
SwiftMem
LLM=GPT-4.1-mini
2026.01
70.4
42.9
46.7
Mem0
LLM=GPT-4.1-mini
2026.01
66.3
43.5
36.5
Zep
LLM=GPT-4.1-mini
2026.01
61.6
36.9
30.9
RAG
LLM=GPT-4.1-mini
2026.01
32.9
23.5
19.2
Feedback
Search any
task
Search any
task