Single-session-preference on LongMemEval-M
[Chart: best F1 Score 13.79 (HIPPOCAMPUS), Feb 14, 2026. Available metrics: F1 Score, Accuracy, LLM-as-a-Judge Score.]
Evaluation Results
Method        Date      F1 Score   Accuracy   LLM-as-a-Judge Score
HIPPOCAMPUS   2026.02   13.79      3.33       2.87
MemOS         2026.02   10.36      2.5        2.58
MemoryOS      2026.02   8.98       2.16       2.44
A-mem         2026.02   7.59       1.83       2.3
MemGPT        2026.02   4.83       1.17       1.58
MemoryBank    2026.02   4.14       1          1.72
ReadAgent     2026.02   3.45       0.83       1.15