Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Single-session-preference on LongMemEval S (test)
Loading...
14.14
F1 Score
HIPPOCAMPUS
3.116
5.978
8.84
11.702
Feb 14, 2026
F1 Score
Accuracy
LLM-as-a-Judge Score
Updated 2d ago
Evaluation Results
Method
Method
Links
F1 Score
Accuracy
LLM-as-a-Judge Score
HIPPOCAMPUS
2026.02
14.14
16.67
3.13
MemOS
2026.02
10.61
12.5
2.81
MemoryOS
2026.02
9.21
10.83
2.66
A-mem
2026.02
7.78
9.17
2.5
MemGPT
2026.02
4.95
5.83
1.72
MemoryBank
2026.02
4.25
5
1.88
ReadAgent
2026.02
3.54
4.17
1.25
Feedback
Search any
task
Search any
task