Multi-session on LongMemEval-M
[Chart: F1 Score by date. Top result: HIPPOCAMPUS, 4.81 (Feb 14, 2026). Metrics: F1 Score, Accuracy, LLM-as-a-Judge Score.]
Updated 2d ago
Evaluation Results
Method        Date      F1 Score   Accuracy   LLM-as-a-Judge Score
HIPPOCAMPUS   2026.02   4.81       5.26       1.94
MemOS         2026.02   3.68       3.95       1.72
MemoryOS      2026.02   3.18       3.42       1.64
A-mem         2026.02   2.68       2.89       1.57
MemGPT        2026.02   1.7        1.84       1.07
MemoryBank    2026.02   1.45       1.58       1.17
ReadAgent     2026.02   1.2        1.32       1.08