Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Knowledge Update Evaluation on LongMemEval S
Loading...
11.83
F1 Score
HIPPOCAMPUS
2.6052
5.0001
7.395
9.7899
Feb 14, 2026
F1 Score
Accuracy
LLM-as-a-Judge Score
Updated 2d ago
Evaluation Results
Method
Method
Links
F1 Score
Accuracy
LLM-as-a-Judge Score
HIPPOCAMPUS
2026.02
11.83
32.05
2.63
MemOS
2026.02
8.88
24.04
2.36
MemoryOS
2026.02
7.73
20.83
2.23
A-mem
2026.02
6.54
17.63
2.1
MemGPT
2026.02
4.15
11.22
1.43
MemoryBank
2026.02
3.55
9.62
1.58
ReadAgent
2026.02
2.96
8.01
1.05
Feedback
Search any
task
Search any
task