Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
DB Task on Lifelong Agent Bench Task
Loading...
96
Last Epoch Success Rate
MemP
85.6
88.3
91
93.7
Jan 6, 2026
Last Epoch Success Rate
Cumulative Success Rate (CSR)
Updated 4d ago
Evaluation Results
Method
Method
Links
Last Epoch Success Rate
Cumulative Success Rate (CSR)
MemP
Model=GPT-4o-mini
2026.01
96
96.6
MemRL
Model=GPT-4o-mini
2026.01
96
97.2
Mem0
Model=GPT-4o-mini
2026.01
92
92.6
RAG
Model=GPT-4o-mini
2026.01
91.4
91.6
Self-RAG
Model=GPT-4o-mini
2026.01
89.1
89.8
No Memory
Model=GPT-4o-mini
2026.01
86
-
Pass@10
Model=GPT-4o-mini
2026.01
-
92.8
Feedback
Search any
task
Search any
task