Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Fact Consolidation (Single-Hop) on MemoryAgentBench (MAB) FC-SH 262K
Loading...
93
Accuracy
Ablation C (gpt-4o backbone)
3.56
26.78
50
73.22
May 31, 2026
Accuracy
Updated 1d ago
Evaluation Results
Method
Method
Links
Accuracy
Ablation C (gpt-4o backbone)
Pipeline=Ablation C (g...
2026.05
93
Headline
Pipeline=Headline, Bac...
2026.05
82
Ablation A (chunk-4096)
Pipeline=Ablation A (c...
2026.05
73
LLM-judgment baseline
Pipeline=LLM-judgment...
2026.05
61
GPT-4o (long-context)
system=long-context
2026.05
60
HippoRAG-v2 (best published)
2026.05
54
BM25 (MAB chunk-512 default)
2026.05
48
Zep / Graphiti (temporal KG)
2026.05
7
Feedback
Search any
task
Search any
task