Long-term Conversational Memory Retrieval on LoCoMo 2024
[Chart: Single Hop Accuracy. Highlighted result: GOM, 75.39 (May 8, 2026). Selectable metrics: Single Hop Accuracy, Multi-Hop Accuracy, Open Domain Accuracy, Temporal Accuracy, Overall Accuracy.]
Updated 23d ago
Evaluation Results
| Method | Configuration | Date | Single Hop Accuracy | Multi-Hop Accuracy | Open Domain Accuracy | Temporal Accuracy | Overall Accuracy |
|---|---|---|---|---|---|---|---|
| GOM | Chunk Size=2492, Evalu... | 2026.05 | 75.39 | 59.6 | 74.96 | 70.7 | 71.56 |
| Mem0 | Chunk Size=1764, Evalu... | 2026.05 | 67.13 | 51.15 | 72.93 | 55.51 | 66.8 |
| Mem0g | Chunk Size=3616, Evalu... | 2026.05 | 65.71 | 47.19 | 75.71 | 58.13 | 68.44 |
| LangMem | Chunk Size=127, Evalua... | 2026.05 | 62.23 | 47.92 | 71.12 | 23.43 | 58.1 |
| Zep | Chunk Size=3911, Evalu... | 2026.05 | 61.7 | 41.35 | 76.6 | 49.31 | 65.99 |
| A-Mem | Chunk Size=2520, Evalu... | 2026.05 | 39.79 | 18.85 | 54.05 | 49.91 | 48.38 |