Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Preference Evaluation via Multi-Choice Queries on PrefEval Implicit
Loading...
69.9
Accuracy
MemCoE
29.236
39.793
50.35
60.907
May 1, 2026
Accuracy
Overall Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Overall Score
MemCoE
Backbone=Qwen2.5-7B-In...
2026.05
69.9
52.02
MemAgent
Backbone=Qwen2.5-7B-In...
2026.05
63.6
45
Mem-α
Backbone=Qwen3-4B, Ret...
2026.05
62.5
44.19
LightMem
Backbone=Qwen2.5-7B-In...
2026.05
54.8
41.21
A-Mem
Backbone=Qwen2.5-7B-In...
2026.05
52.8
42.64
Mem0
Backbone=Qwen2.5-7B-In...
2026.05
46.4
38.23
RAG
Backbone=Qwen2.5-7B-In...
2026.05
32.4
36.68
Long Context
Backbone=Qwen2.5-7B-In...
2026.05
30.8
26.9
Feedback
Search any
task
Search any
task