Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-context Memory Evaluation on BEHEMOTH LongMemEval (out-of-distribution)
Loading...
63.07
Accuracy
CluE
18.7452
30.2526
41.76
53.2674
Apr 13, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
CluE
2026.04
63.07
MemEvolve
2026.04
56.82
Simple
2026.04
46.02
GEPA
2026.04
35.06
ACE
2026.04
29.71
No Memory
2026.04
20.45
Feedback
Search any
task
Search any
task