Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Long-term Memory Evaluation on LongMemEval single run
Loading...
46.7
F1 Score
Memory-R1-GRPO
17.1328
24.8089
32.485
40.1611
Apr 9, 2026
F1 Score
BLEU-1 Score
J Metric
Updated 9d ago
Evaluation Results
Method
Method
Links
F1 Score
BLEU-1 Score
J Metric
Memory-R1-GRPO
Inference Run Type=sin...
2026.04
46.7
41.1
57.8
TSUBASA-PRO
Memory Evaluation Sett...
2026.04
45.75
42.1
57.4
TSUBASA-PRO
Memory Evaluation Sett...
2026.04
43.43
39.85
53
A-Mem
Inference Run Type=sin...
2026.04
41.55
36.58
54.8
Memory-R1-PPO
Inference Run Type=sin...
2026.04
40.3
35.5
47.4
Mem0
Inference Run Type=sin...
2026.04
38.44
34.53
46.8
LoCoMo
Inference Run Type=sin...
2026.04
18.27
14.57
22.2
Feedback
Search any
task
Search any
task