Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue Simulation on DialSim
Loading...
4.12
F1 Score
LightMem
1.0624
1.8562
2.65
3.4438
Apr 9, 2026
F1 Score
BLEU-1 Score
ROUGE-L Score
ROUGE-2 Score
METEOR Score
SBERT Score
Updated 9d ago
Evaluation Results
Method
Method
Links
F1 Score
BLEU-1 Score
ROUGE-L Score
ROUGE-2 Score
METEOR Score
SBERT Score
LightMem
Backbone=GPT-4o-mini
2026.04
4.12
3.95
4.2
4.15
2.48
23.4
A-MEM
Backbone=GPT-4o-mini
2026.04
3.45
3.37
3.54
3.6
2.05
19.51
LoCoMo
Backbone=GPT-4o-mini
2026.04
2.55
3.13
2.75
0.9
1.64
15.76
MemGPT
Backbone=GPT-4o-mini
2026.04
1.18
1.07
0.96
0.42
0.95
8.54
Feedback
Search any
task
Search any
task