Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Decision Making on ALFWorld Avg v1
Loading...
59.88
Success Rate (SR)
BeliefMem
17.344
28.387
39.43
50.473
May 7, 2026
Success Rate (SR)
Average Steps
Updated 26d ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Average Steps
BeliefMem
Backbone=Qwen3-Next-80...
2026.05
59.88
29.56
BeliefMem
Backbone=Qwen3-Next-80...
2026.05
58.66
28.99
ReadAgent
Backbone=Qwen3-Next-80...
2026.05
54.03
27.65
Mem0
Backbone=Qwen3-Next-80...
2026.05
39.81
33.4
MemoryBank
Backbone=Qwen3-Next-80...
2026.05
37.96
35.07
LangMem
Backbone=Qwen3-Next-80...
2026.05
34.24
35.8
A-MEM
Backbone=Qwen3-Next-80...
2026.05
27.1
39.66
No-Memory
Backbone=Qwen3-Next-80...
2026.05
22.35
40.92
MemoryOS
Backbone=Qwen3-Next-80...
2026.05
18.98
42.69
Feedback
Search any
task
Search any
task