Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Exploration on ALFWorld (val)
Loading...
97.9
Success Rate
MemRL
83.028
86.889
90.75
94.611
Jan 6, 2026
Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate
MemRL
Model=GPT-5-mini
2026.01
97.9
RAG
Model=GPT-5-mini
2026.01
95
Self-RAG
Model=GPT-5-mini
2026.01
95
Mem0
Model=GPT-5-mini
2026.01
95
MemP
Model=GPT-5-mini
2026.01
92.1
No Memory
Model=GPT-5-mini
2026.01
83.6
Feedback
Search any
task
Search any
task