Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Agentic Task Success on ALFWorld
Loading...
87.1
Success Rate
ALMA
-0.468
22.266
45
67.734
Feb 8, 2026
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
ALMA
Foundation Model=GPT-5...
2026.02
87.1
Trajectory Retrieval
Foundation Model=GPT-5...
2026.02
80
Dynamic Cheatsheet
Foundation Model=GPT-5...
2026.02
78.6
G-Memory
Foundation Model=GPT-5...
2026.02
74.8
No Memory
Foundation Model=GPT-5...
2026.02
67.6
Reasoning Bank
Foundation Model=GPT-5...
2026.02
67.1
ALMA
Foundation Model=GPT-5...
2026.02
12.4
G-Memory
Foundation Model=GPT-5...
2026.02
7.6
Dynamic Cheatsheet
Foundation Model=GPT-5...
2026.02
5.7
Trajectory Retrieval
Foundation Model=GPT-5...
2026.02
5.2
Reasoning Bank
Foundation Model=GPT-5...
2026.02
5.2
No Memory
Foundation Model=GPT-5...
2026.02
2.9
Feedback
Search any
task
Search any
task