Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Agentic Task Success on Textworld
Loading...
75
Success Rate
ALMA
-0.816
18.867
38.55
58.233
Feb 8, 2026
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
ALMA
Foundation Model=GPT-5...
2026.02
75
G-Memory
Foundation Model=GPT-5...
2026.02
68.8
Trajectory Retrieval
Foundation Model=GPT-5...
2026.02
67
No Memory
Foundation Model=GPT-5...
2026.02
60.5
Dynamic Cheatsheet
Foundation Model=GPT-5...
2026.02
57.8
Reasoning Bank
Foundation Model=GPT-5...
2026.02
56.1
ALMA
Foundation Model=GPT-5...
2026.02
6.2
No Memory
Foundation Model=GPT-5...
2026.02
5.4
Reasoning Bank
Foundation Model=GPT-5...
2026.02
5.3
Dynamic Cheatsheet
Foundation Model=GPT-5...
2026.02
4.3
Trajectory Retrieval
Foundation Model=GPT-5...
2026.02
2.7
G-Memory
Foundation Model=GPT-5...
2026.02
2.1
Feedback
Search any
task
Search any
task