Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Grid-World Navigation on FrozenLake Drift II
Loading...
8,500
Success Rate
Generative Agent + GLOVE
-340
1,955
4,250
6,545
Jan 27, 2026
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
Generative Agent + GLOVE
Backbone=GPT-4o, Augme...
2026.01
8,500
Vanilla + GLOVE
Backbone=GPT-4o, Augme...
2026.01
8,000
Voyager + GLOVE
Backbone=GPT-4o, Augme...
2026.01
7,500
MemoryBank + GLOVE
Backbone=GPT-4o, Augme...
2026.01
6,500
MemoryBank
Backbone=GPT-4o
2026.01
4,500
MemoryBank + GLOVE
Base LLM=Grok-3, Agent...
2026.01
80
Vanilla + GLOVE
Base LLM=Grok-3, Agent...
2026.01
70
Generative Agent + GLOVE
Base LLM=Grok-3, Agent...
2026.01
70
Voyager + GLOVE
Base LLM=Grok-3, Agent...
2026.01
65
Generative Agent
Base LLM=Grok-3, Agent...
2026.01
45
No Memory (Plain)
Base LLM=Grok-3, Agent...
2026.01
0
Vanilla
Base LLM=Grok-3, Agent...
2026.01
0
MemoryBank
Base LLM=Grok-3, Agent...
2026.01
0
Voyager
Base LLM=Grok-3, Agent...
2026.01
0
No Memory (Plain)
Backbone=GPT-4o
2026.01
0
Vanilla
Backbone=GPT-4o
2026.01
0
Voyager
Backbone=GPT-4o
2026.01
0
Generative Agent
Backbone=GPT-4o
2026.01
0
Feedback
Search any
task
Search any
task