Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
FrozenLake Navigation on FrozenLake
Loading...
96
Success Rate (Static)
Qwen2.5-3B-It + Evolving Stage
89.76
91.38
93
94.62
Jan 29, 2026
Success Rate (Static)
Success Rate (Slippery)
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate (Static)
Success Rate (Slippery)
Qwen2.5-3B-It + Evolving Stage
2026.01
96
90
Qwen2.5-1.5B-It + Evolving Stage
2026.01
95
90
GPT-OSS-120B
2026.01
95
88
Qwen2.5-0.5B-It + Evolving Stage
2026.01
93
88
LLaMA3.1-1B-It + Evolving Stage
2026.01
91
88
Scout-DQN
2026.01
90
80
Feedback
Search any
task
Search any
task