Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Classic Control on MountainCar Source v1.0
Loading...
0
Success Rate
No Memory (Plain)
-0.001
-0.0005
0
0.0005
Jan 27, 2026
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
No Memory (Plain)
Backbone=DeepSeek-V3.2
2026.01
0
Vanilla
Backbone=DeepSeek-V3.2
2026.01
0
Vanilla + GLOVE
Backbone=DeepSeek-V3.2
2026.01
0
MemoryBank
Backbone=DeepSeek-V3.2
2026.01
0
MemoryBank + GLOVE
Backbone=DeepSeek-V3.2
2026.01
0
Voyager
Backbone=DeepSeek-V3.2
2026.01
0
Voyager + GLOVE
Backbone=DeepSeek-V3.2
2026.01
0
Generative Agent
Backbone=DeepSeek-V3.2
2026.01
0
Generative Agent + GLOVE
Backbone=DeepSeek-V3.2
2026.01
0
Feedback
Search any
task
Search any
task