Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
8-Puzzle Solving on 8-Puzzle Solvable
Loading...
43.6
Accuracy
Gemini-3.0-Pro
-0.912
10.644
22.2
33.756
Dec 1, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini-3.0-Pro
Model=Gemini-3.0-Pro
2025.12
43.6
Deepseek-V3.2-R
Model=Deepseek-V3.2-R
2025.12
40.2
GPT-5.1-Low
Model=GPT-5.1-Low
2025.12
28.9
Qwen3-4B-Instruct + UnsolRL-Final
Base Model=Qwen3-4B-In...
2025.12
26.4
Qwen3-4B-Instruct
Model=Qwen3-4B-Instruct
2025.12
21.6
Qwen3-1.7B-Instruct + UnsolRL-Final
Base Model=Qwen3-1.7B-...
2025.12
1.2
Qwen3-1.7B-Instruct
Model=Qwen3-1.7B-Instruct
2025.12
0.8
Feedback
Search any
task
Search any
task