8-Puzzle Solving on 8-Puzzle Mean
[Chart: Accuracy over time. Top result: Gemini-3.0-Pro, 70.3 Accuracy (Dec 1, 2025)]
Updated 4d ago
Evaluation Results
Method                               Details                    Date     Accuracy
Gemini-3.0-Pro                       Model=Gemini-3.0-Pro       2025.12  70.3
GPT-5.1-Low                          Model=GPT-5.1-Low          2025.12  63.5
Deepseek-V3.2-R                      Model=Deepseek-V3.2-R      2025.12  59.6
Qwen3-4B-Instruct + UnsolRL-Final    Base Model=Qwen3-4B-In...  2025.12  20.1
Qwen3-4B-Instruct                    Model=Qwen3-4B-Instruct    2025.12  10.8
Qwen3-1.7B-Instruct + UnsolRL-Final  Base Model=Qwen3-1.7B-...  2025.12  1.1
Qwen3-1.7B-Instruct                  Model=Qwen3-1.7B-Instruct  2025.12  0.4