Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Constraint Satisfaction on Sudoku
Loading...
57
CSP Result Index 35
DiffThinker
-2.28
13.11
28.5
43.89
Dec 30, 2025
CSP Result Index 35
CSP Result Index 40
CSP Result Index 45
Updated 4d ago
Evaluation Results
Method
Method
Links
CSP Result Index 35
CSP Result Index 40
CSP Result Index 45
DiffThinker
Setting=Flow Matching
2025.12
57
95
98
DiffThinker++
Setting=Flow Matching
2025.12
55
94
97
Gemini-3-Flash
Setting=N/A
2025.12
3
29
69
Qwen3-VL-8B
Setting=SFT
2025.12
2
17
30
Qwen3-VL-32B
Setting=SFT
2025.12
2
22
32
GPT-5
Setting=N/A
2025.12
0
0
2
Qwen3-VL-8B
Setting=N/A
2025.12
0
0
0
Qwen3-VL-8B
Setting=GRPO
2025.12
0
0
0
Qwen3-VL-32B
Setting=N/A
2025.12
0
0
0
Qwen3-VL-32B
Setting=GRPO
2025.12
0
1
3
Qwen-Image-Edit-2509
Setting=N/A
2025.12
0
0
0
Qwen-Image-Edit-2511
Setting=N/A
2025.12
0
0
0
Feedback
Search any
task
Search any
task