Aggregated Logical Reasoning on Overall Unsolvable
Top result: GPT-5.1-Low, Accuracy 0.945 (Dec 1, 2025)
Evaluation Results
| Method | Details | Date | Accuracy |
|---|---|---|---|
| GPT-5.1-Low | Model=GPT-5.1-Low | 2025.12 | 0.945 |
| Gemini-3.0-Pro | Model=Gemini-3.0-Pro | 2025.12 | 0.855 |
| Deepseek-V3.2-R | Model=Deepseek-V3.2-R | 2025.12 | 0.85 |
| Qwen3-4B-Instruct + UnsolRL-Final | Base Model=Qwen3-4B-In... | 2025.12 | 0.524 |
| Qwen3-4B-Instruct | Model=Qwen3-4B-Instruct | 2025.12 | 0.345 |
| Qwen3-1.7B-Instruct + UnsolRL-Final | Base Model=Qwen3-1.7B-... | 2025.12 | 0.242 |
| Qwen3-1.7B-Instruct | Model=Qwen3-1.7B-Instruct | 2025.12 | 0.18 |