Share your thoughts, 1 month free Claude Pro on usSee more

Aggregated Logical Reasoning on Overall Unsolvable

0.945Accuracy

GPT-5.1-Low

Updated 4mo ago

Evaluation Results

Method	Links
GPT-5.1-Low 2025.12		0.945
Gemini-3.0-Pro 2025.12		0.855
Deepseek-V3.2-R 2025.12		0.85
Qwen3-4B-Instruct + UnsolRL-Final 2025.12		0.524
Qwen3-4B-Instruct 2025.12		0.345
Qwen3-1.7B-Instruct + UnsolRL-Final 2025.12		0.242
Qwen3-1.7B-Instruct 2025.12		0.18