Problem Solving and Unsolvability Detection on Hitori

Accuracy (Solvable): 98

Gemini-3

Updated 4mo ago

Evaluation Results

Method	Links	Accuracy (Solvable)	Accuracy (Unsolvable)	Average
Gemini-3 2025.12		98	100	99
Deepseek-V3.2-R 2025.12		70	86	78
Qwen3-4B + UnsolvableRL 2025.12		63.5	94.5	79
Qwen3-4B Instruct 2025.12		34.5	6.5	20.5
Qwen3-1.7B + UnsolvableRL 2025.12		11	59	35
Qwen3-1.7B Instruct 2025.12		8	21	14.5
GPT-5.1-Low 2025.12		6	12	9
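
In every row above, the third score appears to be the arithmetic mean of the first two. The following is a minimal sketch that checks this, assuming the three numeric columns are read left to right as they appear in the table; the names `rows` and `mean_of_two` are hypothetical and exist only for this illustration.

```python
# Rows copied from the table above: (method, first score, second score, third score).
# Assumption: the third score is meant to be the mean of the first two.
rows = [
    ("Gemini-3", 98, 100, 99),
    ("Deepseek-V3.2-R", 70, 86, 78),
    ("Qwen3-4B + UnsolvableRL", 63.5, 94.5, 79),
    ("Qwen3-4B Instruct", 34.5, 6.5, 20.5),
    ("Qwen3-1.7B + UnsolvableRL", 11, 59, 35),
    ("Qwen3-1.7B Instruct", 8, 21, 14.5),
    ("GPT-5.1-Low", 6, 12, 9),
]


def mean_of_two(a, b):
    """Return the arithmetic mean of two scores."""
    return (a + b) / 2


# Collect any method whose third score is not the mean of its first two.
mismatches = [name for name, a, b, c in rows if mean_of_two(a, b) != c]
print(mismatches)  # → []
```

An empty list means all seven rows are consistent with that reading, so the third score rewards methods that do well on both of the first two scores rather than on only one.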