Share your thoughts, 1 month free Claude Pro on usSee more

Problem Solving and Unsolvability Detection on HamPath

100Solvable Accuracy

Gemini-3

Updated 4mo ago

Evaluation Results

Method	Links
Gemini-3 2025.12		100	90	95
Deepseek-V3.2-R 2025.12		84	96	90
Qwen3-4B + UnsolvableRL 2025.12		55	96.5	75.8
GPT-5.1-Low 2025.12		50	86	68
Qwen3-4B Instruct 2025.12		37	56.5	46.8
Qwen3-1.7B + UnsolvableRL 2025.12		27	85	56
Qwen3-1.7B Instruct 2025.12		14	33	23.5