Problem Solving and Unsolvability Detection on HamCycle

Best Solvable Accuracy: 100 (Gemini-3)

Updated 4mo ago

Evaluation Results

Method	Links	Solvable Accuracy	Unsolvable Accuracy	Average
Gemini-3 2025.12		100	98	99
Deepseek-V3.2-R 2025.12		83.5	94	88.8
Qwen3-4B + UnsolvableRL 2025.12		41.1	94.5	67.8
Qwen3-4B Instruct 2025.12		37.5	57	47.3
Qwen3-1.7B Instruct 2025.12		22.9	28	25.5
Qwen3-1.7B + UnsolvableRL 2025.12		20.8	88	54.4
GPT-5.1-Low 2025.12		8.3	82	45.1