
Reasoning on ARC-c

90.36 Accuracy

Qwen3-8B

[Chart: accuracy over time, Mar 17, 2025 to Jan 29, 2026]
Updated 3d ago

Evaluation Results

Method    Links
2025.12   90.36   -       -
2025.12   89.76   -       -
2025.12   88.31   -       -
2025.12   86.86   -       -
2025.12   85.92   -       -
2025.12   85.58   -       -
2025.12   84.13   -       -
2025.12   82.85   -       -
2025.12   82.76   -       -
2025.03   50.4    -       -
2025.03   49.7    54.1    -
2025.03   49.4    -       -
2025.03   49.3    -       -
2025.03   47.7    -       -
2025.03   45      -       -
2025.03   43.9    47.6    -
2025.03   43.2    -       -
2025.03   42.9    -       -
2025.03   41      -       -
2025.03   37.7    -       -
2026.01   35.7    -       -
2025.03   35.1    -       -
2026.01   35.1    -       -
2026.01   34.8    -       -
2026.01   32.8    -       -
2026.01   32.3    -       -
2025.03   31.9    34.2    -
2025.03   31.4    35.5    -
2026.01   31.3    -       -
2025.03   26.4    28.8    -
2025.03   26.1    -       -
2025.03   22.5    -       -
2025.03   22.3    -       -
2025.03   22.1    -       -
2025.03   21.9    -       -
2025.03   21.5    -       -
2025.03   21.3    -       -
2025.03   21.3    -       -
2025.03   20.5    -       -
2025.03   20      -       -
2025.03   19.9    -       -
2025.03   17.2    -       -
2026.01   -       -       83.3
2026.01   -       -       88.3
2026.01   -       -       72.2
2026.01   -       -       89.3
2026.01   -       -       89.6
2026.01   -       -       79.9
2026.02   -       -       84
          -       -       90.5
          -       -       55.5
          -       -       82.5
          -       -       93.6
          -       -       93.5
          -       -       93.7
          -       -       94.4