Aggregated Logical Reasoning on Overall Solvable

67.3 Accuracy

Deepseek-V3.2-R

Dec 1, 2025
Updated 4d ago

Evaluation Results

Date       Accuracy
2025.12    67.3
2025.12    54.3
2025.12    38.5
2025.12    17.4
2025.12    11.8
2025.12    4.9
2025.12    3.3