Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Aggregated Logical Reasoning on Overall Mean

76.2Accuracy

Deepseek-V3.2-R

8.91226.38143.8561.319Dec 1, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
76.2
2025.12
71.5
2025.12
69.9
2025.12
34.9
2025.12
23.2
2025.12
13.8
2025.12
11.5