Advanced Reasoning on ruAIME 2024

Accuracy: 80

DeepSeek-R1

Dec 11, 2025
Updated 4d ago

Evaluation Results

Date      Accuracy
2025.12   80
2025.12   78.1
2025.12   70.6
2025.12   70.4
2025.12   57.5
2025.12   51
2025.12   31.9
2025.12   24.8
2025.12   10.2
2025.12   9
2025.12   6.2