Mathematical Problem Solving on AIME'24 (test)

26.67 Average@16 Accuracy

PRM-CoT (Process-Aware)

Dec 2, 2025
Updated 4d ago

Evaluation Results

Method  Links
2025.12
26.67  30  40  50
2025.12
22.71  26.67  36.67  50
2025.12
18.12  26.67  40  46.67
2025.12
9.58  3.3  23.33  26.67
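
The page does not define Average@16 Accuracy. A common reading, assumed here, is the percentage of correct answers across 16 sampled responses per problem, pooled over all problems. The sketch below illustrates that arithmetic only; the function name `average_at_k` and the sample data are hypothetical, and the 30-problem set size is an assumption about AIME'24, not something stated on this page.

```python
from typing import Sequence


def average_at_k(correct: Sequence[Sequence[bool]]) -> float:
    """Return the percentage of correct samples over all problems.

    correct[i][j] is True when sampled answer j for problem i is right.
    This assumes Average@k means accuracy averaged over k samples per problem.
    """
    total = sum(len(row) for row in correct)
    if total == 0:
        raise ValueError("no samples to score")
    hits = sum(sum(row) for row in correct)  # True counts as 1
    return 100.0 * hits / total


# Hypothetical data: 30 problems with 16 samples each (480 answers).
# 16 problems get 8 of 16 samples right, the other 14 get none: 128 correct.
samples = [[j < 8 for j in range(16)] for _ in range(16)]
samples += [[False] * 16 for _ in range(14)]

print(round(average_at_k(samples), 2))  # 26.67
```

Under these assumptions, 128 correct answers out of 480 reproduces the 26.67 figure. Other distributions of correct samples would give the same score.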