Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on Math Domain
Loading...
66.45
Avg Accuracy
DVPO
58.078
60.2515
62.425
64.5985
Dec 3, 2025
Avg Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg Accuracy
DVPO
Training Domain=Math D...
2025.12
66.45
Dr.GRPO
Training Domain=Math D...
2025.12
61.64
Reinforce++
Training Domain=Math D...
2025.12
61.35
Robust Bellman
Training Domain=Math D...
2025.12
60.56
GRPO
Training Domain=Math D...
2025.12
59.26
Base
Training Domain=Math D...
2025.12
58.4
PPO
Training Domain=Math D...
2025.12
58.4
Feedback
Search any
task
Search any
task