Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Advanced Mathematical Reasoning on Math500 (pass@1, 256 tokens)
Loading...
41.2
Pass@1 Accuracy
d-TreeRPO
29.656
32.653
35.65
38.647
Dec 10, 2025
Pass@1 Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1 Accuracy
d-TreeRPO
Base Model=LLaDA-MoE-7...
2025.12
41.2
TraceRL
Base Model=LLaDA-MoE-7...
2025.12
40
wd1
Base Model=LLaDA-MoE-7...
2025.12
39.8
SAPO
Base Model=LLaDA-MoE-7...
2025.12
38.6
GDPO
Base Model=LLaDA-MoE-7...
2025.12
38.4
Diffu-GRPO
Base Model=LLaDA-MoE-7...
2025.12
38.1
d-TreeRPO
Base Model=LLaDA-8B-In...
2025.12
37.7
GDPO
Base Model=LLaDA-8B-In...
2025.12
37
VRPO (LLaDA-1.5)
Base Model=LLaDA-8B-In...
2025.12
35.6
TraceRL
Base Model=LLaDA-8B-In...
2025.12
35.6
wd1
Base Model=LLaDA-8B-In...
2025.12
34.4
Diffu-GRPO
Base Model=LLaDA-8B-In...
2025.12
34.1
SAPO
Base Model=LLaDA-8B-In...
2025.12
33.8
LLaDA-8B-Instruct
Base Model=LLaDA-8B-In...
2025.12
32.4
LLaDA-MoE-7BA1B-Instruct
Base Model=LLaDA-MoE-7...
2025.12
30.1
Feedback
Search any
task
Search any
task