Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on AIMO 2025 (reference set)
Loading...
20
Pass@1
RHO-LOSS
9.6
12.3
15
17.7
Jun 28, 2024
Pass@1
Cons@64
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Cons@64
RHO-LOSS
Backbone=DeepSeek-R1-D...
2024.06
20
30
ScaleBiO
Backbone=DeepSeek-R1-D...
2024.06
20
30
Uniform
Backbone=DeepSeek-R1-D...
2024.06
10
30
LESS
Backbone=DeepSeek-R1-D...
2024.06
10
30
Feedback
Search any
task
Search any
task