Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Math Accuracy on MATH 4-shot
Loading...
5.27
Accuracy
UM-190k
3.346
3.8455
4.345
4.8445
Nov 14, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
UM-190k
Training Dataset=UM-190k
2025.11
5.27
UM-187k
Training Dataset=UM-187k
2025.11
5.02
TuluDPO
Training Dataset=TuluDPO
2025.11
4.64
UM-170k
Training Dataset=UM-170k
2025.11
4.18
ORPO
Training Dataset=ORPO
2025.11
4.15
HelpSteer
Training Dataset=HelpS...
2025.11
4.15
UltraFB
Training Dataset=UltraFB
2025.11
4.08
SFT
Training Dataset=SFT
2025.11
3.47
CodePref
Training Dataset=CodePref
2025.11
3.42
Feedback
Search any
task
Search any
task