Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Math Reasoning on PRM800K
Loading...
0.613
AUC-ROC
Supervised
0.41228
0.46439
0.5165
0.56861
Apr 7, 2024
AUC-ROC
Updated 4d ago
Evaluation Results
Method
Method
Links
AUC-ROC
Supervised
2024.04
0.613
FRACTAL
2024.04
0.597
BagLoss
2024.04
0.569
Resp-level
2024.04
0.528
cos-sim
2024.04
0.42
Feedback
Search any
task
Search any
task