Process-level Reward Modeling on PROCESSBENCH Omni-MATH

2.8 Error Rate

SPARE-Llama3-8B

1.729.0116.323.59 Jun 18, 2025
Updated 1mo ago

Evaluation Results

Method  Links
2025.06  2.8  82.2  5.4
2025.06  10.9  51.9  16.9
2025.06  14  41.9  21
2025.06  14  83.8  23.9
2025.06  14.2  73  23.8
29.8  86.1  44.3