Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Process-level Reward Modeling on PROCESSBENCH MATH

6.1Error Rate

SPARE-Llama3-8B

4.42415.73727.0538.363Jun 18, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.06
6.191.611.4
2025.06
1689.227.1
2025.06
188229.5
2025.06
21.48033.8
2025.06
43.862.253.6
4890.162.6