Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Process-level Reward Modeling on PROCESSBENCH Olymp.Bench

3.3Error

SPARE-Llama3-8B

2.00410.75219.528.248Jun 18, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.06
3.387.66.4
2025.06
10.15116.9
2025.06
11.18519.6
2025.06
1571.124.8
2025.06
17.931.922.9
35.787.350.7