Process Reward Modeling on PRMBENCH

Overall Score: 76.5

DeepSeek-R1 (DG-PRM)

Jul 23, 2025
Updated 1mo ago

Evaluation Results

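| Method | Date | Overall | NR | NCL | Simplicity Avg | ES | SC | DC | CI | Soundness Avg | PS | DR | MS | Sensitivity Avg |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DeepSeek-R1 (DG-PRM) | 2025.07 | 76.5 | 74.1 | 72.3 | 73.2 | 80.1 | 77.5 | 78.9 | 79.4 | 79 | 71 | 74.3 | 100 | 81.8 |
| | 2025.07 | 73.5 | 71.2 | 69.1 | 70.2 | 77.5 | 74.8 | 76.3 | 75.6 | 76.1 | 67.3 | 70.4 | 100 | 79.2 |
| | 2025.07 | 72.3 | 66.1 | 69 | 67.6 | 75.9 | 73.2 | 74.7 | 76.4 | 75.1 | 66.8 | 70.9 | 100 | 79.2 |
| | 2025.07 | 70 | 63.2 | 65.4 | 64.3 | 72.4 | 74.3 | 72.9 | 74.5 | 73.5 | 64.5 | 67.9 | 100 | 77.5 |
| | 2025.07 | 69.5 | 66 | 65.2 | 65.6 | 74.8 | 70.1 | 72.2 | 72.9 | 72.5 | 63.2 | 66.2 | 100 | 76.5 |
| | 2025.07 | 69 | 62 | 64.7 | 63.4 | 71.1 | 72.6 | 71.3 | 73.8 | 72.2 | 63.6 | 66.8 | 100 | 76.8 |
| | 2025.07 | 68.8 | 65.6 | 63.7 | 64.6 | 74.5 | 67.7 | 73.8 | 72.3 | 72.1 | 61.8 | 64.8 | 100 | 75.5 |
| | 2025.07 | 66.8 | 57 | 62.4 | 59.7 | 72 | 69.7 | 70.7 | 71.1 | 70.9 | 62.5 | 65.7 | 99.2 | 75.8 |
| | 2025.07 | 65.2 | 60.4 | 62.1 | 61.3 | 69.8 | 68.1 | 69.5 | 72.1 | 69.9 | 62.4 | 64.2 | 100 | 75.5 |
| | 2025.07 | 63.6 | 57.2 | 55.6 | 56.4 | 67.4 | 72.3 | 66.2 | 66.9 | 68.2 | 57.8 | 62.7 | 100 | 73.5 |
| | 2025.07 | 60.2 | 57.2 | 51.9 | 54.5 | 66.1 | 68.4 | 69.3 | 64.8 | 67.2 | 53.3 | 54.6 | 99.9 | 69.3 |
| | 2025.07 | 54.4 | 46.1 | 47.3 | 46.7 | 56.6 | 55.1 | 54.4 | 63.8 | 57.5 | 51.5 | 56.2 | 97.9 | 68.5 |
| | 2025.07 | 54.2 | 48.8 | 54 | 51.4 | 57 | 52.1 | 50.7 | 57.8 | 54.4 | 52.8 | 55.8 | 91.1 | 66.5 |
| | 2025.07 | 54.2 | 46.4 | 48.9 | 47.6 | 55.7 | 55 | 53.2 | 66.2 | 57.5 | 49 | 55.4 | 99.8 | 68.1 |
| | 2025.07 | 52.6 | 32.9 | 37.9 | 35.4 | 47.3 | 54.1 | 48.4 | 48 | 49.4 | 45.6 | 46.8 | 100 | 64.1 |
| | 2025.07 | 52 | 49.3 | 53.4 | 51.4 | 56.4 | 47.1 | 46.7 | 53.3 | 50.9 | 51 | 53.5 | 93.6 | 66 |
| | 2025.07 | 47 | 44 | 50.3 | 47.1 | 49.4 | 44.5 | 41.3 | 47.7 | 45.7 | 47.2 | 48.6 | 86.1 | 60.7 |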