Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Process Reward Modeling on PRMBENCH

81.7Simplicity Avg Score

Skywork-PRM-7B

33.54846.04958.5571.051Jul 23, 2025Sep 6, 2025Oct 21, 2025Dec 5, 2025Jan 19, 2026Mar 5, 2026Apr 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
81.7-------56.6---90.189.240.965.1
2025.07
73.276.574.172.380.177.578.979.4797174.310081.8---
2025.07
70.273.571.269.177.574.876.375.676.167.370.410079.2---
2025.07
67.672.366.16975.973.274.776.475.166.870.910079.2---
2025.07
65.669.56665.274.870.172.272.972.563.266.210076.5---
2025.07
64.668.865.663.774.567.773.872.372.161.864.810075.5---
2025.07
64.37063.265.472.474.372.974.573.564.567.910077.5---
2025.07
63.4696264.771.172.671.373.872.263.666.810076.8---
2025.07
61.365.260.462.169.868.169.572.169.962.464.210075.5---
2025.07
59.766.85762.47269.770.771.170.962.565.799.275.8---
2025.07
56.463.657.255.667.472.366.266.968.257.862.710073.5---
2026.04
55.3-------72.3---60.290.944.267.5
2025.07
54.560.257.251.966.168.469.364.867.253.354.699.969.3---
2026.04
52.3-------63.8---56.586.835.361
52.1-------71---75.591.539.465.5
2025.07
51.45249.353.456.447.146.753.350.95153.593.666---
2025.07
51.454.248.8545752.150.757.854.452.855.891.166.5---
2026.04
48.5-------55.1---51.981.526.554
2025.07
47.654.246.448.955.75553.266.257.54955.499.868.1---
2025.07
47.1474450.349.444.541.347.745.747.248.686.160.7---
2025.07
46.754.446.147.356.655.154.463.857.551.556.297.968.5---
2025.07
35.452.632.937.947.354.148.44849.445.646.810064.1---