Human Preference Prediction on DocPairBench

89.3 Gov. Preference Score

DocReward-7B

Oct 13, 2025
Updated 15d ago

Evaluation Results

Method  Links
2025.10  89.3  92.3  86.8  83.8  67  87  84.4  80.9  76.4  76.7  82.3
2025.10  87.2  76.9  80.2  75    69  85  87.2  79.1  74.6  77.3  80.6
2025.10  70.2  69.2  69.2  68.8  57  69  76.2  63.6  70    69.8  68.9
2025.10  69.7  80.8  70.3  71.3  49  72  75.2  60    64.6  66.6  67.7
2025.10  62.9  69.2  75.8  61.3  46  60  75.2  56.4  65.5  62.9  63
2025.10  61.9  57.7  75.8  66.3  53  62  71.6  61.8  66.4  61.4  63.3
2025.10  60.1  46.2  58.2  56.3  34  54  58.7  50.9  58.2  53.9  54.9
2025.10  59.8  50    56    60    48  63  48.6  50    50.9  49.7  54.2
2025.10  57.7  61.5  53.9  53.8  54  59  56    52.7  49.1  58.1  56.1
2025.10  55.1  61.5  48.4  50    50  47  56.9  47.3  55.5  60.5  54.4
2025.10  49.4  34.6  46.2  41.3  35  52  49.5  31.8  48.2  48.5  46
2025.10  38.4  23.1  36.3  30    25  27  32.1  39.1  38.2  30.2  33.5