Reward Modeling (Accuracy) on ReasonEdit-Reward-113K 1.0 (test)

0.9323 SRCC

RE-Reward

[Chart omitted; date shown: May 8, 2026]
Updated 23d ago

Evaluation Results

Date       SRCC      (unlabeled)   (unlabeled)
2026.05    0.9323    0.7781        0.9316
2026.05    0.8587    0.7209        0.8411
2026.05    0.8532    0.7284        0.8535
2026.05    0.8264    0.6943        0.8251
2026.05    0.7761    0.6598        0.7797
2026.05    0.4853    0.3612        0.5055
2026.05    0.4462    0.3525        0.3839
2026.05    0.4279    0.3544        0.432
2026.05    0.3947    0.3266        0.4175
2026.05    0.3835    0.3163        0.3894
2026.05    0.1548    0.1287        0.1538
2026.05    0.0752    0.0725        0.0453
2026.05    0.0687    0.0553        0.0929
2026.05    0.0655    0.0452        0.0624
2026.05    0.0497    0.041         0.043
2026.05    0.005     0.0029        0.0062