Reward Modeling (Usefulness) on ReasonEdit-Reward-113K 1.0 (test)

0.9428 SRCC

RE-Reward

[Chart: score over time; date shown: May 8, 2026]
Updated 23d ago

Evaluation Results

Date      SRCC     (unlabeled)  (unlabeled)
2026.05   0.9428   0.8015       0.9439
2026.05   0.8676   0.7364       0.8609
2026.05   0.866    0.7467       0.8657
2026.05   0.8371   0.7078       0.8417
2026.05   0.7884   0.6729       0.7916
2026.05   0.5393   0.4064       0.5422
2026.05   0.3906   0.3258       0.4037
2026.05   0.3487   0.2907       0.3617
2026.05   0.2841   0.241        0.2346
2026.05   0.1848   0.1531       0.1881
2026.05   0.1118   0.0747       0.1698
2026.05   0.0736   0.0614       0.0291
2026.05   0.0729   0.0501       0.0677
2026.05   0.0169   0.0112       0.0181
2026.05   0.0023   0.0019       0.003
2026.05   0        0            0