Reward Modeling (Logicality) on ReasonEdit-Reward 113K 1.0 (test)

0.8958 SRCC

RE-Reward

[Chart omitted; date shown: May 8, 2026]
Updated 23d ago

Evaluation Results

Date      SRCC     Metric 2  Metric 3
2026.05   0.8958   0.7315    0.9027
2026.05   0.8438   0.7136    0.8173
2026.05   0.7539   0.6528    0.7572
2026.05   0.7035   0.5902    0.7177
2026.05   0.5728   0.486     0.5956
2026.05   0.5386   0.4545    0.5913
2026.05   0.4097   0.3463    0.4787
2026.05   0.3473   0.2819    0.3197
2026.05   0.3384   0.2496    0.3258
2026.05   0.3318   0.2593    0.2843
2026.05   0.2749   0.2285    0.2269
2026.05   0.1697   0.144     0.148
2026.05   0.0784   0.0612    0.1798
2026.05   0.034    0.0238    0.0417
2026.05   0.0314   0.0268    0.0419
2026.05   0.0298   0.0209    0.0148
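
The headline metric here, SRCC, is the Spearman rank correlation coefficient: the Pearson correlation computed on the ranks of two score lists, so it rewards a model for ordering items the same way as the reference ratings regardless of scale. The sketch below shows one way to compute it in plain Python. The function names (average_ranks, pearson, srcc) and the sample data are hypothetical and are not taken from the benchmark's own evaluation code, which may differ in details such as tie handling.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of items tied with order[i].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Raises ZeroDivisionError if either sequence is constant.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def srcc(predicted, reference):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    if len(predicted) != len(reference) or len(predicted) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    return pearson(average_ranks(predicted), average_ranks(reference))


# Hypothetical example data, not drawn from the benchmark.
model_scores = [0.91, 0.35, 0.62, 0.10, 0.77]
human_ratings = [5, 2, 4, 1, 3]
print(round(srcc(model_scores, human_ratings), 4))  # prints 0.9
```

In the sample data only two items swap places between the two rankings, which yields 0.9. A perfectly reversed ordering gives -1.0 and an identical ordering gives 1.0.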