ReasonEdit-Reward

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Reward Modeling (Usefulness) | ReasonEdit-Reward-113K 1.0 (test) | SRCC 0.9428 | 16
Reward Modeling (Accuracy) | ReasonEdit-Reward-113K 1.0 (test) | SRCC 0.9323 | 16
Reward Modeling (Logicality) | ReasonEdit-Reward 113K 1.0 (test) | SRCC 0.8958 | 16
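
The results above are reported as SRCC. Assuming SRCC here denotes Spearman's rank correlation coefficient (the page does not spell it out), the following is a minimal sketch of how such a score could be computed between a reward model's predicted scores and reference scores. The function names `average_ranks` and `srcc` and the sample numbers are hypothetical and are not taken from the benchmark.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def srcc(predicted, reference):
    """Spearman correlation: Pearson correlation of the two rank vectors."""
    if len(predicted) != len(reference) or len(predicted) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    rp = average_ranks(predicted)
    rr = average_ranks(reference)
    n = len(rp)
    mean_p = sum(rp) / n
    mean_r = sum(rr) / n
    cov = sum((a - mean_p) * (b - mean_r) for a, b in zip(rp, rr))
    var_p = sum((a - mean_p) ** 2 for a in rp)
    var_r = sum((b - mean_r) ** 2 for b in rr)
    if var_p == 0 or var_r == 0:
        raise ValueError("SRCC is undefined when one input is constant")
    return cov / sqrt(var_p * var_r)


if __name__ == "__main__":
    # Hypothetical scores, for illustration only.
    model_scores = [0.2, 0.9, 0.5, 0.7]
    human_scores = [1, 5, 2, 4]
    print(srcc(model_scores, human_scores))
```

Because only the ordering of scores matters, a value near 1 would indicate that the model ranks items in nearly the same order as the reference, regardless of the scale of the raw scores.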