
Skywork-Reward-Preference

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Reward Modeling | Skywork-Reward-Preference (test) | Final Token Accuracy: 93.6 | 27