
Unified-Feedback

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Reward Modeling | Unified Feedback (UF) | Accuracy: 78.9 | 40 |
| Reward Modeling | Unified-Feedback (ID) | Accuracy: 73.9 | 8 |
| Reward Modeling | Unified-Feedback ID (test) | Reward Score: 71.5 | 8 |
| Win Rate Evaluation | Unified-Feedback (test) | Win Rate: 0.73 | 2 |