
Large Model Performance Prediction on Paradigm RLHF pattern shift

9.55 RMSE

STAR

[Chart: axis ticks 9.406, 10.378, 11.35, 12.322; date Feb 12, 2026]
Updated 4d ago

Evaluation Results

Method    Links
2026.02   9.55    7.33    8.44    92.48   84.07   39.37   71.97   -
2026.02   10.14   7.94    9.04    91.88   83.26   36.35   70.5    -
2026.02   13.15   10.32   11.74   88.51   79.54   24.02   64.02   -