Share your thoughts, 1 month free Claude Pro on usSee more

Reward Modeling Evaluation on HelpSteer3 (test)

-5.89Score

DPO+Filter

Updated 2mo ago

Evaluation Results

Method	Links
DPO+Filter 2025.10		-5.89	73
DPO 2025.10		-6.56	67
DPO+Filter 2025.10		-6.83	68
DPO 2025.10		-6.91	67
Base 2025.10		-8.39	50