HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages
About
Preference datasets are essential for training general-domain, instruction-following language models with Reinforcement Learning from Human Feedback (RLHF). Each new data release raises expectations for future collections, so there is a constant need to advance the quality and diversity of openly available preference data. To address this need, we introduce HelpSteer3-Preference, a permissively licensed (CC-BY-4.0), high-quality, human-annotated preference dataset comprising over 40,000 samples. These samples span diverse real-world applications of large language models (LLMs), including STEM, coding, and multilingual tasks. Using HelpSteer3-Preference, we train Reward Models (RMs) that achieve top performance on RM-Bench (82.4%) and JudgeBench (73.7%), a substantial improvement (~10% absolute) over the previously best-reported results from existing RMs. We also demonstrate that HelpSteer3-Preference can be used to train Generative RMs, and show how policy models can be aligned with RLHF using our RMs.

Dataset (CC-BY-4.0): https://huggingface.co/datasets/nvidia/HelpSteer3#preference
Models (NVIDIA Open Model): https://huggingface.co/collections/nvidia/reward-models-68377c5955575f71fcc7a2a3
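For readers who want to try the data directly, below is a minimal sketch of loading the preference subset with the Hugging Face `datasets` library, together with the standard Bradley-Terry pairwise loss commonly used to train reward models on such comparisons. The config name "preference" (suggested by the dataset URL's #preference anchor), the field layout, and this particular loss are illustrative assumptions, not confirmed details of this release; consult the dataset card and paper for the exact schema and training objective.

```python
# Minimal sketch, assuming the preference subset is exposed as a config
# named "preference" on the Hugging Face Hub. Field names are not assumed:
# we print the keys to inspect whatever schema the release actually ships.
from datasets import load_dataset

import torch
import torch.nn.functional as F

ds = load_dataset("nvidia/HelpSteer3", "preference")
print(ds["train"][0].keys())  # inspect the available fields


def bradley_terry_loss(r_chosen: torch.Tensor, r_rejected: torch.Tensor) -> torch.Tensor:
    """Generic pairwise preference objective: -log sigmoid(r_chosen - r_rejected).

    A standard Bradley-Terry sketch for turning preference pairs into a reward
    model training signal; not necessarily the exact objective used in this work.
    """
    return -F.logsigmoid(r_chosen - r_rejected).mean()
```

In a typical setup, `r_chosen` and `r_rejected` would be scalar scores from a reward model applied to the preferred and dispreferred responses to the same prompt, batched over the dataset's comparison pairs.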
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Reward Modeling | JudgeBench (test) | Overall | 77.2 | 40 |
| Reward Modeling | RM-Bench (test) | Overall Score | 82.7 | 39 |
| Ranking LLM solutions | EXPERTMATH 1.0 | HumanWin | 25.71 | 12 |
| Reward Model Evaluation | Arena-Hard RU | Best@8 Score | 85.93 | 5 |