Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation

About

Conducting supervised and preference fine-tuning of large language models (LLMs) requires high-quality datasets to improve their ability to follow instructions and align with human preferences and values. However, constructing such datasets is resource-intensive, and most publicly available datasets are in English. To address these challenges, we propose the \underline{\textbf{Ta}}xonomy-Guided \underline{\textbf{P}}reference Data Generation (TaP) framework for automated, scalable preference dataset construction across languages. TaP uses a structured taxonomy to provide fine-grained control over dataset composition, ensuring diversity and broad coverage. We use TaP-generated datasets to perform supervised and preference fine-tuning on multiple LLMs. Experimental results demonstrate that LLMs trained on TaP-generated datasets outperform those trained on existing open-source datasets. Remarkably, LLMs trained on TaP-generated datasets outperform models trained on an open-source dataset that is 180$\times$ larger.

Renren Jin, Tianhao Shen, Xinwei Wu, Dan Shi, Haoran Sun, Yuqi Ren, Wuwei Huang, Quandong Wang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong• 2025

Related benchmarks

TaskDatasetResultRank
Dialogue Alignment EvaluationAlignBench
Reasoning6.76
90
Multi-turn Dialogue EvaluationMT-Bench-zh
Score6.34
90
Instruction FollowingAlignBench
Reasoning Score7.42
60
Instruction FollowingMT-Bench-zh
Score6.83
60
General LLM EvaluationAlignBench
Reasoning Score7.27
20
General LLM EvaluationMT-Bench-zh
Overall Score6.66
7
Showing 6 of 6 rows

Other info

Follow for update