Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Optimizing Diversity and Quality through Base-Aligned Model Collaboration

About

Alignment has greatly improved large language models (LLMs)' output quality at the cost of diversity, yielding highly similar outputs across generations, especially in open-ended generation tasks. We propose Base-Aligned Model Collaboration (BACo), an inference-time token-level model collaboration framework that dynamically combines a base LLM with its aligned counterpart to optimize diversity and quality. Using uncertainty and content-based signals, BACo employs routing strategies to determine, at each token, which model to decode from. Prior diversity-promoting methods often improve diversity at the expense of quality or require expensive decoding or post-training. In contrast, BACo achieves both high diversity and quality post hoc within a single pass, while offering strong controllability. We introduce a family of effective routing strategies and evaluate them across three open-ended generation tasks with 13 diversity and quality metrics. BACo consistently surpasses state-of-the-art inference-time baselines. With our best router, BACo achieves a 21.3% joint improvement in diversity and quality, which is further supported by human evaluations. Overall, our results demonstrate that collaboration between base and aligned models provides an effective and controllable mechanism for optimizing the diversity-quality trade-off.

Yichen Wang, Chenghao Yang, Tenghao Huang, Muhao Chen, Jonathan May, Mina Lee• 2025

Related benchmarks

TaskDatasetResultRank
Instruction FollowingNOVELTYBENCH
Lexical Dominance31
7
Open-ended generationNoveltyBench, WildChat, and Narrative-Discourse Average (test)
Lexical Dominance24.9
7
Novelty EvaluationNOVELTYBENCH
Overall Dominance44
5
Creative WritingNarrative-Discourse
Lexical Coverage36.7
4
DialogueWildChat
Lexical Coverage47.3
4
Human EvaluationNOVELTYBENCH
Quality4.04
2
Human EvaluationWildChat
Quality Score3.83
2
Showing 7 of 7 rows

Other info

Follow for update