The Wisdom of Many Queries: Complexity-Diversity Principle for Dense Retriever Training

About

Synthetic query generation has become essential for training dense retrievers, yet prior methods generate one query per document, focusing solely on query quality. We are the first to systematically study multi-query synthesis and discover a quality-diversity trade-off: high-quality queries benefit in-domain tasks, while diverse queries benefit out-of-domain (OOD) generalization. Through controlled experiments on 4 benchmark types across Contriever, RetroMAE, and Qwen3-Embedding, we find that diversity benefit strongly correlates with query complexity (r ≥ 0.95, p < 0.05), approximated by content words (CW). We formalize this as the Complexity-Diversity Principle (CDP): query complexity determines optimal diversity. Based on CDP, we propose complexity-aware training: multi-query synthesis for high-complexity tasks and CW-weighted training for existing data. Both strategies improve OOD performance on reasoning-intensive benchmarks, with compounded gains when combined.
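As an illustration of the content-word (CW) proxy and CW-weighted training mentioned in the abstract, the following is a minimal sketch rather than the authors' implementation. The abstract does not specify how content words are identified or how weights are derived, so this sketch makes two assumptions: the function-word list `FUNCTION_WORDS` is a small hypothetical stand-in (a POS tagger or a fuller stopword list would be a more likely choice in practice), and the mean normalization in `cw_weights` is one plausible weighting scheme among many.

```python
import re

# Hypothetical, deliberately small function-word list (an assumption for
# illustration; not taken from the paper).
FUNCTION_WORDS = frozenset(
    """a an the of in on at to for from by with and or but if is are was were
    be been do does did what which who whom how why when where that this
    these those it its as not""".split()
)


def content_word_count(query: str) -> int:
    """Count tokens in `query` that are not in the function-word list."""
    tokens = re.findall(r"[a-z0-9]+(?:'[a-z]+)?", query.lower())
    return sum(1 for token in tokens if token not in FUNCTION_WORDS)


def cw_weights(queries: list[str]) -> list[float]:
    """Per-query loss weights proportional to CW count, normalized to mean 1.

    Assumed scheme: weight_i = cw_i / mean(cw). Falls back to uniform
    weights when no query contains a content word.
    """
    counts = [content_word_count(q) for q in queries]
    if not counts:
        return []
    mean = sum(counts) / len(counts)
    if mean == 0:
        return [1.0] * len(counts)
    return [count / mean for count in counts]


if __name__ == "__main__":
    examples = [
        "What is the capital of France?",
        "How does protein folding affect enzyme activity",
    ]
    for query, weight in zip(examples, cw_weights(examples)):
        print(f"{content_word_count(query)} CW, weight {weight:.2f}: {query}")
```

Under this scheme, the weights could multiply each example's contrastive loss so that queries with more content words contribute more to the gradient, while the mean-1 normalization keeps the overall loss scale unchanged.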

Xincan Feng, Noriki Nishida, Yusuke Sakai, Yuji Matsumoto • 2026

Related benchmarks

Task	Dataset	Result
Information Retrieval	BEIR (test)	--	130
Retrieval	TREC-DL aggregate (test)	NDCG@10: 54	38
Retrieval	BRIGHT 12 datasets aggregate (test)	NDCG@10: 9.5	20
Multi-hop Retrieval	Multi-hop 4 datasets aggregate (test)	NDCG@10: 58.5	8
