ThinkQE: Query Expansion via an Evolving Thinking Process
About
Effective query expansion for web search benefits from promoting both exploration and result diversity to capture multiple interpretations and facets of a query. While recent LLM-based methods have improved retrieval performance and demonstrate strong domain generalization without additional training, they often generate narrowly focused expansions that overlook these desiderata. We propose ThinkQE, a test-time query expansion framework addressing this limitation through two key components: a thinking-based expansion process that encourages deeper and comprehensive semantic exploration, and a corpus-interaction strategy that iteratively refines expansions using retrieval feedback from the corpus. Experiments on diverse web search benchmarks (DL19, DL20, and BRIGHT) show ThinkQE consistently outperforms prior approaches, including training-intensive dense retrievers and rerankers.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Question Answering | MMLU Med | Accuracy60.5 | 61 | |
| Information Retrieval | TREC DL20 | NDCG@1064.7 | 50 | |
| Information Retrieval | TREC-COVID | NDCG@1076.1 | 44 | |
| Medical Question Answering | BioASQ | Accuracy52.1 | 38 | |
| Information Retrieval | BRIGHT 1.0 (test) | nDCG@10 (Avg)36 | 35 | |
| Medical Question Answering | MedQA US | Accuracy52.2 | 18 | |
| Factoid-style retrieval | TREC DL19 | NDCG@1068.8 | 16 | |
| Information Retrieval | SciFact | -- | 15 |