CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search

About

In this paper, we study how open-source large language models (LLMs) can be effectively deployed for improving query rewriting in conversational search, especially for ambiguous queries. We introduce CHIQ, a two-step method that leverages the capabilities of LLMs to resolve ambiguities in the conversation history before query rewriting. This approach contrasts with prior studies that predominantly use closed-source LLMs to directly generate search queries from conversation history. We demonstrate on five well-established benchmarks that CHIQ leads to state-of-the-art results across most settings, showing highly competitive performances with systems leveraging closed-source LLMs. Our study provides a first step towards leveraging open-source LLMs in conversational search, as a competitive alternative to the prevailing reliance on commercial LLMs. Data, models, and source code will be publicly available upon acceptance at https://github.com/fengranMark/CHIQ.

Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie• 2024

Related benchmarks

Task	Dataset	Result
Conversational Retrieval	QReCC (test)	Recall@1070.8	43
Conversational Retrieval	TopiOCQA (test)	NDCG@323.5	26
Conversational Search	CAsT 20	MRR54	24
Conversational Search	CAsT 19	MRR73.3	24
Conversational Search	QReCC (test)	MRR54.3	16
Conversational Search	TopiOCQA (test)	MRR38	12
Conversational Search	TREC CAsT 2021	MRR62.9	8

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord