Chain-of-Skills: A Configurable Model for Open-domain Question Answering

About

The retrieval model is an indispensable component for real-world knowledge-intensive tasks, e.g., open-domain question answering (ODQA). As separate retrieval skills are annotated for different datasets, recent work focuses on customized methods, limiting the model transferability and scalability. In this work, we propose a modular retriever where individual modules correspond to key skills that can be reused across datasets. Our approach supports flexible skill configurations based on the target domain to boost performance. To mitigate task interference, we design a novel modularization parameterization inspired by sparse Transformer. We demonstrate that our model can benefit from self-supervised pretraining on Wikipedia and fine-tuning using multiple ODQA datasets, both in a multi-task fashion. Our approach outperforms recent self-supervised retrievers in zero-shot evaluations and achieves state-of-the-art fine-tuned retrieval performance on NQ, HotpotQA and OTT-QA.

Kaixin Ma, Hao Cheng, Yu Zhang, Xiaodong Liu, Eric Nyberg, Jianfeng Gao• 2023

Related benchmarks

Task	Dataset	Result
Question Answering	NQ (test)	EM Accuracy56.4	133
Question Answering	HotpotQA (dev)	Answer F181	43
Question Answering	HotpotQA (test)	Ans EM67.4	37
Open Table-and-Text Question Answering	OTT-QA 1.0 (dev)	EM56.9	27
Passage retrieval	SQuAD (test)	Top-100 Accuracy81.2	22
Open-Domain Question Answering Retrieval	EntityQuestions (test)	Accuracy@2076.3	15
Question Answering	OTT-QA (test)	EM54.9	14
Retrieval	NQ (test)	Top-20 Accuracy0.856	11
Passage retrieval	HotpotQA (dev)	Passage EM88.89	7
Multi-hop Question Answering	OTT-QA full (val)	EM56.9	5

Showing 10 of 13 rows

Other info

Code

Follow for update

@wizwand_team Discord