Plug-and-Play Document Modules for Pre-trained Models

About

Large-scale pre-trained models (PTMs) have been widely used in document-oriented NLP tasks, such as question answering. However, the encoding-task coupling requirement results in the repeated encoding of the same documents for different tasks and queries, which is highly computationally inefficient. To this end, we target to decouple document encoding from downstream tasks, and propose to represent each document as a plug-and-play document module, i.e., a document plugin, for PTMs (PlugD). By inserting document plugins into the backbone PTM for downstream tasks, we can encode a document one time to handle multiple tasks, which is more efficient than conventional encoding-task coupling methods that simultaneously encode documents and input queries using task-specific encoders. Extensive experiments on 8 datasets of 4 typical NLP tasks show that PlugD enables models to encode documents once and for all across different scenarios. Especially, PlugD can save $69\%$ computational costs while achieving comparable performance to state-of-the-art encoding-task coupling methods. Additionally, we show that PlugD can serve as an effective post-processing way to inject knowledge into task-specific models, improving model performance without any additional model training.

Chaojun Xiao, Zhengyan Zhang, Xu Han, Chi-Min Chan, Yankai Lin, Zhiyuan Liu, Xiangyang Li, Zhonghua Li, Zhao Cao, Maosong Sun• 2023

Related benchmarks

Task	Dataset	Result
Fact Verification	FEVER	Accuracy0.8254	72
Question Answering	NQ	EM23.01	69
Knowledge-Intensive Language Tasks	KILT (test)	WoW F1 Score17.61	29
Dehazing	RESIDE	FID42.82	25
Relation Extraction	zsRE	Accuracy21.13	22
Desnowing	Realistic	FID35.01	17
Deraining	real (test)	FID52.89	17
Deblurring	RealBlur-J	FID63.16	17
Knowledge Grounded Dialogue	WoW	F1 Score16.58	15
Demoireing	LCDMoire	FID36.37	11

Showing 10 of 15 rows

Other info

Code

Follow for update

@wizwand_team Discord