Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Plug-and-Play Document Modules for Pre-trained Models

About

Large-scale pre-trained models (PTMs) have been widely used in document-oriented NLP tasks, such as question answering. However, the encoding-task coupling requirement results in the repeated encoding of the same documents for different tasks and queries, which is highly computationally inefficient. To this end, we target to decouple document encoding from downstream tasks, and propose to represent each document as a plug-and-play document module, i.e., a document plugin, for PTMs (PlugD). By inserting document plugins into the backbone PTM for downstream tasks, we can encode a document one time to handle multiple tasks, which is more efficient than conventional encoding-task coupling methods that simultaneously encode documents and input queries using task-specific encoders. Extensive experiments on 8 datasets of 4 typical NLP tasks show that PlugD enables models to encode documents once and for all across different scenarios. Especially, PlugD can save $69\%$ computational costs while achieving comparable performance to state-of-the-art encoding-task coupling methods. Additionally, we show that PlugD can serve as an effective post-processing way to inject knowledge into task-specific models, improving model performance without any additional model training.

Chaojun Xiao, Zhengyan Zhang, Xu Han, Chi-Min Chan, Yankai Lin, Zhiyuan Liu, Xiangyang Li, Zhonghua Li, Zhao Cao, Maosong Sun• 2023

Related benchmarks

TaskDatasetResultRank
Fact VerificationFEVER
Accuracy0.8254
67
Question AnsweringNQ
EM23.01
57
Knowledge-Intensive Language TasksKILT (test)
WoW F1 Score17.61
29
DehazingRESIDE
FID42.82
25
DesnowingRealistic
FID35.01
17
Derainingreal (test)
FID52.89
17
DeblurringRealBlur-J
FID63.16
17
Knowledge Grounded DialogueWoW
F1 Score16.58
15
DemoireingLCDMoire
FID36.37
11
Highlight RemovalSHIQ
FID33.09
11
Showing 10 of 15 rows

Other info

Code

Follow for update