An Efficient Memory-Augmented Transformer for Knowledge-Intensive NLP Tasks

About

Access to external knowledge is essential for many natural language processing tasks, such as question answering and dialogue. Existing methods often rely on a parametric model that stores knowledge in its parameters, or use a retrieval-augmented model that has access to an external knowledge source. Parametric and retrieval-augmented models have complementary strengths in terms of computational efficiency and predictive accuracy. To combine the strengths of both approaches, we propose the Efficient Memory-Augmented Transformer (EMAT), which encodes external knowledge into a key-value memory and exploits fast maximum inner product search for memory querying. We also introduce pre-training tasks that allow EMAT to encode informative key-value representations and to learn an implicit strategy for integrating multiple memory slots into the transformer. Experiments on various knowledge-intensive tasks, such as question answering and dialogue datasets, show that simply augmenting parametric models (T5-base) using our method produces more accurate results (e.g., 25.8 -> 44.3 EM on NQ) while retaining a high throughput (e.g., 1000 queries/s on NQ). Compared to retrieval-augmented models, EMAT runs substantially faster across the board and produces more accurate results on WoW and ELI5. Our code and datasets are available at https://github.com/uclnlp/EMAT.

Yuxiang Wu, Yu Zhao, Baotian Hu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel • 2022
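
The abstract describes querying a key-value memory with maximum inner product search (MIPS). The following is a minimal sketch of that lookup, not the authors' implementation: it uses exact brute-force search in NumPy, whereas a system at this scale would more likely use an approximate index. The function name `top_k_memory_lookup`, the array shapes, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def top_k_memory_lookup(query, keys, values, k=2):
    """Return indices, scores, and values of the k memory slots whose keys
    have the largest inner product with the query (exact brute-force MIPS).

    query:  (d,) vector
    keys:   (num_slots, d) matrix, one key per memory slot
    values: (num_slots, ...) array, one value per memory slot
    """
    scores = keys @ query                       # inner product with every key
    k = max(1, min(k, scores.shape[0]))         # clamp k to a valid range
    # argpartition selects the k largest scores without a full sort;
    # the selected k are then ordered by descending score.
    top = np.argpartition(-scores, k - 1)[:k]
    top = top[np.argsort(-scores[top])]
    return top, scores[top], values[top]


# Toy memory with four slots (hypothetical data).
keys = np.array([[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [-1.0, 0.0]])
values = np.arange(12).reshape(4, 3)
query = np.array([1.0, 0.5])

indices, scores, retrieved = top_k_memory_lookup(query, keys, values, k=2)
print(indices, scores, retrieved)
```

In this sketch the retrieved value vectors would be handed to the model. How EMAT integrates multiple retrieved slots into the transformer is learned during pre-training and is not shown here.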

Related benchmarks

Task                           | Dataset                         | Result        | Rank
Open-domain Question Answering | TriviaQA                        | EM 44.4       | 62
Long-form Question Answering   | ELI5 (test)                     | ROUGE-L 20.91 | 54
Open-domain Question Answering | NQ (Natural Questions)          | EM 44.3       | 33
Open-domain Question Answering | WebQuestions (WQ)               | EM 36.7       | 15
Open-domain dialogue           | Wizard-of-Wikipedia KILT (test) | F1 15.78      | 8
Long-form Question Answering   | KILT ELI5 (dev test)            | ROUGE-L 20.91 | 3
