Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs

About

Query-based open-domain NLP tasks require information synthesis from long and diverse web results. Current approaches extractively select portions of web text as input to Sequence-to-Sequence models using methods such as TF-IDF ranking. We propose constructing a local graph structured knowledge base for each query, which compresses the web search information and reduces redundancy. We show that by linearizing the graph into a structured input sequence, models can encode the graph representations within a standard Sequence-to-Sequence setting. For two generative tasks with very long text input, long-form question answering and multi-document summarization, feeding graph representations as input can achieve better performance than using retrieved text portions.

Angela Fan, Claire Gardent, Chloe Braud, Antoine Bordes• 2019

Related benchmarks

Task	Dataset	Result
Long-form Question Answering	ELI5 (test)	ROUGE-L24	54
Lead Paragraph Generation	WikiSum CommonCrawl static (test)	ROUGE-L36.5	8
Long-form Question Answering	ELI5 standard original	RL Score24	5

Showing 3 of 3 rows

Other info

Code

Follow for update

@wizwand_team Discord