Investigating Pretrained Language Models for Graph-to-Text Generation
About
Graph-to-text generation aims to generate fluent texts from graph-based data. In this paper, we investigate two recently proposed pretrained language models (PLMs) and analyze the impact of different task-adaptive pretraining strategies for PLMs in graph-to-text generation. We present a study across three graph domains: meaning representations, Wikipedia knowledge graphs (KGs) and scientific KGs. We show that the PLMs BART and T5 achieve new state-of-the-art results and that task-adaptive pretraining strategies improve their performance even further. In particular, we report new state-of-the-art BLEU scores of 49.72 on LDC2017T10, 59.70 on WebNLG, and 25.66 on AGENDA datasets - a relative improvement of 31.8%, 4.5%, and 42.4%, respectively. In an extensive analysis, we identify possible reasons for the PLMs' success on graph-to-text tasks. We find evidence that their knowledge about true facts helps them perform well even when the input graph representation is reduced to a simple bag of node and edge labels.
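The general recipe studied in the paper is to linearize the input graph into a token sequence and fine-tune a pretrained seq2seq model on (graph, text) pairs. Below is a minimal sketch of that idea, assuming the Hugging Face `transformers` library; the `<H>/<R>/<T>` linearization markers, the `t5-small` checkpoint, the task prefix, and the example triples are illustrative choices, not the paper's exact configuration.

```python
# Minimal sketch: fine-tuning a PLM on linearized graph input (illustrative, not the paper's exact setup).
from transformers import T5ForConditionalGeneration, T5Tokenizer

model_name = "t5-small"  # the paper fine-tunes larger BART/T5 checkpoints
tokenizer = T5Tokenizer.from_pretrained(model_name)
model = T5ForConditionalGeneration.from_pretrained(model_name)

# Assumed special tokens marking head, relation, and tail of each triple.
tokenizer.add_tokens(["<H>", "<R>", "<T>"])
model.resize_token_embeddings(len(tokenizer))

def linearize_triples(triples):
    """Flatten a KG into a plain sequence of node and edge labels."""
    return " ".join(f"<H> {h} <R> {r} <T> {t}" for h, r, t in triples)

# Hypothetical WebNLG-style input graph and reference text.
triples = [("Alan Bean", "occupation", "Test pilot"),
           ("Alan Bean", "was a crew member of", "Apollo 12")]
source = "translate graph to text: " + linearize_triples(triples)
target = "Alan Bean, a test pilot, was a crew member of Apollo 12."

# One supervised step: cross-entropy loss against the reference text.
inputs = tokenizer(source, return_tensors="pt")
labels = tokenizer(target, return_tensors="pt").input_ids
loss = model(**inputs, labels=labels).loss
loss.backward()

# Generation from the linearized graph.
output_ids = model.generate(**inputs, max_length=64, num_beams=4)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

In the same spirit, the paper's "bag of node and edge labels" ablation corresponds to dropping the graph structure from the linearization and feeding the model only the shuffled labels, which is what makes the PLMs' reliance on factual knowledge visible.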
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| AMR-to-text generation | LDC2017T10 (test) | BLEU | 49.72 | 55 |
| KG-to-text generation | AGENDA (test) | chrF++ | 51.63 | 19 |
| AMR-to-text generation | LDC2017T10 AMR17 (test) | chrF++ | 74.79 | 17 |
| Graph-to-text generation | AGENDA (test) | BLEU | 25.66 | 15 |
| Graph-to-text generation | WebNLG Seen v1.0 (test) | BLEU | 65.05 | 12 |
| Graph-to-text generation | WebNLG All v1.0 (test) | BLEU | 59.70 | 11 |
| Graph-to-text generation | WebNLG Unseen v1.0 (test) | BLEU | 53.67 | 10 |
| Graph-to-text generation | WebNLG Seen 2017 (test) | BLEU | 64.71 | 7 |
| Graph-to-text generation | WebNLG Unseen 2017 (test) | BLEU | 53.67 | 7 |
| Graph-to-text generation | WebNLG All 2017 (test) | BLEU | 59.70 | 7 |