Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks

About

Text classification is fundamental to natural language processing (NLP), and graph neural networks (GNNs) have recently been applied to this task. However, existing graph-based methods can neither capture the contextual word relationships within each document nor learn inductively for new words. To overcome these problems, we propose TextING for inductive text classification via GNNs. We first build an individual graph for each document and then use a GNN to learn fine-grained word representations based on their local structures, which can also effectively produce embeddings for unseen words in a new document. Finally, the word nodes are aggregated into the document embedding. Extensive experiments on four benchmark datasets show that our method outperforms state-of-the-art text classification methods.
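
The per-document graph construction described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `build_document_graph`, the `window_size` parameter, and the choice to link words that co-occur within a fixed-size sliding window are all assumptions made for illustration. The point is that each document gets its own graph, so a word never seen during training simply becomes another node.

```python
from collections import defaultdict


def build_document_graph(tokens, window_size=3):
    """Build a word co-occurrence graph for a single document.

    Hypothetical helper: links each token to the tokens that follow it
    within `window_size` positions (an assumed construction rule).

    Returns:
        nodes: unique words in order of first appearance
        edges: dict mapping an alphabetically sorted word pair to its
               co-occurrence count
    """
    nodes = list(dict.fromkeys(tokens))  # de-duplicate, keep order
    edges = defaultdict(int)
    for i, u in enumerate(tokens):
        # Pair token i with the next (window_size - 1) tokens.
        for v in tokens[i + 1 : i + window_size]:
            if u != v:  # skip self-loops
                edges[tuple(sorted((u, v)))] += 1
    return nodes, dict(edges)


nodes, edges = build_document_graph("the movie was not good".split())
print(len(nodes), len(edges))  # prints "5 7"
```

Because the graph depends only on the tokens of one document, nothing global has to be rebuilt when a new document arrives; a GNN would then pass messages over these edges before the node vectors are pooled into a document embedding.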

Yufeng Zhang, Xueli Yu, Zeyu Cui, Shu Wu, Zhongzhen Wen, Liang Wang • 2020

Related benchmarks

Task	Dataset	Result
Text Classification	MR (test)	Accuracy 78.86	155
Text Classification	R8 (test)	Accuracy 98.14	56
Document Classification	Ohsumed (test)	Accuracy 70.44	54
Text Classification	movie review dataset (test)	Accuracy 56.84	35
Text Classification	R52 (test)	Accuracy 95.41	30
Short-text classification	Snippets (test)	Accuracy 65.2	23
Short-text classification	Shopping (test)	Accuracy 66.22	23
Short-text classification	KLUE YNAT 1.0 (test)	Accuracy 43.64	23
Text Classification	R8 small-scale (test)	Accuracy 98.04	11
Text Classification	R52 small-scale (test)	Accuracy 95.48	11

