Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks
About
Text classification is fundamental in natural language processing (NLP), and Graph Neural Networks (GNN) are recently applied in this task. However, the existing graph-based works can neither capture the contextual word relationships within each document nor fulfil the inductive learning of new words. In this work, to overcome such problems, we propose TextING for inductive text classification via GNN. We first build individual graphs for each document and then use GNN to learn the fine-grained word representations based on their local structures, which can also effectively produce embeddings for unseen words in the new document. Finally, the word nodes are aggregated as the document embedding. Extensive experiments on four benchmark datasets show that our method outperforms state-of-the-art text classification methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text Classification | MR (test) | Accuracy78.86 | 99 | |
| Text Classification | R8 (test) | Accuracy98.14 | 56 | |
| Document Classification | Ohsumed (test) | Accuracy70.44 | 54 | |
| Text Classification | R52 (test) | Accuracy95.41 | 30 | |
| Text Classification | R8 small-scale (test) | Accuracy98.04 | 11 | |
| Text Classification | R52 small-scale (test) | Accuracy95.48 | 11 | |
| Text Classification | Ohsumed small-scale (test) | Accuracy70.42 | 11 | |
| Text Classification | MR small-scale (test) | Accuracy79.82 | 10 | |
| In-hospital mortality prediction | MIMIC-III (Whole) | AUPRC34.5 | 10 | |
| In-hospital mortality prediction | MIMIC-III Hypertension | AUPRC36.63 | 10 |