Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BertGCN: Transductive Text Classification by Combining GCN and BERT

About

In this work, we propose BertGCN, a model that combines large scale pretraining and transductive learning for text classification. BertGCN constructs a heterogeneous graph over the dataset and represents documents as nodes using BERT representations. By jointly training the BERT and GCN modules within BertGCN, the proposed model is able to leverage the advantages of both worlds: large-scale pretraining which takes the advantage of the massive amount of raw data and transductive learning which jointly learns representations for both training data and unlabeled test data by propagating label influence through graph convolution. Experiments show that BertGCN achieves SOTA performances on a wide range of text classification datasets. Code is available at https://github.com/ZeroRin/BertGCN.

Yuxiao Lin, Yuxian Meng, Xiaofei Sun, Qinghong Han, Kun Kuang, Jiwei Li, Fei Wu• 2021

Related benchmarks

TaskDatasetResultRank
Text ClassificationMR
Accuracy89.7
174
Text ClassificationMR (test)
Accuracy89.7
155
Text ClassificationR8
Accuracy98.2
91
Text ClassificationR52
Accuracy96.6
76
Text ClassificationR8 (test)
Accuracy98.2
56
Document ClassificationOhsumed (test)
Accuracy72.8
54
Multi-label Text ClassificationHotel Reviews (HR) (test)
F-Measure86.55
44
Text ClassificationTweetEVAL (test)
Accuracy (A)80.43
44
Text Classificationohsumed
Accuracy72.8
33
Text ClassificationR52 (test)
Accuracy96.6
30
Showing 10 of 12 rows

Other info

Code

Follow for update