TaxoEnrich: Self-Supervised Taxonomy Completion via Structure-Semantic Representations
About
Taxonomies are fundamental to many real-world applications in various domains, serving as structural representations of knowledge. To deal with the increasing volume of new concepts needed to be organized as taxonomies, researchers turn to automatically completion of an existing taxonomy with new concepts. In this paper, we propose TaxoEnrich, a new taxonomy completion framework, which effectively leverages both semantic features and structural information in the existing taxonomy and offers a better representation of candidate position to boost the performance of taxonomy completion. Specifically, TaxoEnrich consists of four components: (1) taxonomy-contextualized embedding which incorporates both semantic meanings of concept and taxonomic relations based on powerful pretrained language models; (2) a taxonomy-aware sequential encoder which learns candidate position representations by encoding the structural information of taxonomy; (3) a query-aware sibling encoder which adaptively aggregates candidate siblings to augment candidate position representations based on their importance to the query-position matching; (4) a query-position matching model which extends existing work with our new candidate position representations. Extensive experiments on four large real-world datasets from different domains show that \TaxoEnrich achieves the best performance among all evaluation metrics and outperforms previous state-of-the-art methods by a large margin.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Taxonomy Expansion | Science (SCI) SemEval-2016 Task 13 | Chi-Squared22.1 | 10 | |
| Taxonomy Expansion | SemEval-2016 Task 13 Environment | Mean Rank (MR)17.2 | 9 | |
| Taxonomy Expansion | Food SemEval-2015 Task 17 | Mean Rank (MR)101.7 | 9 | |
| Taxonomy Expansion | Medical Subject Headings (MeSH) | MR247.7 | 9 | |
| Taxonomy Expansion | WordNet sub-taxonomies | MR (Mean Rank)54.3 | 9 | |
| Taxonomy Enrichment | MAG-CS | Scaled MRR57.8 | 8 | |
| Taxonomy Enrichment | MAG-PSY | Scaled MRR0.583 | 8 | |
| Taxonomy completion | SemEval-Food 16 (test) | MR305 | 6 | |
| Taxonomy Enrichment | WordNet Noun | Scaled MRR0.442 | 6 | |
| Taxonomy Enrichment | WordNet Verb | Scaled MRR45.2 | 6 |