STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths
About
Taxonomies are important knowledge ontologies that underpin numerous applications on a daily basis, but many taxonomies used in practice suffer from the low coverage issue. We study the taxonomy expansion problem, which aims to expand existing taxonomies with new concept terms. We propose a self-supervised taxonomy expansion model named STEAM, which leverages natural supervision in the existing taxonomy for expansion. To generate natural self-supervision signals, STEAM samples mini-paths from the existing taxonomy, and formulates a node attachment prediction task between anchor mini-paths and query terms. To solve the node attachment task, it learns feature representations for query-anchor pairs from multiple views and performs multi-view co-training for prediction. Extensive experiments show that STEAM outperforms state-of-the-art methods for taxonomy expansion by 11.6\% in accuracy and 7.0\% in mean reciprocal rank on three public benchmarks. The implementation of STEAM can be found at \url{https://github.com/yueyu1030/STEAM}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Taxonomy Expansion | SemEval Sci 2016 (test) | Accuracy36.5 | 23 | |
| Taxonomy Expansion | SemEval Env 2016 (test) | Accuracy36.1 | 23 | |
| Taxonomy Expansion | SemEval Food 2016 (test) | Accuracy31.8 | 15 | |
| Taxonomy Expansion | WordNet (test) | Accuracy21.4 | 15 | |
| Taxonomy Expansion | Science (SCI) SemEval-2016 Task 13 | Chi-Squared31.7 | 10 | |
| Taxonomy Expansion | SemEval-2016 Task 13 Environment | Mean Rank (MR)27.1 | 9 | |
| Taxonomy Expansion | Food SemEval-2015 Task 17 | Mean Rank (MR)155.9 | 9 | |
| Taxonomy Expansion | Medical Subject Headings (MeSH) | MR372.6 | 9 | |
| Taxonomy Expansion | WordNet sub-taxonomies | MR (Mean Rank)61.1 | 9 |