Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
About
Commonsense reasoning systems should be able to generalize to diverse reasoning cases. However, most state-of-the-art approaches depend on expensive data annotations and overfit to a specific benchmark without learning how to perform general semantic reasoning. To overcome these drawbacks, zero-shot QA systems have shown promise as a robust learning scheme by transforming a commonsense knowledge graph (KG) into synthetic QA-form samples for model training. Considering the increasing type of different commonsense KGs, this paper aims to extend the zero-shot transfer learning scenario into multiple-source settings, where different KGs can be utilized synergetically. Towards this goal, we propose to mitigate the loss of knowledge from the interference among the different knowledge sources, by developing a modular variant of the knowledge aggregation as a new zero-shot commonsense reasoning framework. Results on five commonsense reasoning benchmarks demonstrate the efficacy of our framework, improving the performance with multiple KGs.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Commonsense Reasoning | WinoGrande | Accuracy60.9 | 776 | |
| Physical Interaction Question Answering | PIQA | Accuracy73.1 | 323 | |
| Physical Commonsense Reasoning | PIQA (val) | Accuracy71.1 | 113 | |
| Social Interaction Question Answering | SIQA | Accuracy66.7 | 85 | |
| Abductive Natural Language Inference | aNLI (leaderboard) | Accuracy72.4 | 47 | |
| Commonsense Question Answering | SocialIQA (SIQA) (val) | Accuracy64.4 | 24 | |
| Commonsense Question Answering | CommonsenseQA (CSQA) (val) | Accuracy66.5 | 23 | |
| Commonsense Question Answering | Abductive NLI (aNLI) (val) | Accuracy0.713 | 21 | |
| Commonsense Question Answering | WinoGrande (WG) (val) | Accuracy60.3 | 21 |