Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
About
Commonsense reasoning systems should be able to generalize to diverse reasoning cases. However, most state-of-the-art approaches depend on expensive data annotations and overfit to a specific benchmark without learning how to perform general semantic reasoning. To overcome these drawbacks, zero-shot QA systems have shown promise as a robust learning scheme by transforming a commonsense knowledge graph (KG) into synthetic QA-form samples for model training. Considering the increasing type of different commonsense KGs, this paper aims to extend the zero-shot transfer learning scenario into multiple-source settings, where different KGs can be utilized synergetically. Towards this goal, we propose to mitigate the loss of knowledge from the interference among the different knowledge sources, by developing a modular variant of the knowledge aggregation as a new zero-shot commonsense reasoning framework. Results on five commonsense reasoning benchmarks demonstrate the efficacy of our framework, improving the performance with multiple KGs.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Commonsense Reasoning | WinoGrande | Accuracy60.9 | 1085 | |
| Physical Commonsense Reasoning | PIQA | Accuracy72.9 | 572 | |
| Physical Interaction Question Answering | PIQA | Accuracy73.1 | 333 | |
| Physical Commonsense Reasoning | PIQA (val) | Accuracy71.1 | 116 | |
| Social Interaction Question Answering | SIQA | Accuracy66.7 | 109 | |
| Social Commonsense Reasoning | SIQA | Accuracy66.6 | 89 | |
| Commonsense Question Answering | CSQA | Accuracy68.2 | 58 | |
| Abductive Commonsense Reasoning | ANLI (test) | Accuracy72.5 | 53 | |
| Abductive Natural Language Inference | aNLI (leaderboard) | Accuracy72.4 | 47 | |
| Common Sense Reasoning | WG | Accuracy60.8 | 38 |