ANALOGYKB: Unlocking Analogical Reasoning of Language Models with A Million-scale Knowledge Base
About
Analogical reasoning is a fundamental cognitive ability of humans. However, current language models (LMs) still struggle to achieve human-like performance in analogical reasoning tasks due to a lack of resources for model training. In this work, we address this gap by proposing ANALOGYKB, a million-scale analogy knowledge base (KB) derived from existing knowledge graphs (KGs). ANALOGYKB identifies two types of analogies from the KGs: 1) analogies of the same relations, which can be directly extracted from the KGs, and 2) analogies of analogous relations, which are identified with a selection and filtering pipeline enabled by large language models (LLMs), followed by minor human efforts for data quality control. Evaluations on a series of datasets of two analogical reasoning tasks (analogy recognition and generation) demonstrate that ANALOGYKB successfully enables both smaller LMs and LLMs to gain better analogical reasoning capabilities.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Analogy Generation | E-KAR | Accuracy75 | 26 | |
| Analogy Generation | UNIT 4 | Accuracy92 | 22 | |
| Analogy Generation | SCAN (out-of-domain) | Accuracy15.3 | 15 | |
| Analogy recognition | BATS | Accuracy92.42 | 15 | |
| Analogy recognition | UNIT 2 | Accuracy88.32 | 15 | |
| Analogy recognition | UNIT 4 | Accuracy76.15 | 15 | |
| Analogy recognition | SAT | Accuracy60.78 | 15 | |
| Analogy recognition | Accuracy98.8 | 15 | ||
| Analogy Generation | SAT | Accuracy0.91 | 11 | |
| Analogy Generation | BATS | Accuracy96 | 11 |