| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Knowledge Graph Question Answering | CWQ | Hit@1 79.3 | 166 |
| Knowledge Graph Question Answering | CWQ (test) | Hits@1 76.9 | 100 |
| Multi-Hop Knowledge Graph Question Answering | CWQ | Hits@1 81.4 | 46 |
| Knowledge Base Question Answering | CWQ (test) | F1 Score 81.3 | 42 |
| Knowledge Base Question Answering | CWQ Freebase (test) | Hits@1 86 | 38 |
| Question Answering | CWQ | Accuracy 23.62 | 30 |
| Discriminative Evaluation | CWQ (test) | Binary Accuracy 92.88 | 24 |
| Knowledge Base Question Answering | CWQ | Answer F1 51.74 | 18 |
| Question Answering | CWQ | Hits@1 72.5 | 17 |
| Knowledge Base Completion | CWQ 50% KB | MRR 61.4 | 16 |
| Knowledge Base Completion | CWQ (30% KB) | MRR 58.8 | 16 |
| Knowledge Base Question Answering | CWQ 50% KB | Hits@1 50.8 | 12 |
| Knowledge Base Question Answering | CWQ 30% KB | Hits@1 50.2 | 12 |
| Multi-hop Reasoning | CWQ | Hits@1 82.2 | 10 |
| Knowledge Base Question Answering | CWQ (hidden test) | Accuracy 67.1 | 7 |
| Complex Question Answering | CWQ | Total Score 14.85 | 4 |
| Knowledge Base Question Answering | CWQ w/o KB | Hits@1 46.4 | 3 |