| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Question Answering | WikiHop (test) | Accuracy74 | 32 | |
| Multi-hop Reading Comprehension | WikiHop unmasked (dev) | Accuracy66.4 | 11 | |
| Multi-hop Question Answering | WikiHop (dev) | Accuracy79.8 | 10 | |
| Multi-hop Reading Comprehension | WikiHop unmasked (test) | Accuracy70.6 | 9 | |
| Question Answering | WikiHop (dev) | Accuracy75.9 | 8 | |
| Reading Comprehension | Wikihop (dev) | Follow61.4 | 6 | |
| Multi-hop Question Answering | WikiHop masked (dev) | Accuracy72.1 | 3 | |
| Question Answering | WikiHop May 2020 (leaderboard) | F1 Score81.9 | 2 | |
| Reading Comprehension | Wikihop (test) | Overall Score59.3 | 2 | |
| Multi-hop Question Answering | WikiHop leaderboard official (test) | Accuracy0.8225 | 1 |