| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | NewsQA (dev) | F1 Score 75.5 | 101 |
| Question Answering | NewsQA (test) | F1 73.6 | 31 |
| Extractive Question Answering | NewsQA MRQA | F1 72.6 | 22 |
| Question Answering | NewsQA trained on SQuAD OOD (test) | F1 Score 52.41 | 20 |
| Extractive Question Answering | NewsQA | F1 Score 59.7 | 14 |
| Question Answering | NewsQA trained on CausalQA OOD (test) | F1 Score 9.54 | 10 |
| Sentence Selection | NewsQA (dev) | Accuracy 94.6 | 7 |
| Question Answering | NewsQA | Score 66.8 | 6 |
| Question Answering | NewsQA | Latency (s) 112.36 | 3 |
| Document Retrieval | NewsQA | Recall@1 93.3 | 2 |