| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Reading Comprehension | QuAC | F1 Score | 74.4 | 28 |
| Conversational Question Answering | QuAC | F1 Score | 67.7 | 9 |
| Conversational Machine Comprehension | QuAC (test) | F1 Score | 80.8 | 8 |
| Conversational Retrieval | QuAC | Top-1 Recall | 56.8 | 7 |
| Conversational Question Answering | QuAC (test) | F1 Score | 66.1 | 7 |
| Reading Comprehension | QuAC | Accuracy | 53.6 | 6 |
| Reading Comprehension | QuAC (dev) | F1 Score | 44.3 | 6 |
| Conversational Retrieval | OR-QuAC (test) | NDCG@3 | 43.5 | 4 |
| Conversational Question Answering | QuAC (test dev) | F1 Score | 44.3 | 2 |
| Dialogue Quality Evaluation | QuAC | BF1 (qt, at) | 0.43 | 1 |