| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multilingual Question Answering | TydiQA | Accuracy 81.9 | 44 |
| Question Answering | TyDiQA | Exact Match 52.14 | 28 |
| Question Answering | TyDiQA GoldP | F1 Score 89.4 | 20 |
| Multilinguality | TydiQA | F1 Score 70.8 | 16 |
| Question Answering | TyDiQA GoldP (test) | F1 Score 87.7 | 12 |
| Performance Prediction | TyDiQA | MAE 4.29 | 9 |
| Zero-shot performance prediction | TyDiQA | MAE 3.42 | 9 |
| Hallucination Detection | TyDiQA-GP | AUC ROC 0.9404 | 8 |
| Question Answering | TyDiQA | Score 53.56 | 6 |
| Question Answering | TyDiQA (test) | Average Score 72.4 | 4 |
| Multilingual Question Answering | TyDiQA GoldP (val) | Ar Score 80 | 4 |