| Dataset Name | SOTA Method | Metric | Value | Trend | Last Updated |
|---|---|---|---|---|---|
| XQuAD (test) | Alpaca-GPT4 + NAIT (MMLU) | Accuracy | 49.47 | 12 | 1mo ago |
| TydiQA (test) | Alpaca-GPT4 + NAIT (TydiQA) | Accuracy | 47.78 | 12 | 1mo ago |
| BELEBELE English Language | SDRRL | CES Score | 66.26 | 5 | 1mo ago |
| BELEBELE Target Language | SDRRL | CES Performance | 52.11 | 5 | 1mo ago |
| CMMLU | KEEL | Score | 72 | 2 | 1mo ago |