| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| XCOPA | PaLM 2 | Accuracy | 94.4 | 33 | 2d ago |
| COPA | LAT | Accuracy | 90 | 29 | 4d ago |
| CLadder 14 (original) | | NLL | 0.465 | 14 | 4d ago |
| e-CARE | SE-GPT | Accuracy | 86.9 | 14 | 4d ago |
| XCOPA (test) | PaLM 2 | Accuracy (id) | 97.2 | 13 | 4d ago |
| XCOPA | TokAlign + LAT | Accuracy (zh) | 55.5 | 12 | 4d ago |
| Copa100 | Our Trained Model | Accuracy | 83 | 12 | 4d ago |
| IndicCOPA IndicXTREME (test) | IFT | Average F1 Score | 45.45 | 10 | 4d ago |
| XCOPA ET | Llama-3.2-3B | Accuracy | 71.8 | 8 | 4d ago |
| CLadder 1.0 (test) | Human | Overall Acc | 94.8 | 7 | 4d ago |
| XCOPA | XGLM-7B5 | XCOPA ET Accuracy | 57 | 6 | 4d ago |
| CausalT5K (L2) | | Collapse Rate | 0.9 | 5 | 4d ago |
| CLadder | Llama3.1-8B-Instruct | Exact Match | 88 | 4 | 4d ago |
| XCOPA Thai | Transport and Merge | Accuracy | 60 | 3 | 4d ago |
| CausalProbe-E | GRPO | Accuracy | 80.5 | 3 | 4d ago |
| CLEAR | GPT-4 | Accuracy | 60.5 | 3 | 4d ago |
| CaLM Mathematical | GRPO | Accuracy | 93.5 | 3 | 4d ago |
| XCOPA Māori | - | Accuracy | - | 0 | 4d ago |