| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| HellaSwag | Falcon-180B | Accuracy87.5 | 133 | 2d ago | |
| COPA | T5(3B) + PE w/ ROE (ORC.) | Accuracy92.88 | 48 | 4d ago | |
| IndoCulture native prompts (test) | Qwen2.5-7B-IT | Avg Sentence Similarity42 | 18 | 4d ago | |
| ArabCulture native prompts (test) | Qwen2.5-7B-IT | Avg Sentence Similarity44.7 | 18 | 4d ago | |
| ArabCulture | Qwen2.5-7B-IT (+CCKG N-Asrt) | Sentence Similarity Score42.5 | 18 | 4d ago | |
| IndoCulture | Qwen2.5-7B-IT (+CCKG N-Path) | Sentence Similarity Score43 | 18 | 4d ago | |
| HellaSwag (test) | Coherence Boosting (GPT-3 175B) | Accuracy72.35 | 15 | 4d ago | |
| Reuters (test) | ARC-II | P@149.62 | 8 | 4d ago | |
| KORANI Sentence Completion | mT-En-CI | Kobest Copa80.8 | 5 | 4d ago | |
| P3 | COPA Accuracy85.3 | 5 | 4d ago | ||
| HellaSwag 0-shot | Accuracy79.3 | 4 | 4d ago | ||
| Hellaswag | Flexora | Time (h)4.71 | 2 | 4d ago |