| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Understanding | General Understanding Tasks ARC-E, BoolQ, Wino., PIQA, HellaSwag, TruthfulQA, OBQA, LogiQA | ARC-E Accuracy64.1 | 8 | |
| General Understanding | General Understanding Tasks Chinese (ZH) Translated | Accuracy45.92 | 3 | |
| General Understanding | General Understanding Tasks Japanese (JA) Translated | Average Accuracy44 | 3 | |
| General Understanding | General Understanding Tasks French (FR) Translated | Avg Accuracy47.52 | 3 | |
| General Understanding | General Understanding Tasks German (DE) Translated | Accuracy47.16 | 3 | |
| General Understanding | General Understanding Tasks English (EN) Translated | Avg Accuracy61.07 | 2 |