| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| General Reasoning | AGI Eval English | Score90.1 | 32 | |
| General Intelligence | AGI Eval | AGI Eval Score40.2 | 24 | |
| Reasoning | AGI Eval EN | Accuracy89.4 | 15 | |
| Reasoning | AGI Eval | Avg@1 (AGI Eval Reasoning)73.4 | 12 | |
| Mathematical Reasoning | AGI-Eval Math | Overall Accuracy94.7 | 11 | |
| General Intelligence Evaluation | AGI-Eval | Accuracy61.2 | 10 | |
| General Intelligence Evaluation | AGI Eval English | Score92.2 | 8 | |
| Text-to-Image Generation | AGI-Eval text-to-image arena 6 | ELO Score0.4859 | 6 |