| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Language Generation | Large-scale model pool 15 LLMs | Accuracy51.9 | 3 | |
| Math Reasoning | Large-scale model pool Math Reasoning 15 LLMs | Accuracy73.3 | 3 | |
| Logic Reasoning | Large-scale model pool Logic Reasoning 15 LLMs | Accuracy95.6 | 3 | |
| Reading & QA | Large-scale model pool Reading&QA 15 LLMs | Accuracy88 | 3 | |
| Language Understanding | Large-scale model pool Language Understanding | Accuracy84 | 3 |