| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Mathematical Reasoning | Mathematics out-of-domain (test) | Accuracy 75.9 | 26 |
| Mathematical Reasoning | Mathematics | Accuracy 85.9 | 24 |
| Mathematical Reasoning | MATHEMATICS | Accuracy 74.1 | 22 |
| Mathematical Reasoning | Mathematics | Pass@1 65.8 | 18 |
| Category Retrieval | Mathematics Amazon (test) | R@50 31.4 | 15 |
| Link Prediction | Mathematics | PREC@1 71.22 | 14 |
| Reranking | Mathematics | NDCG@5 47.1 | 14 |