| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| MATH_MIX (test) | Agent-GWO | Accuracy88 | 24 | 1mo ago | |
| CLUTRR (test) | Agent-GWO | Accuracy76.4 | 24 | 1mo ago | |
| Date (test) | Agent-GWO | Accuracy84.5 | 24 | 1mo ago | |
| MMLU (test) | Agent-GWO | Accuracy75.3 | 24 | 1mo ago | |
| AQUA (test) | Agent-GWO | Accuracy78.5 | 24 | 1mo ago |