| Dataset Name | SOTA Method | Metric | Trend | Updated |
|---|---|---|---|---|
| Language Tasks Suite (SST-2, RTE, CB, BoolQ, WSC, WIC, MultiRC, COPA, ReCoRD, SQuAD, DROP) OPT-13B (test) | AdaMeZO | SST-2 Accuracy: 92.7 | 6 | 1mo ago |
| LLaMA3-3B Language Tasks Suite (SST-2, RTE, CB, BoolQ, WSC, WIC, MultiRC, COPA, ReCoRD, SQuAD, DROP) | AdaMeZO | SST-2 Accuracy: 92.6 | 6 | 1mo ago |