| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| GLUE | Fast Post-Training Pruning Framework | SST-2 156 | 531 | 4d ago | |
| GLUE (dev) | XDELECTRA-l | SST-2 (Acc) 97.36 | 518 | 4d ago | |
| GLUE (test) | Z-Code++ | SST-2 Accuracy 97.9 | 416 | 1mo ago | |
| GLUE (val) | RoBERTa-Large + MUPPET | SST-2 97.4 | 191 | 18d ago | |
| SuperGLUE (dev) | | Average Score 93.2 | 91 | 1mo ago | |
| GLUE (test dev) | SL-SAM | MRPC Accuracy 93.45 | 87 | 1mo ago | |
| SuperGLUE | Vega v2 | SGLUE Score 91.3 | 84 | 1mo ago | |
| SuperGLUE (test) | ST-MoE-32B | BoolQ Accuracy 92.4 | 63 | 1mo ago | |
| GLUE (test val) | Full-FT | MRPC Accuracy 94 | 59 | 1mo ago | |
| GLUE | SIFT | SST-2 95.18 | 55 | 23d ago | |
| NLP Suite (BoolQ, RTE, HellaSwag, WinoG, ARC-E, ARC-C, OpenBookQA) zero-shot | | Average Accuracy 72.5 | 41 | 23d ago | |
| GLUE | VB-LoRAall | CoLA Score 69.3 | 41 | 1mo ago | |
| GLUE (test) | RING | MNLI-mm 98.6 | 39 | 1mo ago | |
| GLUE and SuperGLUE (test val) | SCALEARN UNIFORM | SST-2 95.7 | 37 | 1mo ago | |
| GLUE (test) | FedTT | SST-2 Accuracy 95.64 | 33 | 1mo ago | |
| SuperGLUE | ARMADA | CB Accuracy 94.5 | 32 | 1mo ago | |
| GLUE small | ARMADA | CoLA Mcc 73.4 | 32 | 1mo ago | |
| GLUE | DeeBERT-base | SST-2 Speedup 3.06 | 32 | 1mo ago | |
| GLUE v1 (dev) | AutoBERT-Zero* | MRPC Score 93.8 | 30 | 1mo ago | |
| GLUE | LoRA | SST-2 Acc 97.5 | 28 | 1mo ago | |
| NusaX | FLARE MT | Macro F1 81.37 | 28 | 1mo ago | |
| GLUE 1.0 (test) | | SST-2 (Acc) 97.8 | 28 | 1mo ago | |
| Snips (test) | BiLSTMs + ELMo | Intent Acc 99.29 | 27 | 1mo ago | |
| GLUE (test) | | QNLI Score 92.31 | 26 | 1mo ago | |
| GLUE | COIN | MNLI Accuracy 61.58 | 24 | 1mo ago | |