| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| GLUE | Fast Post-Training Pruning Framework | SST-2156 | 551 | 21d ago | |
| GLUE (dev) | XDELECTRA-l | SST-2 (Acc)97.36 | 529 | 1d ago | |
| GLUE (test) | Z-Code++ | SST-2 Accuracy97.9 | 416 | 3mo ago | |
| GLUE (val) | RoBERTa-Large + MUPPET | SST-297.4 | 201 | 21d ago | |
| SuperGLUE (dev) | Average Score93.2 | 91 | 3mo ago | ||
| GLUE (test dev) | SL-SAM | MRPC Accuracy93.45 | 90 | 5d ago | |
| SuperGLUE | Vega v2 | SGLUE Score91.3 | 84 | 3mo ago | |
| SuperGLUE (test) | ST-MoE-32B | BoolQ Accuracy92.4 | 74 | 12d ago | |
| GLUE (test) | QNLI7,564.6 | 64 | 15d ago | ||
| GLUE (test val) | Full-FT | MRPC Accuracy94 | 59 | 3mo ago | |
| GLUE | SIFT | SST-295.18 | 55 | 2mo ago | |
| GLUE (test) | LoRA | QNLI94.9 | 47 | 1d ago | |
| NLP Suite (BoolQ, RTE, HellaSwag, WinoG, ARC-E, ARC-C, OpenBookQA) zero-shot | Average Accuracy72.5 | 41 | 2mo ago | ||
| GLUE | VB-LoRAall | COLA Score69.3 | 41 | 3mo ago | |
| GLUE | CERSA | SST-296 | 40 | 5d ago | |
| GLUE and SuperGLUE (test val) | SCALEARN UNIFORM | SST-295.7 | 37 | 3mo ago | |
| ARC Easy | Arcana | Accuracy78.3 | 36 | 13d ago | |
| HellaSwag | NLS | Accuracy85.6 | 35 | 13d ago | |
| ARC-c | mPLUG-Owl2 | Accuracy65.8 | 34 | 21d ago | |
| GLUE (test) | FedTT | SST-2 Accuracy95.64 | 33 | 3mo ago | |
| SuperGLUE | ARMADA | CB Accuracy94.5 | 32 | 2mo ago | |
| GLUE small | ARMADA | CoLA Mcc73.4 | 32 | 2mo ago | |
| GLUE | DeeBERT-base | SST-2 Speedup3.06 | 32 | 3mo ago | |
| GLUE (test) | DP-SelFT+LoRA | MNLI Score82.2 | 30 | 15d ago | |
| GLUE | Average GLUE Score100 | 30 | 1d ago |