| Dataset Name | SOTA Method | Metric | Trend | | |
|---|---|---|---|---|---|
| GLUE (dev) | XDELECTRA-l | SST-2 (Acc) 97.36 | 504 | 3d ago | |
| GLUE | Fast Post-Training Pruning Framework | SST-2 156 | 452 | 3d ago | |
| GLUE (test) | Z-Code++ | SST-2 Accuracy 97.9 | 416 | 3d ago | |
| GLUE (val) | RoBERTa-Large + MUPPET | SST-2 97.4 | 170 | 3d ago | |
| SuperGLUE (dev) | | Average Score 93.2 | 91 | 2d ago | |
| SuperGLUE | Vega v2 | SGLUE Score 91.3 | 84 | 3d ago | |
| GLUE (test dev) | SL-SAM | MRPC Accuracy 93.45 | 81 | 3d ago | |
| SuperGLUE (test) | ST-MoE-32B | BoolQ Accuracy 92.4 | 63 | 2d ago | |
| GLUE (test val) | Full-FT | MRPC Accuracy 94 | 59 | 3d ago | |
| GLUE | VB-LoRAall | CoLA Score 69.3 | 41 | 3d ago | |
| GLUE and SuperGLUE (test val) | SCALEARN UNIFORM | SST-2 95.7 | 37 | 3d ago | |
| GLUE (test) | FedTT | SST-2 Accuracy 95.64 | 33 | 3d ago | |
| GLUE | DeeBERT-base | SST-2 Speedup 3.06 | 32 | 3d ago | |
| GLUE v1 (dev) | AutoBERT-Zero* | MRPC Score 93.8 | 30 | 3d ago | |
| NLP Suite (BoolQ, RTE, HellaSwag, WinoG, ARC-E, ARC-C, OpenBookQA) zero-shot | | BoolQ Accuracy 81.25 | 28 | 3d ago | |
| GLUE | LoRA | SST-2 Acc 97.5 | 28 | 3d ago | |
| NusaX | FLARE MT | Macro F1 81.37 | 28 | 3d ago | |
| Snips (test) | BiLSTMs + ELMo | Intent Acc 99.29 | 27 | 3d ago | |
| GLUE (test) | | QNLI Score 92.31 | 26 | 3d ago | |
| GLUE (test) | RING | MNLI-mm 98.6 | 26 | 3d ago | |
| GLUE 1.0 (test) | | CoLA (MCC) 66.4 | 25 | 3d ago | |
| AGIEval | Llama 3 405B | Accuracy 71.6 | 24 | 3d ago | |
| GLUE RoBERTa LARGE (test dev) | LoRA | MNLI Accuracy 90.57 | 22 | 3d ago | |
| Ag_news, Subj, MR, Boolq, RTE (test) | DecoQuant | Ag_news Accuracy 71.6 | 22 | 3d ago | |
| ARC Easy | Arcana | Accuracy 78.3 | 20 | 3d ago | |