| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| RTE (test) | QWEN3-14B | Accuracy76.53 | 26 | 4d ago | |
| RTE | Llama2-13B | Delta 126.24 | 24 | 4d ago | |
| RTE (Recognizing Textual Entailment) GLUE (val) | SVD-Based Selection | Accuracy66.06 | 18 | 4d ago | |
| RTE | ELSA | Accuracy83.13 | 16 | 4d ago | |
| RTE | ELSA | Total Communication Time ($10^3$ s)4.29 | 9 | 3d ago | |
| FewGLUE RTE few-shot (32 examples) (dev) | iPET (ALBERT) | Accuracy74 | 6 | 4d ago | |
| RTE | F1 Macro92.1 | 5 | 4d ago | ||
| FewGLUE RTE few-shot (32 examples) (test) | iPET (ALBERT) | Accuracy70.5 | 4 | 4d ago |