| Dataset Name | SOTA Method | Metric | Value | Trend | Last Updated |
|---|---|---|---|---|---|
| GLUE CoLA, MRPC, RTE, SST-2 | Task arithmetic | Absolute Accuracy | 75.9 | 60 | 1mo ago |
| 8 Vision tasks (test) | | Accuracy | 89 | 33 | 16d ago |
| 7 NLP tasks (test) | | Accuracy | 79.2 | 22 | 16d ago |
| LLM Evaluation Suite | KARCHER | Normalized Score | 0.401 | 12 | 1mo ago |