| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| StrategyQA | UnifiedQA-3b | Accuracy83.4 | 16 | 4d ago | |
| CIKQA | UnifiedQA-3b | Accuracy66.9 | 16 | 4d ago | |
| e-SNLI | UnifiedQA-3b | Accuracy89.6 | 16 | 4d ago | |
| AGNews | UnifiedQA-3b | Accuracy84.5 | 16 | 4d ago | |
| 14 standard NLP tasks suite (held-out) | OPT 175B | StoryCloze86.9 | 8 | 4d ago |