| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| 5 Datasets Zero-shot | QuEPT | Average Accuracy72.87 | 33 | 4d ago | |
| Aggregated MMLU, BoolQ, OpenBookQA, RTE | Mixtral-8x22B | Average Accuracy70.4 | 22 | 4d ago | |
| English lm-evaluation-harness | Transformer + Spelling Bee Embeddings | AGIEval Acc (Norm)0.259 | 2 | 4d ago |