| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| Downstream Reasoning Suite (Arc-e, PIQA, Hellaswag, OpenBookQA, Winogrande, MMLU, BoolQ) | BHyT | ARC-e49.23 | 14 | 4d ago | |
| Macro-average (MMLU, MATH, GSM8K, BBH) | CISC | Cost Reduction46 | 8 | 3d ago | |
| Macro-average (MMLU, MATH, GSM8K, BBH) (test) | - | Cost Reduction- | 0 | 4d ago |