| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | QA Zero-shot Average | QA Zero-shot Average: 73.45 | 57 |
| Question Answering | QA | Speedup Factor: 3.66 | 47 |
| Legal Text Classification | QA | Accuracy: 85.72 | 18 |
| Question Answering | QA | Accuracy: 59.5 | 12 |
| Steering | QA | Steering Success: 62.5 | 11 |
| Question Answering | QA benchmarks | ReCoRD Score: 80.86 | 9 |
| Question Answering | QA domain average | Best Accuracy: 85.2 | 8 |
| Critique Quality Evaluation | QA | Win Rate: 75 | 6 |
| Question Answering | QA 12 languages | Score: 72.9 | 5 |
| Speculative Decoding | Qa | Speedup: 2.23 | 3 |