| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Question Answering | MultifieldQA | F1 Score: 57.2 | 52 |
| Long Context Question Answering | MultiFieldQA | Accuracy: 57.33 | 15 |
| Speculative Decoding | MultiFieldQA | Speculative Rate (SR): 2.1 | 12 |
| Long-context Question Answering | MultifieldQA | C Score: 88.6 | 9 |
| Long-context answering with citations | MultifieldQA | Citation Recall: 79 | 9 |
| Question Answering | MultiFieldQA | Rel. Perf vs Truncated ICL: 1.041 | 5 |