| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Long-context language understanding | L-Eval | Coursera58.28 | 26 | |
| Long-context language understanding | L-Eval (test) | Coursera58.28 | 26 | |
| Long-context Summarization | L-Eval Sum | QMS22.66 | 13 | |
| Long-context Question Answering | L-Eval QA | NQ80.73 | 13 | |
| Long-context evaluation | L-Eval | Close Score68.8 | 13 | |
| Closed-ended Task Evaluation | L-Eval closed-ended tasks | Coursera Score41.86 | 12 |