| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Many-shot in-context learning | Long-context benchmarks | ICL Performance (8k Context)74.2 | 21 | |
| Context Management | Long-context (test) | mTokens1 | 19 | |
| Average across tasks | Long-context benchmarks | Performance (8k Context)45.9 | 8 | |
| Long-Context Training | Long-Context (train) | Metric- | 0 |