| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Long Context Reasoning | AA-LCR | Score 53.5 | 26 |
| Long-context Reasoning | AA-LCR | Accuracy 81 | 12 |
| General Task (Agentic Coding) | AA-LCR | Score 74 | 6 |
| Long Context Reasoning | AA-LCR | Accuracy 66.9 | 5 |
| Long Context Understanding | AA-LCR | Accuracy 68 | 5 |
| Long Context & Context Learning | AA-LCR | Pass@1 58.5 | 4 |