| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Long Context Reasoning | AA-LCR | Score48.3 | 8 | |
| General Task (Agentic Coding) | AA-LCR | Score74 | 6 | |
| Long Context Understanding | AA-LCR | Accuracy68 | 5 | |
| Long Context & Context Learning | AA-LCR | Pass@158.5 | 4 | |
| Long Context Reasoning | AA-LCR | Accuracy66.9 | 3 |