| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Algorithmic Reasoning | MOST RELIABLE PATH 100 nodes | Key Accuracy579 | 6 | |
| Algorithmic Reasoning | MOST RELIABLE PATH 50 nodes | Key Identification Accuracy3.04 | 6 | |
| Algorithmic Reasoning | MOST RELIABLE PATH 20 nodes | Key Accuracy17.3 | 6 |