| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| Word Sorting | SELF-THOUGHT | Acc@t11 | | 24 | 4d ago |
| Algorithmic Reasoning Suite Unseen Length (test) | sin/cos (Randomized) | Even Pairs | 100 | 11 | 4d ago |
| CLRS | MPNN | BFS Success Rate | 99.8 | 9 | 4d ago |
| Big-Bench Hard Word Sorting and Multi-step Arithmetic (test) | StrategyLLM | WS Accuracy | 80 | 7 | 2d ago |
| MOST RELIABLE PATH 100 nodes | NE++ | Key Accuracy | 579 | 6 | 3d ago |
| MOST RELIABLE PATH 50 nodes | NE++ | Key Identification Accuracy | 3.04 | 6 | 3d ago |
| MOST RELIABLE PATH 20 nodes | NE | Key Accuracy | 17.3 | 6 | 3d ago |
| BELLMAN-FORD 100 nodes | NE | Key Value | 1,980,000 | 6 | 3d ago |
| BELLMAN-FORD 50 nodes | NE | Key Path Identification Count | 59 | 6 | 3d ago |
| BELLMAN-FORD 20 nodes | NE++ | Key Value | 0.0025 | 6 | 3d ago |
| CLRS-30 n=64 (test) | FloydNet | Sort Accuracy | 100 | 6 | 4d ago |
| Algorithmic Tasks Length Generalization, l=41-120 1.0 (test) | minGRU | PC | 0.07 | 5 | 4d ago |
| CLRS-30 (test) | Hint-ReLIC | Kruskal MST Accuracy | 96.01 | 5 | 4d ago |
| 86 Algorithmic Reasoning Tasks (overall) | PRIME | Average Accuracy | 93.8 | 2 | 4d ago |