| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Node Classification | Physics | Accuracy | 98.6 | 205 |
| Node Classification | Physics | Accuracy | 97.44 | 79 |
| Node Classification | Physics (test) | Median Test Accuracy | 95.56 | 48 |
| Node Classification | Physics | Overall F1 | 91.45 | 34 |
| Node Classification | Physics Co-authorship 5-way (test) | Accuracy | 88.92 | 33 |
| Node Classification | Physics semi-supervised | Accuracy | 94.87 | 30 |
| Community Detection | Physics | Avg Detected Communities | 185.8 | 27 |
| Node Classification | Physics | AUROC | 99.82 | 25 |
| Attributed Graph Clustering | Physics | NMI | 76 | 24 |
| Node Unlearning | Physics | Runtime (s) | 0.03 | 20 |
| Physics Reasoning | Physics C-Eval WebInstruct | C-Eval Score | 80.15 | 18 |
| Node Classification | Physics Homophilic (10 train/val/test) | Accuracy | 99.44 | 16 |
| Scientific Reasoning | Physics | Avg@16 (1h) | 69.5 | 16 |
| Link Prediction | Physics | AUC (%) | 98.79 | 15 |
| Science Question Answering | Physics | Accuracy | 44.1 | 13 |
| Node Classification Calibration | Physics | Brier Score | 6.12 | 12 |
| Node Classification Calibration | Physics | KDE-ECE | 0.82 | 12 |
| GNN Calibration | Physics | Negative Log-Likelihood (NLL) | 0.1187 | 12 |
| Node Classification | Physics | ECE | 0.42 | 12 |
| GNN Calibration | Physics | ECE | 0.42 | 12 |
| Link Prediction | Physics | Hits@50 | 76.46 | 11 |
| Physics-Scene Visual Reasoning | Physics | Accuracy | 54.35 | 10 |
| Link Prediction | Physics (test) | AP | 98.12 | 10 |
| Science Question Answering | Physics | Mean@16 | 81.6 | 9 |
| Structured Reasoning | PHYSICS | Score | 82.9 | 9 |