| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Node Classification | Pubmed | Accuracy 97.46 | 742 |
| Node classification | Pubmed (test) | Accuracy 90.63 | 500 |
| Node Classification | PubMed | Accuracy 92.48 | 307 |
| Link Prediction | PubMed | AUC 99.2 | 123 |
| Summarization | PubMed (test) | ROUGE-1 61.99 | 107 |
| Transductive Node Classification | Pubmed (transductive) | Accuracy 91 | 95 |
| Node Classification | Pubmed standard (test) | Accuracy 85.2 | 92 |
| Summarization | PubMed | ROUGE-1 50.23 | 70 |
| Link Prediction | Pubmed (test) | AUC 98.3 | 65 |
| Clustering | Pubmed | Accuracy 82.86 | 61 |
| Semi-supervised node classification | Pubmed | Accuracy 88.48 | 60 |
| LGT Detection | PubMed Fast-DetectGPT benchmark | AUROC 0.97 | 54 |
| Text Classification | PubMed | micro-F1 89.93 | 50 |
| Node Classification | Pubmed full-supervised | Accuracy 90.3 | 48 |
| Node Classification | PubMed semi-supervised | Accuracy 82.98 | 42 |
| Node Classification | PubMed 0.1% labels | Accuracy 76.5 | 37 |
| Node Classification | PubMed 0.03% labels | Accuracy 71.1 | 37 |
| Node Classification | PubMed (random) | Accuracy 83.8 | 37 |
| Membership Inference Attack | PubMed Pythia | ROC AUC 97 | 36 |
| Confidence Calibration | Pubmed | ECE 0.0308 | 36 |
| Node Classification | PubMed 0.05% labels | Accuracy 73.2 | 36 |
| Node Classification | PubMed standard (fixed split) | Accuracy 80.4 | 33 |
| Next Token Prediction | PubMed | Next Token Accuracy 42.33 | 32 |
| Extractive Summarization | PubMed (test) | ROUGE-1 61.49 | 32 |
| Graph Classification | PubMed | Accuracy 91.67 | 31 |