| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Node Classification | Pubmed | Accuracy97.46 | 819 | |
| Node classification | Pubmed (test) | Accuracy91.18 | 546 | |
| Node Classification | PubMed | Accuracy94.65 | 396 | |
| Node Classification | Pubmed | Accuracy96.9 | 178 | |
| Link Prediction | PubMed | AUC99.2 | 128 | |
| Summarization | PubMed (test) | ROUGE-161.99 | 107 | |
| Graph Classification | PubMed | Accuracy91.67 | 101 | |
| Transductive Node Classification | Pubmed (transductive) | Accuracy91 | 95 | |
| Node Classification | Pubmed standard (test) | Accuracy85.2 | 92 | |
| Node Classification | Pubmed transductive (test) | Accuracy87.23 | 81 | |
| Summarization | PubMed | ROUGE-150.23 | 70 | |
| Link Prediction | Pubmed (test) | AUC98.3 | 65 | |
| Clustering | Pubmed | Accuracy82.86 | 61 | |
| Semi-supervised node classification | Pubmed | Accuracy88.48 | 60 | |
| LGT Detection | PubMed Fast-DetectGPT benchmark | AUROC0.97 | 54 | |
| Text Classification | PubMed | micro-F189.93 | 50 | |
| Node Classification | Pubmed full-supervised | Accuracy90.3 | 48 | |
| Node Classification | PubMed (test) | Accuracy89.08 | 47 | |
| Node Classification | PubMed semi-supervised | Accuracy82.98 | 42 | |
| Next Token Prediction | PubMed | Next Token Accuracy42.33 | 40 | |
| Language Modeling | PubMed | Perplexity6.52 | 38 | |
| Node Classification | PubMed 0.1% labels | Accuracy76.5 | 37 | |
| Node Classification | PubMed 0.03% labels | Accuracy71.1 | 37 | |
| Node Classification | PubMed (random) | Accuracy83.8 | 37 | |
| Membership Inference Attack | PubMed Pythia | ROC AUC97 | 36 |