| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Language Modeling | GitHub (test) | Perplexity | 2.42 | 113 |
| Membership Inference Attack | GitHub Pythia | ROC AUC | 1 | 36 |
| Membership Inference | GitHub Pythia (train) | TPR@1%FPR | 95.6 | 36 |
| Membership Inference Attack | GitHub | AUC | 0.876 | 26 |
| Semi-supervised graph classification | GITHUB 10-fold cross-validation | Accuracy | 0.6996 | 21 |
| Language Modeling | GitHub (val) | Perplexity | 1.83 | 13 |
| Language Modeling | GitHub tokens (test) | Bits Per Token (BPT) | 0.976 | 11 |
| Website Navigation | GitHub (test) | - | - | 0 |