Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GITHUB

Benchmarks

Task NameDataset NameSOTA ResultTrend
Language ModelingGitHub (test)
Perplexity2.42
113
Membership Inference AttackGitHub Pythia
ROC AUC1
36
Membership InferenceGitHub Pythia (train)
TPR@1%FPR95.6
36
Membership Inference AttackGitHub
AUC0.876
26
Semi-supervised graph classificationGITHUB 10-fold cross-validation
Accuracy0.6996
21
Language ModelingGitHub (val)
Perplexity1.83
13
Language ModelingGitHub tokens (test)
Bits Per Token (BPT)0.976
11
Website NavigationGitHub (test)
Metric-
0
Showing 8 of 8 rows