
Pile

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Membership Inference Attack | Pile CC | TPR @ 1%: 1.7 | 61 |
| Membership Inference Attack | Pile CC Pythia | ROC AUC: 71 | 36 |
| Conditional Generation | Pile | Perplexity: 11.2 | 18 |
| Language Modeling | MiniPile (val) | Validation Perplexity: 40.83 | 10 |
| Language Modeling | MiniPile (train) | Training Perplexity: 30.87 | 10 |
| Clutter removal | Pile single-view, random camera pose, Gaussian noise | GSR: 92 | 10 |
| Language Modeling | Pile uncopyrighted (test) | Worst Log-Perplexity: 3.608 | 9 |
| Language Modeling | Pile | Loss: 1.876 | 8 |
| Clutter removal | Pile Real-world | GSR (%): 79.3 | 7 |
| Membership Inference | PILE | Loss (AUROC): 50.9 | 7 |
| Membership Inference Attack | PILE (train) | Loss: 8.2 | 7 |
| Language Modeling | Pile (val) | Loss: 1.808 | 5 |
| Language Modeling | Pile | BPB: 0.74 | 4 |
| Language | Pile (test) | Accuracy: 59.4 | 3 |
| Text reconstruction | pile | PPL: 1.65 | 3 |
| Language Modeling | Pile Non-AR tokens | Perplexity: 33.95 | 3 |
| Language Modeling | Pile AR tokens | Perplexity: 3.07 | 3 |
| Language Modeling | pile (val) | Perplexity: 12.978 | 2 |