Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Unsupervised OOD detection on The Pile ID Long-COVID OOD (test)
Loading...
91.79
AUROC
AP-OOD
69.7732
75.4891
81.205
86.9209
Feb 5, 2026
AUROC
FPR95
Updated 4d ago
Evaluation Results
Method
Method
Links
AUROC
FPR95
AP-OOD
Base Model=Pythia-160M...
2026.02
91.79
40.34
Perplexity
Base Model=Pythia-160M...
2026.02
89.51
68.6
Mahalanobis
Base Model=Pythia-160M...
2026.02
75.5
98.07
Deep SVDD
Base Model=Pythia-160M...
2026.02
72.54
99.52
KNN
Base Model=Pythia-160M...
2026.02
70.62
99.03
Feedback
Search any
task
Search any
task