Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Misclassification Detection benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Misclassification Detection
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
CIFAR-10
OpenMix
AUROC
94.81
28
4d ago
SMS Spam class
UQ-D
AUROC
0.982
27
4d ago
CIFAR-100
ODIN
AURC
167.53
27
4d ago
SMS Non-Spam class
UQ-D
AUROC
0.9369
26
4d ago
MNIST (test)
Softmax Baseline
AUROC
0.97
13
4d ago
Fashion-MNIST (test)
MC-Dropout
ROC-AUC
0.818
12
4d ago
CIFAR-100 (test)
D_alpha
AUROC
88.2
11
4d ago
CIFAR-10 (test)
D_alpha
AUROC
95.2
11
4d ago
SVHN (test)
D_alpha
AUROC (%)
93
10
4d ago
Tiny ImageNet (test)
D_alpha
AUROC
86.1
10
4d ago
CIFAR-100-C 1.0 (test)
OpenMix
AUROC
84.05
9
4d ago
CIFAR-10-C 1.0 (test)
OpenMix
AUROC
90.38
9
4d ago
DMNIST
F-EDL
AUPR (Conf)
96.17
8
4d ago
ImageNet-1K (val)
sample difficulty-aware entropy regularization
FPR@95% (MSP)
45.69
7
4d ago
CIFAR-10-LT (ρ = 0.1)
F-EDL
AUPR (Confidence)
97.6
6
4d ago
CIFAR-10-LT (ρ = 0.01)
F-EDL
AUPR (Confidence)
85.99
6
4d ago
IMDB (test)
D_alpha
AUROC
0.844
3
4d ago
AMAZON SOFTWARE (test)
D_alpha
AUROC
0.688
3
4d ago
AMAZON FASHION (test)
D_alpha
AUROC (%)
89.7
3
4d ago
CIFAR100
D_alpha
AUROC
88.2
2
4d ago
CIFAR10
D_alpha
AUROC
95.2
2
4d ago
Twitter POS-annotated tweets (test)
Softmax Baseline
AUROC
89
2
4d ago
WSJ (Penn Treebank) (test)
Softmax Baseline
AUROC
0.96
2
4d ago
Reuters 40 subset of 52 (test)
Softmax Baseline
AUROC
91
1
4d ago
Reuters 6 subset of Reuters 8 (test)
Softmax Baseline
AUROC
89
1
4d ago
Showing 25 of 26 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Terms of Service
FAQs