Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Targeted Failure Generation on ImageNet
Loading...
0.82
Misclassification Rate
Mimicry
0.8128
0.8614
0.91
0.9586
Jan 21, 2026
Misclassification Rate
Escape Ratio
Confidence Reduction
MS-SSIM
LPIPS
Diversity Score (T)
Trace Difference
Updated 4d ago
Evaluation Results
Method
Method
Links
Misclassification Rate
Escape Ratio
Confidence Reduction
MS-SSIM
LPIPS
Diversity Score (T)
Trace Difference
Mimicry
2026.01
0.82
0.52
-
-
-
0.147
197.21
HyNeA
2026.01
1
0
-
-
-
0.179
79.6
GIFTbench
2026.01
1
0.84
-
-
-
0.102
119.92
Feedback
Search any
task
Search any
task