Adversarial Detection on Wikipedia HiSPA word-shuffle benign adversarial (N=500 held-out)
[Chart: F1 Score. Top entry: SpectralGuard, F1 Score 65, Mar 12, 2026. Metrics available: F1 Score, False Positive Rate (FPR), Correlation (rho) Benign, Correlation (rho) Adversarial, Delta Correlation (rho).]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | F1 Score | False Positive Rate (FPR) | Correlation (rho) Benign | Correlation (rho) Adversarial | Delta Correlation (rho) |
| SpectralGuard | Model=Mamba-2.8B, Para... | 2026.03 | 65 | 72.4 | 0.8787 | 0.8778 | 0.0009 |
| SpectralGuard | Model=Mamba-130M, Para... | 2026.03 | 61.9 | 67.2 | 0.9086 | 0.909 | 0.0004 |
| SpectralGuard | Model=Mamba-1.4B, Para... | 2026.03 | 59.1 | 67.2 | 0.9205 | 0.9211 | 0.0006 |
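
The page does not define how Delta Correlation (rho) is computed. The reported values are consistent with the absolute difference between the benign and adversarial correlations, rounded to four decimals. The sketch below checks that reading against the three rows. The helper name `delta_rho` and the rounding precision are assumptions made for illustration, not part of the benchmark.

```python
# Values copied from the Evaluation Results rows:
# (model, rho benign, rho adversarial, reported delta rho)
ROWS = [
    ("Mamba-2.8B", 0.8787, 0.8778, 0.0009),
    ("Mamba-130M", 0.9086, 0.909, 0.0004),
    ("Mamba-1.4B", 0.9205, 0.9211, 0.0006),
]


def delta_rho(rho_benign: float, rho_adversarial: float, ndigits: int = 4) -> float:
    """Absolute gap between the two correlations (assumed definition),
    rounded to the number of decimals shown in the table."""
    return round(abs(rho_benign - rho_adversarial), ndigits)


if __name__ == "__main__":
    for model, benign, adversarial, reported in ROWS:
        computed = delta_rho(benign, adversarial)
        # Compare with a small tolerance rather than exact float equality.
        matches = abs(computed - reported) < 1e-9
        print(f"{model}: computed={computed:.4f} reported={reported:.4f} match={matches}")
```

Under this assumed definition the sign of the gap is discarded, so the column alone does not say whether the benign or the adversarial correlation is larger. That has to be read from the two correlation columns.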