One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control
About
Industrial anomaly detection (AD) is characterized by an abundance of normal images but a scarcity of anomalous ones. Although numerous few-shot anomaly synthesis methods have been proposed to augment anomalous data for downstream AD tasks, most existing approaches require time-consuming training and struggle to learn distributions that are faithful to real anomalies, thereby restricting the efficacy of AD models trained on such data. To address these limitations, we propose a training-free few-shot anomaly generation method, namely O2MAG, which leverages the self-attention in One reference anomalous image to synthesize More realistic anomalies, supporting effective downstream anomaly detection. Specifically, O2MAG manipulates three parallel diffusion processes via self-attention grafting and incorporates the anomaly mask to mitigate foreground-background query confusion, synthesizing text-guided anomalies that closely adhere to real anomalous distributions. To bridge the semantic gap between the encoded anomaly text prompts and the true anomaly semantics, Anomaly-Guided Optimization is further introduced to align the synthesis process with the target anomalous distribution, steering the generation toward realistic and text-consistent anomalies. Moreover, to mitigate faint anomaly synthesis inside anomaly masks, Dual-Attention Enhancement is adopted during generation to reinforce both self- and cross-attention on masked regions. Extensive experiments validate the effectiveness of O2MAG, demonstrating its superior performance over prior state-of-the-art methods on downstream AD tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Anomaly Localization | MVTec AD | Pixel AUROC99.9 | 513 | |
| Anomaly Detection | MVTec-AD (test) | I-AUROC99.6 | 327 | |
| Anomaly Generation | MVTec AD | KID1.48 | 85 | |
| Anomaly Detection | MVTec AD | AP-I100 | 80 | |
| Image-level Anomaly Detection | VisA (test) | AP (Image)99 | 75 | |
| Anomaly Detection and Localization | VisA (test) | P-AUROC99.9 | 70 | |
| Anomaly Generation | VisA | IC-LPIPS12 | 62 | |
| Anomaly Classification | MVTec AD | Accuracy (bottle)86.05 | 10 | |
| Pixel-level Anomaly Detection and Localization | MVTec AD | AUC-P99.2 | 5 | |
| Anomaly Detection and Localization | Real-IAD (test) | AUROC (Instance)79.3 | 3 |