Segment Any Anomaly without Training via Hybrid Prompt Regularization
About
We present a novel framework, i.e., Segment Any Anomaly + (SAA+), for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models. Existing anomaly segmentation models typically rely on domain-specific fine-tuning, limiting their generalization across countless anomaly patterns. In this work, inspired by the great zero-shot generalization ability of foundation models like Segment Anything, we first explore their assembly to leverage diverse multi-modal prior knowledge for anomaly localization. For non-parameter foundation model adaptation to anomaly segmentation, we further introduce hybrid prompts derived from domain expert knowledge and target image context as regularization. Our proposed SAA+ model achieves state-of-the-art performance on several anomaly segmentation benchmarks, including VisA, MVTec-AD, MTD, and KSDD2, in the zero-shot setting. We will release the code at \href{https://github.com/caoyunkang/Segment-Any-Anomaly}{https://github.com/caoyunkang/Segment-Any-Anomaly}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Anomaly Localization | MVTec AD | Pixel AUROC75.5 | 369 | |
| Anomaly Detection | VisA | AUROC76.5 | 199 | |
| Anomaly Detection | KSDD | AUROC0.686 | 40 | |
| Image-level Anomaly Detection | BTAD | AUROC59 | 39 | |
| Anomaly Segmentation | MVTec AD | -- | 33 | |
| Pixel-level Anomaly Detection | BTAD | AUROC65.8 | 30 | |
| Anomaly Detection | DTD | AUROC94.4 | 28 | |
| Image-level Anomaly Detection | DAGM | AUROC87.1 | 28 | |
| Image-level Anomaly Detection | MVTec AD | AUROC63.5 | 28 | |
| Image-level Anomaly Detection | HeadCT | AUROC46.8 | 24 |