Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GroundingAnomaly: Spatially-Grounded Diffusion for Few-Shot Anomaly Synthesis

About

The performance of visual anomaly inspection in industrial quality control is often constrained by the scarcity of real anomalous samples. Consequently, anomaly synthesis techniques have been developed to enlarge training sets and enhance downstream inspection. However, existing methods either suffer from poor integration caused by inpainting or fail to provide accurate masks. To address these limitations, we propose GroundingAnomaly, a novel few-shot anomaly image generation framework. Our framework introduces a Spatial Conditioning Module that leverages per-pixel semantic maps to enable precise spatial control over the synthesized anomalies. Furthermore, a Gated Self-Attention Module is designed to inject conditioning tokens into a frozen U-Net via gated attention layers. This carefully preserves pretrained priors while ensuring stable few-shot adaptation. Extensive evaluations on the MVTec AD and VisA datasets demonstrate that GroundingAnomaly generates high-quality anomalies and achieves state-of-the-art performance across multiple downstream tasks, including anomaly detection, segmentation, and instance-level detection.

Yishen Liu, Hongcang Chen, Pengcheng Zhao, Yunfan Bao, Yuxi Tian, Jieming Zhang, Hao Chen, Zheng Zhi, Yongchun Liu, Ying Li, Dongpu Cao• 2026

Related benchmarks

TaskDatasetResultRank
Anomaly DetectionVisA--
261
Anomaly SegmentationMVTec AD--
105
Image-level Anomaly DetectionMVTec-AD (test)--
86
Image-level Anomaly DetectionMVTec AD
AUROC98.74
82
Image-level Anomaly DetectionVisA
AUC91.74
80
Image-level Anomaly DetectionVisA (test)
AP (Image)99.6
75
Anomaly GenerationVisA
IC-LPIPS31
62
Pixel-level SegmentationVisA (test)
AUC-P99.8
23
Anomaly SegmentationMVTec AD unified (test)
AUROC98.53
15
Anomaly SegmentationVisA unified (test)
AUROC97.71
15
Showing 10 of 14 rows

Other info

Follow for update