Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Hierarchical Vector Quantized Transformer for Multi-class Unsupervised Anomaly Detection

About

Unsupervised image Anomaly Detection (UAD) aims to learn robust and discriminative representations of normal samples. While separate solutions per class endow expensive computation and limited generalizability, this paper focuses on building a unified framework for multiple classes. Under such a challenging setting, popular reconstruction-based networks with continuous latent representation assumption always suffer from the "identical shortcut" issue, where both normal and abnormal samples can be well recovered and difficult to distinguish. To address this pivotal issue, we propose a hierarchical vector quantized prototype-oriented Transformer under a probabilistic framework. First, instead of learning the continuous representations, we preserve the typical normal patterns as discrete iconic prototypes, and confirm the importance of Vector Quantization in preventing the model from falling into the shortcut. The vector quantized iconic prototype is integrated into the Transformer for reconstruction, such that the abnormal data point is flipped to a normal data point.Second, we investigate an exquisite hierarchical framework to relieve the codebook collapse issue and replenish frail normal patterns. Third, a prototype-oriented optimal transport method is proposed to better regulate the prototypes and hierarchically evaluate the abnormal score. By evaluating on MVTec-AD and VisA datasets, our model surpasses the state-of-the-art alternatives and possesses good interpretability. The code is available at https://github.com/RuiyingLu/HVQ-Trans.

Ruiying Lu, YuJie Wu, Long Tian, Dongsheng Wang, Bo Chen, Xiyang Liu, Ruimin Hu• 2023

Related benchmarks

TaskDatasetResultRank
Anomaly LocalizationMVTec AD--
534
Anomaly DetectionMVTec-AD (test)--
348
Anomaly DetectionVisA
AUROC93.2
293
Anomaly LocalizationMVTec-AD (test)--
211
Anomaly DetectionVisA (test)
I-AUROC88.88
148
Anomaly LocalizationVisA
P-AUROC0.994
127
Anomaly DetectionMPDD (test)
Image-level AU-ROC86.5
104
Anomaly DetectionMVTec AD
Image AUROC96.71
92
Anomaly LocalizationVisA (test)
AUPRO84.51
68
Anomaly DetectionMVTecAD (test)--
55
Showing 10 of 40 rows

Other info

Code

Follow for update