Your VAR Model is Secretly an Efficient and Explainable Generative Classifier

About

Generative classifiers, which leverage conditional generative models for classification, have recently demonstrated desirable properties such as robustness to distribution shifts. However, recent progress in this area has been largely driven by diffusion-based models, whose substantial computational cost severely limits scalability. This exclusive focus on diffusion-based methods has also constrained our understanding of generative classifiers. In this work, we propose a novel generative classifier built on recent advances in visual autoregressive (VAR) modeling, which offers a new perspective for studying generative classifiers. To further enhance its performance, we introduce the Adaptive VAR Classifier$^+$ (A-VARC$^+$), which achieves a superior trade-off between accuracy and inference speed, thereby significantly improving practical applicability. Moreover, we show that the VAR-based method exhibits fundamentally different properties from diffusion-based methods. In particular, due to its tractable likelihood, the VAR-based classifier enables visual explainability via token-wise mutual information and demonstrates inherent resistance to catastrophic forgetting in class-incremental learning tasks.
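The idea of classifying with a conditional generative model that has a tractable likelihood can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes some autoregressive model has already produced per-token log-likelihoods log p(x_t | x_<t, y) for every candidate class y; the names `classify` and `tokenwise_scores` are hypothetical. The per-token score shown is one plausible pointwise reading of "token-wise mutual information", and the paper's exact definition may differ.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def uniform_log_prior(num_classes):
    """Log of a uniform class prior (an assumption of this sketch)."""
    return [-math.log(num_classes)] * num_classes


def classify(token_logprobs, log_prior=None):
    """Bayes-rule classification from per-class token log-likelihoods.

    token_logprobs[y][t] is assumed to hold log p(x_t | x_<t, y).
    Returns (predicted class index, posterior p(y | x) as a list).
    """
    num_classes = len(token_logprobs)
    if log_prior is None:
        log_prior = uniform_log_prior(num_classes)
    # log p(x, y) = log p(y) + sum_t log p(x_t | x_<t, y)
    joint = [log_prior[y] + sum(token_logprobs[y]) for y in range(num_classes)]
    log_evidence = logsumexp(joint)  # log p(x)
    posterior = [math.exp(j - log_evidence) for j in joint]
    predicted = max(range(num_classes), key=lambda y: posterior[y])
    return predicted, posterior


def tokenwise_scores(token_logprobs, y, log_prior=None):
    """Illustrative per-token score: log p(x_t | x_<t, y) - log p(x_t | x_<t).

    The class-marginal term is obtained by tracking log p(y', x_<t) for
    every class y'. Summed over t, the scores equal log p(y | x) - log p(y).
    """
    num_classes = len(token_logprobs)
    if log_prior is None:
        log_prior = uniform_log_prior(num_classes)
    running = list(log_prior)  # log p(y', x_<t) for each class y'
    scores = []
    for t in range(len(token_logprobs[y])):
        before = logsumexp(running)
        running = [running[c] + token_logprobs[c][t] for c in range(num_classes)]
        after = logsumexp(running)
        # (after - before) is log p(x_t | x_<t), marginalized over classes
        scores.append(token_logprobs[y][t] - (after - before))
    return scores


# Toy example: 2 classes, 3 tokens, made-up probabilities.
toy = [
    [math.log(0.5), math.log(0.2), math.log(0.9)],
    [math.log(0.5), math.log(0.6), math.log(0.1)],
]
predicted, posterior = classify(toy)  # posterior is approximately [0.75, 0.25]
scores = tokenwise_scores(toy, predicted)
```

Tokens with a large positive score are the ones the predicted class explains better than the class mixture does, which is the kind of signal a token-level explanation map could visualize.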

Yi-Chung Chen, David I. Inouye, Jing Gao • 2025

Related benchmarks

Task                  Dataset                      Metric          Result  Rank
Image Classification  ImageNet A                   Top-1 Acc       10      654
Image Classification  ImageNet V2                  Top-1 Acc       79.3    611
Image Classification  ImageNet-R                   Top-1 Acc       33.1    529
Image Classification  ImageNet-Sketch              Top-1 Accuracy  34      407
Image Classification  ObjectNet                    Top-1 Accuracy  24.51   219
Image Classification  ImageNet-R                   --              --      217
Image Classification  ImageNet A                   --              --      50
Image Classification  ImageNet-C Gaussian Noise    Top-1 Accuracy  15.5    24
Image Classification  ImageNet-C JPEG Corruptions  Top-1 Accuracy  37.7    24
Visual Explanation    ImageNet-100                 Insertion AUC   94.4    6
Showing 10 of 11 rows
