SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning

About

Generalized Category Discovery (GCD) aims to classify unlabelled images from both 'seen' and 'unseen' classes by transferring knowledge from a set of labelled 'seen' class images. A key theme in existing GCD approaches is adapting large-scale pre-trained models for the GCD task. An alternative perspective, however, is to adapt the data representation itself for better alignment with the pre-trained model. As such, in this paper, we introduce a two-stage adaptation approach termed SPTNet, which iteratively optimizes model parameters (i.e., model fine-tuning) and data parameters (i.e., prompt learning). Furthermore, we propose a novel spatial prompt tuning method (SPT) which considers the spatial property of image data, enabling the method to better focus on object parts, which can transfer between seen and unseen classes. We thoroughly evaluate SPTNet on standard benchmarks and demonstrate that it outperforms existing GCD methods. Notably, our method achieves an average accuracy of 61.4% on the Semantic Shift Benchmark (SSB), surpassing prior state-of-the-art methods by approximately 10%. The improvement is particularly remarkable because our method introduces extra parameters amounting to only 0.117% of those in the backbone architecture. Project page: https://visual-ai.github.io/sptnet.
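The alternating scheme described in the abstract (freeze one group of parameters while updating the other, then swap) can be illustrated with a toy sketch. This is not the SPTNet implementation: the scalar model `w * (x + p)`, the squared-error loss, the stage order, and all hyperparameters below are assumptions chosen only to keep the example self-contained. Here `w` stands in for the model parameters and `p` for a learnable prompt added to the input.

```python
def loss_and_grads(w, p, data):
    """Mean squared error of the toy model w * (x + p), plus its gradients."""
    n = len(data)
    loss = grad_w = grad_p = 0.0
    for x, y in data:
        err = w * (x + p) - y
        loss += err * err / n
        grad_w += 2.0 * err * (x + p) / n  # d(loss)/dw
        grad_p += 2.0 * err * w / n        # d(loss)/dp
    return loss, grad_w, grad_p


def alternating_fit(data, rounds=30, steps_per_stage=20, lr=0.05):
    """Alternate between updating the prompt p and the model weight w."""
    w, p = 1.0, 0.0  # hypothetical initial values
    history = []
    for _ in range(rounds):
        # Stage A (assumed order): update the data parameter p, model frozen.
        for _ in range(steps_per_stage):
            _, _, grad_p = loss_and_grads(w, p, data)
            p -= lr * grad_p
        # Stage B: update the model parameter w, prompt frozen.
        for _ in range(steps_per_stage):
            _, grad_w, _ = loss_and_grads(w, p, data)
            w -= lr * grad_w
        history.append(loss_and_grads(w, p, data)[0])
    return w, p, history


# Synthetic targets generated with w = 2 and p = 1 (toy data, not from the paper).
data = [(x, 2.0 * (x + 1.0)) for x in (-2.0, -1.0, 1.0, 2.0)]
w, p, history = alternating_fit(data)
print(f"w={w:.4f}, p={p:.4f}, final loss={history[-1]:.2e}")
```

In this toy setting neither stage alone can fit the data from the starting point, but alternating the two drives the loss down round by round. In a deep-learning framework the same pattern is usually expressed by toggling which parameter group receives gradient updates in each stage.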

Hongjun Wang, Sagar Vaze, Kai Han • 2024

Related benchmarks

Task	Dataset	Metric	Result
Generalized Category Discovery	CIFAR-100	Accuracy (All)	89	268
Generalized Category Discovery	ImageNet-100	All Accuracy	90.1	252
Generalized Category Discovery	Stanford Cars	Accuracy (All)	72.3	228
Generalized Category Discovery	CUB	Accuracy (All)	76.3	186
Generalized Category Discovery	CIFAR-10	All Accuracy	98.9	152
Generalized Category Discovery	FGVC Aircraft	Accuracy (All)	59.3	115
Generalized Category Discovery	CUB-200 (test)	Overall Accuracy	65.8	81
Generalized Category Discovery	Herbarium19	Score (All Categories)	43.4	71
Generalized Category Discovery	Herbarium19 (test)	Score (All Categories)	43.4	52
Generalized Category Discovery	Aircraft (test)	Accuracy (All)	59.3	38

