Interpret Your Decision: Logical Reasoning Regularization for Generalization in Visual Classification

About

Vision models excel in image classification but struggle to generalize to unseen data, such as classifying images from unseen domains or discovering novel categories. In this paper, we explore the relationship between logical reasoning and deep learning generalization in visual classification. A logical regularization termed L-Reg is derived which bridges a logical analysis framework to image classification. Our work reveals that L-Reg reduces the complexity of the model in terms of the feature distribution and classifier weights. Specifically, we unveil the interpretability brought by L-Reg, as it enables the model to extract the salient features, such as faces to persons, for classification. Theoretical analysis and experiments demonstrate that L-Reg enhances generalization across various scenarios, including multi-domain generalization and generalized category discovery. In complex real-world scenarios where images span unknown classes and unseen domains, L-Reg consistently improves generalization, highlighting its practical efficacy.

Zhaorui Tan, Xi Yang, Qiufeng Wang, Anh Nguyen, Kaizhu Huang• 2024

Related benchmarks

Task	Dataset	Result
Generalized Category Discovery	CIFAR-100	Accuracy (All)80.8	268
Generalized Category Discovery	ImageNet-100	All Accuracy83.4	252
Generalized Category Discovery	Stanford Cars	Accuracy (All)44.8	228
Generalized Category Discovery	CIFAR-10	All Accuracy94.8	152
Generalized Category Discovery	CUB-200 (test)	Overall Accuracy65.3	81
Generalized Category Discovery	Herbarium19	Score (All Categories)43.7	71
Image Classification	OfficeHome DomainBed suite (test)	Accuracy80.9	45
Image Classification	DomainBed v1.0 (test)	Average Accuracy55.3	36
Image Classification	Terra-Incognita (test)	Accuracy62.9	25
Image Classification	PACS DomainBed suite (test)	Accuracy97.4	20

Showing 10 of 13 rows

Other info

Code

Follow for update

@wizwand_team Discord