Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adaptive receptive field-based spatial-frequency feature reconstruction network for few-shot fine-grained image classification

About

Feature reconstruction techniques are widely applied for few-shot fine-grained image classification (FSFGIC). Our research indicates that one of the main challenges facing existing feature-based FSFGIC methods is how to choose the size of the receptive field to extract feature descriptors (including spatial and frequency feature descriptors) from different category input images, thereby better performing the FSFGIC tasks. To address this, an adaptive receptive field-based spatial-frequency feature reconstruction network (ARF-SFR-Net) is proposed. The designed ARF-SFR-Net has the capability to adaptively determine receptive field sizes for obtaining spatial and frequency features, and effectively fuse them for reconstruction and FSFGIC tasks. The designed ARF-SFR-Net can be easily embedded into a given episodic training mechanism for end-to-end training from scratch. Extensive experiments on multiple FSFGIC benchmarks demonstrate the effectiveness and superiority of the proposed ARF-SFR-Net over state-of-the-art approaches. The code is available at: https://github.com/ICL-SUST/ARF-SFR-Net.git.

Linyue Zhang, Wenyi Zeng, Zicheng Pan, Yongsheng Gao, Changming Sun, Jun Hu, Lixian Liu, Weichuan Zhang, Tuo Wang• 2026

Related benchmarks

TaskDatasetResultRank
Few-shot classificationCUB-200 2011
5-way 1-shot Acc86.96
66
Fine-grained 5-way classificationStanford Cars
1-shot Accuracy92.62
40
Fine-grained 5-way classificationStanford Dogs
1-shot Acc80.93
37
5-way Image Classificationmeta-iNat
1-shot Accuracy (5-way)76.11
14
5-way Image Classificationmeta-iNat tiered
1-Shot Accuracy53.95
14
Showing 5 of 5 rows

Other info

Follow for update