Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Compact Bilinear Pooling

About

Bilinear models has been shown to achieve impressive performance on a wide range of visual tasks, such as semantic segmentation, fine grained recognition and face recognition. However, bilinear features are high dimensional, typically on the order of hundreds of thousands to a few million, which makes them impractical for subsequent analysis. We propose two compact bilinear representations with the same discriminative power as the full bilinear representation but with only a few thousand dimensions. Our compact representations allow back-propagation of classification errors enabling an end-to-end optimization of the visual recognition system. The compact bilinear representations are derived through a novel kernelized analysis of bilinear pooling which provide insights into the discriminative power of bilinear pooling, and a platform for further research in compact pooling methods. Experimentation illustrate the utility of the proposed representations for image classification and few-shot learning across several datasets.

Yang Gao, Oscar Beijbom, Ning Zhang, Trevor Darrell• 2015

Related benchmarks

TaskDatasetResultRank
Image ClassificationDTD
Accuracy84
599
Fine-grained Image ClassificationCUB200 2011 (test)
Accuracy84.3
567
ClassificationCars
Accuracy91.2
492
Fine-grained Image ClassificationStanford Cars (test)
Accuracy91.2
372
Image ClassificationFGVC-Aircraft (test)--
322
Fine-grained Image ClassificationCUB-200 2011
Accuracy84
314
Fine-grained visual classificationFGVC-Aircraft (test)
Top-1 Acc84.1
312
Fine-grained Visual CategorizationStanford Cars (test)
Accuracy91.2
114
Image ClassificationBirds
Accuracy84.3
75
ClassificationAirplane
Accuracy84.1
47
Showing 10 of 15 rows

Other info

Follow for update