Back to Search Start Over

Learning class-specific pooling shapes for image classification

Authors :
Ronggang Wang
Wenmin Wang
Wen Gao
Jinzhuo Wang
Source :
ICME
Publication Year :
2015
Publisher :
IEEE, 2015.

Abstract

Spatial pyramid (SP) representation is an extension of bag-of-feature model which embeds spatial layout information of local features by pooling feature codes over pre-defined spatial shapes. However, the uniform style of spatial pooling shapes used in standard SP is an ad-hoc manner without theoretical motivation, thus lacking the generalization power to adapt to different distribution of geometric properties across image classes. In this paper, we propose a data-driven approach to adaptively learn class-specific pooling shapes (CSPS). Specifically, we first establish an over-complete set of spatial shapes providing candidates with more flexible geometric patterns. Then the optimal subset for each class is selected by training a linear classifier with structured sparsity constraint and color distribution cues. To further enhance the robust of our model, the representations over CSPS are compressed according to the shape importance and finally fed to SVM with a multi-shape matching kernel for classification task. Experimental results on three challenging datasets (Caltech-256, Scene-15 and Indoor-67) demonstrate the effectiveness of the proposed method on both object and scene images.

Details

Database :
OpenAIRE
Journal :
2015 IEEE International Conference on Multimedia and Expo (ICME)
Accession number :
edsair.doi...........c8c691d78149e7f69bdf1af2315ac91f