Back to Search Start Over

Learning Hierarchical Image Representation with Sparsity, Saliency and Locality

Authors :
Ming-Hsuan Yang
Jimei Yang
Source :
BMVC
Publication Year :
2011
Publisher :
British Machine Vision Association, 2011.

Abstract

This paper presents a deep learning model of building up hierarchical image representation. Each layer of hierarchy consists of three components: sparse coding, saliency pooling and local grouping. With sparse coding we identify distinctive coefficients for representing raw features of each lower layer; saliency pooling helps suppress noise and enhance translation invariance of sparse representation; we group locally pooled sparse codes to form more complex representations. Instead of using hand-crafted descriptors, our model learns an effective image representation directly from images in a unsupervised data-driven manner. We evaluate our algorithm with several benchmark databases of object recognition and analyze the contributions of different components. Experimental results show that our algorithm performs favorably against the state-of-the-art methods.

Details

Database :
OpenAIRE
Journal :
Procedings of the British Machine Vision Conference 2011
Accession number :
edsair.doi...........7b0ef12b99391c41dc4d99c9cd80a143