Back to Search Start Over

Learning Multiviewpoint Context-Aware Representation for RGB-D Scene Classification

Authors :
Li Wang
Jian Pu
Yingbin Zheng
Hao Ye
Source :
IEEE Signal Processing Letters. 25:30-34
Publication Year :
2018
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2018.

Abstract

Effective visual representation plays an important role in the scene classification systems. While many existing methods are focused on the generic descriptors extracted from the RGB color channels, we argue the importance of depth context, since scenes are composed with spatial variability and depth is an essential component in understanding the geometry. In this letter, we present a novel depth representation for RGB-D scene classification based on a specific designed convolutional neural network (CNN). Contrast to previous deep models that transfer from pretrained RGB CNN models, we harness model by using the multiviewpoint depth image augmentation to overcome the data scarcity problem. The proposed CNN framework contains the dilated convolutions to expand the receptive field and a subsequent spatial pooling to aggregate multiscale contextual information. The combination of contextual design and multiviewpoint depth images are important toward a more compact representation, compared to directly using original depth images or off-the-shelf networks. Through extensive experiments on SUN RGB-D dataset, we demonstrate that the representation outperforms recent state of the arts, and combining it with standard CNN-based RGB features can lead to further improvements.

Details

ISSN :
15582361 and 10709908
Volume :
25
Database :
OpenAIRE
Journal :
IEEE Signal Processing Letters
Accession number :
edsair.doi...........ac7a743c09a966dcf1b977bd73261629
Full Text :
https://doi.org/10.1109/lsp.2017.2764489