Back to Search Start Over

Acoustic scene classification using deep CNN with fine-resolution feature.

Authors :
Zhang, Tao
Liang, Jinhua
Ding, Biyun
Source :
Expert Systems with Applications. Apr2020, Vol. 143, pN.PAG-N.PAG. 1p.
Publication Year :
2020

Abstract

• A new model for acoustic scene classification is proposed. • A relationship between time-frequency resolution and receptive field is modeled. • A fine-resolution feature with semantic information is used for classification. • The proposed method can customize feature representations in various resolutions. • Compared to other deep CNNs, the proposed model reduces computational complexity. Convolutional neural networks with spectrogram feature representation for acoustic scene classification are attracting more and more attentions due to its favorable performance. However, most of the existing methods are still restricted to the tradeoff between the minimum coverage area across time-frequency feature representation, i.e. time-frequency feature resolution, and the depth of CNN models. Thus, it is unfeasible to improve the performance by simply deepening networks. In this paper, fine-resolution convolutional neural network (FRCNN) is proposed to embrace the progress in very deep architecture, feature fusion and convolutional operation. Specifically, lateral construction is applied to generate a fine-resolution feature map with semantic information, and depth-wise separable convolution is utilized to reduce the number of trainable parameters. Extensive experiments demonstrate that the proposed FRCNN exhibits high performance on several metrics, with low computational complexity. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
143
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
140985107
Full Text :
https://doi.org/10.1016/j.eswa.2019.113067