Redundant feature pruning for accelerated inference in deep neural networks.
- Source :
- Neural networks : the official journal of the International Neural Network Society [Neural Netw], 2019 Oct; Vol. 118, pp. 148-158. Date of Electronic Publication: 2019 May 09.
- Publication Year :
- 2019
Abstract
- This paper presents an efficient technique to reduce the inference cost of deep and/or wide convolutional neural network models by pruning redundant features (or filters). Previous studies have shown that over-sized deep neural network models tend to produce many redundant features that are either shifted versions of one another or are very similar and show little or no variation, resulting in filtering redundancy. We propose to prune these redundant features, along with their related feature maps, according to their relative cosine distances in the feature space, leading to smaller networks with reduced post-training inference computational costs and competitive performance. We empirically show on select models (VGG-16, ResNet-56, ResNet-110, and ResNet-34) and datasets (MNIST handwritten digits, CIFAR-10, and ImageNet) that inference costs (in FLOPS) can be significantly reduced while overall performance remains competitive with the state of the art.
(Copyright © 2019 Elsevier Ltd. All rights reserved.)
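The abstract does not spell out the full algorithm, but the core idea of pruning filters by their relative cosine distances can be illustrated with a minimal sketch. The snippet below is an assumed Python reading of that idea, not the authors' exact method: `select_filters` and the `min_distance` threshold are hypothetical names, and the greedy keep-or-prune pass is one plausible way to apply a cosine-distance criterion to a convolutional layer's kernel tensor.

```python
# Illustrative sketch (assumption, not the paper's exact algorithm):
# greedily keep a filter only if its cosine distance to every
# already-kept filter exceeds a threshold; the rest are redundant.
import numpy as np

def select_filters(weights: np.ndarray, min_distance: float = 0.1) -> list[int]:
    """Return indices of filters to keep.

    weights: conv-layer kernel tensor of shape
             (num_filters, in_channels, k, k).
    min_distance: hypothetical pruning threshold; a filter is pruned
                  when its cosine distance (1 - cosine similarity) to
                  an already-kept filter falls below this value.
    """
    flat = weights.reshape(weights.shape[0], -1)
    # Normalize each flattened filter so a dot product gives cosine similarity.
    norms = np.linalg.norm(flat, axis=1, keepdims=True)
    unit = flat / np.clip(norms, 1e-12, None)

    keep: list[int] = []
    for i in range(unit.shape[0]):
        # Keep filter i only if it is far (in cosine distance)
        # from every filter kept so far.
        if all(1.0 - float(unit[i] @ unit[j]) >= min_distance for j in keep):
            keep.append(i)
    return keep

# Example: 64 random 3x3 filters over 16 input channels.
rng = np.random.default_rng(0)
kernels = rng.normal(size=(64, 16, 3, 3))
kept = select_filters(kernels, min_distance=0.1)
print(f"kept {len(kept)} of {kernels.shape[0]} filters")
```

In a real network the corresponding feature maps of pruned filters, and the matching input channels of the next layer, would also be removed, which is what yields the FLOPS reduction the abstract reports.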
- Subjects :
- Humans
- Deep Learning/trends
- Neural Networks, Computer
Details
- Language :
- English
- ISSN :
- 1879-2782
- Volume :
- 118
- Database :
- MEDLINE
- Journal :
- Neural networks : the official journal of the International Neural Network Society
- Publication Type :
- Academic Journal
- Accession number :
- 31279285
- Full Text :
- https://doi.org/10.1016/j.neunet.2019.04.021