Redundant feature pruning for accelerated inference in deep neural networks.

Authors :: Ayinde BO
Inanc T
Zurada JM
Source :: Neural networks : the official journal of the International Neural Network Society [Neural Netw] 2019 Oct; Vol. 118, pp. 148-158. Date of Electronic Publication: 2019 May 09.
Publication Year :: 2019
Abstract: This paper presents an efficient technique to reduce the inference cost of deep and/or wide convolutional neural network models by pruning redundant features (or filters). Previous studies have shown that over-sized deep neural network models tend to produce a lot of redundant features that are either shifted version of one another or are very similar and show little or no variations, thus resulting in filtering redundancy. We propose to prune these redundant features along with their related feature maps according to their relative cosine distances in the feature space, thus leading to smaller networks with reduced post-training inference computational costs and competitive performance. We empirically show on select models (VGG-16, ResNet-56, ResNet-110, and ResNet-34) and dataset (MNIST Handwritten digits, CIFAR-10, and ImageNet) that inference costs (in FLOPS) can be significantly reduced while overall performance is still competitive with the state-of-the-art.<br /> (Copyright © 2019 Elsevier Ltd. All rights reserved.)

Language :: English
ISSN :: 1879-2782
Volume :: 118
Database :: MEDLINE
Journal :: Neural networks : the official journal of the International Neural Network Society
Publication Type :: Academic Journal
Accession number :: 31279285
Full Text :: https://doi.org/10.1016/j.neunet.2019.04.021