A statistical approach for neural network pruning with application to internet of things

Authors :
Chengchen Mao
Qilian Liang
Chenyun Pan
Ioannis Schizas
Source :
EURASIP Journal on Wireless Communications and Networking, Vol 2023, Iss 1, Pp 1-21 (2023)
Publication Year :
2023
Publisher :
SpringerOpen, 2023.

Abstract

Pruning shows great potential for compressing and accelerating deep neural networks by eliminating redundant parameters. As more terminal chips for internet of things (IoT) devices are integrated with AI accelerators, structured pruning is gaining popularity in edge computing research. Unlike filter pruning and group-wise pruning, stripe-wise pruning (SWP) prunes at the level of stripes within each filter. By introducing a filter skeleton (FS) for each stripe, the existing SWP method sets an absolute threshold on the FS values and removes the stripes whose corresponding FS values do not meet the threshold. Starting from an investigation of the stripe-wise convolution process, we use the statistical properties of the weights located on each stripe to learn the relative importance of the stripes in a filter and remove those with low importance. Our pruned VGG-16 achieves results comparable to existing ones with a fourfold reduction in parameters and only a 0.4% decrease in accuracy. Results from comprehensive experiments on IoT devices are also presented.
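
The idea of ranking stripes within a filter by a statistic of their weights, rather than against one absolute threshold, can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not say which statistic or decision rule is used, so the mean absolute weight per stripe, the `ratio` parameter, and the function names `stripe_scores`, `stripe_mask`, and `apply_mask` are all assumptions made for illustration. A filter is represented as a nested list `filt[c][i][j]` (C input channels, K x K kernel), and a stripe is the set of C weights at one spatial position (i, j).

```python
from statistics import mean


def stripe_scores(filt):
    """Return a K x K grid holding one score per stripe of a single filter.

    Assumption: the score is the mean absolute weight on the stripe.
    The paper's actual statistic is not given in the abstract.
    """
    channels = len(filt)
    k_h, k_w = len(filt[0]), len(filt[0][0])
    scores = []
    for i in range(k_h):
        row = []
        for j in range(k_w):
            stripe = [abs(filt[c][i][j]) for c in range(channels)]
            row.append(mean(stripe))
        scores.append(row)
    return scores


def stripe_mask(filt, ratio=0.5):
    """Keep a stripe if its score is at least `ratio` times the filter's
    mean stripe score. The cutoff is relative to the filter itself, so
    each filter gets its own threshold (hypothetical rule)."""
    scores = stripe_scores(filt)
    flat = [s for row in scores for s in row]
    cutoff = ratio * mean(flat)
    return [[s >= cutoff for s in row] for row in scores]


def apply_mask(filt, mask):
    """Zero out pruned stripes and return (pruned_filter, kept_weight_count)."""
    pruned = [
        [[w if mask[i][j] else 0.0 for j, w in enumerate(row)]
         for i, row in enumerate(channel)]
        for channel in filt
    ]
    kept = sum(m for row in mask for m in row) * len(filt)
    return pruned, kept


# Toy filter: 2 channels, 3 x 3 kernel, weights concentrated in a cross shape.
plane = [[0.0, 0.5, 0.0],
         [0.5, 1.0, 0.5],
         [0.0, 0.5, 0.0]]
toy_filter = [plane, [[-w for w in row] for row in plane]]

toy_mask = stripe_mask(toy_filter, ratio=0.5)
toy_pruned, toy_kept = apply_mask(toy_filter, toy_mask)
```

In this toy case the four corner stripes fall below the per-filter cutoff and are removed, leaving a cross-shaped kernel. A real implementation would operate on framework tensors and fine-tune the network after pruning; both are outside the scope of this sketch.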

Details

Language :
English
ISSN :
1687-1499
Volume :
2023
Issue :
1
Database :
Directory of Open Access Journals
Journal :
EURASIP Journal on Wireless Communications and Networking
Publication Type :
Academic Journal
Accession number :
edsdoj.00e8dd12b3874edcaac45445c95ce54f
Document Type :
article
Full Text :
https://doi.org/10.1186/s13638-023-02254-3