Back to Search Start Over

Dense capsule networks with fewer parameters.

Authors :
Sun, Kun
Wen, Xianbin
Yuan, Liming
Xu, Haixia
Source :
Soft Computing - A Fusion of Foundations, Methodologies & Applications; May2021, Vol. 25 Issue 10, p6927-6945, 19p
Publication Year :
2021

Abstract

The capsule network (CapsNet) is a promising model in computer vision. It has achieved excellent results on MNIST, but it is still slightly insufficient in real images. Deepening capsule architectures is an effective way to improve performance, but the computational cost hinders their development. To overcome parameter growth and build an efficient architecture, this paper proposes a tensor capsule layer based on multistage separable convolutions and a dense capsule architecture. Multistage separable convolutions can effectively reduce the parameters at the cost of a small performance loss. In the dense capsule architecture, the use of dense connections allows the capsule network to be deeper and easier to train. Combining these two can achieve a novel lightweight dense capsule network. Experiments show that this network uses only 0.05% of the parameters of the CapsNet, but the performance is improved by 8.25% on CIFAR10. In addition, the full tensor capsule method is proposed to solve the problem of capsule network parameters changing with image scale. Experiments prove that this method can keep the parameters unchanged while affecting the performance in a small amount. In order to lighten the fully connected capsule layer, a dynamic routing based on separable matrices is proposed. In addition to applying it to our models, this algorithm also compresses the CapsNet by 41.25% while losing only 0.47% performance on CIFAR10. The parameter utilization index is proposed to quantify the relationship between parameters and performance. To our knowledge, this is the first paper to study lightweight capsule network. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14327643
Volume :
25
Issue :
10
Database :
Complementary Index
Journal :
Soft Computing - A Fusion of Foundations, Methodologies & Applications
Publication Type :
Academic Journal
Accession number :
149906171
Full Text :
https://doi.org/10.1007/s00500-021-05774-6