Back to Search
Start Over
A real-time semantic segmentation model using iteratively shared features in multiple sub-encoders.
- Source :
-
Pattern Recognition . Aug2023, Vol. 140, pN.PAG-N.PAG. 1p. - Publication Year :
- 2023
-
Abstract
- • A new yet effective lightweight shared multiple sub-encoders design for feature sharing and feature reusing across the encoder. • A novel Cascading Context Mining (CCM) module which segments the regions with different receptive fields at different spatial dimensions of the input. • A new Hybrid Path Attention Semantic Aggregation (HPA-SA) module for identifying and localizing all the objects in a complex scene accurately. • The proposed model produced the state-of-the-art result on four datasets among the existing semantic models having 2 million parameters or less. Recent studies show a significant growth in semantic segmentation. However, many semantic segmentation models still have a large number of parameters, making them unsuitable for resource-constrained embedded devices. To address this issue, we propose an efficient Shared Feature Reuse Segmentation (SFRSeg) model containing several novelties: a new yet effective shared-branch multiple sub-encoders design, a context mining module and a semantic aggregating module for better context granularity. In particular, our shared-branch approach improves the entire feature hierarchy by sharing the spatial and context knowledge in both shallow and deep branches. After every shared point in each sub-encoder, a proposed cascading context mining (CCM) module is deployed to filter out the noisy spatial details from the feature maps and provides a diverse size of receptive fields for capturing the latent context between multi-scale geometric shapes in the scene. To overcome the gradient vanishing issue at the early stage, we reduce the number of layers in the first sub-encoder and employ a unique multiple sub-encoders design which reprocesses the rich global feature maps through multiple sub-encoders for better feature refinement. Later, the rich semantic features generated by the efficient sub-encoders at different levels are fused by the proposed Hybrid Path Attention Semantic Aggregation (HPA-SA) module that effectively reduces the semantic gap between feature maps at different levels and alleviate the well-known boundary degeneration effect. To make it computationally efficient for resource-constrained embedded devices, a series of lightweight methods such as a lightweight encoder, a squeeze-and-excitation design, separable convolution filters, channel reduction (CR) are carefully exploited. With an exceptional performance on Cityscapes (70.6% test mIoU) and CamVid (74.7% test mIoU) data sets, the proposed model is shown to be superior over existing light real-time semantic segmentation models whilst having only 1.6 million parameters. [ABSTRACT FROM AUTHOR]
- Subjects :
- *CONVOLUTIONAL neural networks
*GEOMETRIC shapes
Subjects
Details
- Language :
- English
- ISSN :
- 00313203
- Volume :
- 140
- Database :
- Academic Search Index
- Journal :
- Pattern Recognition
- Publication Type :
- Academic Journal
- Accession number :
- 163267102
- Full Text :
- https://doi.org/10.1016/j.patcog.2023.109557